{"id":1308,"date":"2011-10-06T13:07:48","date_gmt":"2011-10-06T12:07:48","guid":{"rendered":"http:\/\/www.meanboyfriend.com\/overdue_ideas\/2011\/10\/upscaling-digitisation-at-the-wellcome-library\/"},"modified":"2011-10-07T08:51:01","modified_gmt":"2011-10-07T07:51:01","slug":"upscaling-digitisation-at-the-wellcome-library","status":"publish","type":"post","link":"http:\/\/www.meanboyfriend.com\/overdue_ideas\/2011\/10\/upscaling-digitisation-at-the-wellcome-library\/","title":{"rendered":"Upscaling digitisation at the Wellcome Library"},"content":{"rendered":"<p>Wellcome Library &#8211; part of the Wellcome Trust, a charitable foundation which funds research and includes research\/contextualisation etc of medical history<\/p>\n<p>Wellcome Library has a lot of unique content &#8211; which is the focus of their digitisation efforts. Story so far:<\/p>\n<p>Image library created from transparencies\/prints &#8211; and on-demand photography &#8211; 300,000 images<br \/>\nJournal backfile digitisation<br \/>\nWellcome Film &#8211; 500+ titles<br \/>\nAIDS poster projects<br \/>\nArabic manuscripts &#8211; 500 manuscripts (probably biggest single project)<br \/>\n17th Century recipe books<\/p>\n<p>Contribute to Europeana<\/p>\n<p>Digitisation is part of the long-term strategy for the library &#8211; but while the aim is eventually to digitise everything, need to target content.<\/p>\n<p>Digitising archival material, around 2000 books 1850-1990 (pilot project &#8211; and of course will test waters in copyright areas). Also contributing to Early European Books project &#8211; commercial partnership with ProQuest.<\/p>\n<p>Approach to digitisation projects has changed. Previously did smaller (&lt;10,000 pages) projects, relatively ad hoc, entirely open access, library centric, no major IT investment &#8211; but now doing large projects (&gt;100,000 pages) with involvement from a wider range of stakeholders &#8211; within and outside the organisation, needs major IT development. 
Also, increasing commercial partnerships mean not all outputs will be &#8216;open access&#8217; &#8211; although the feeling is that this covers additional material that would not have been digitised otherwise&#8230;<\/p>\n<p>Need to move<\/p>\n<ul style=\"list-style-type: disc\">\n<li>Manual processes -&gt; Automated processes (where possible)<\/li>\n<li>Centralised conservation -&gt; distributed conservation<\/li>\n<li>Low QA -&gt; increased QA, error minimisation<\/li>\n<li>Using TIFF -&gt; JPEG 2000 (now 100% JPEG 2000 after digital copy created)<\/li>\n<li>From detailed and painstaking to streamlined and pragmatic<\/li>\n<\/ul>\n<p>Streamlining:<\/p>\n<ul style=\"list-style-type: disc\">\n<li>Staff dedicated to specific projects or streams of work<\/li>\n<li>Carry out sample workflow tests for new types of material<\/li>\n<li>Right equipment for right job &#8211; eliminate the &#8216;fiddly bits&#8217; &#8211; led to:<\/li>\n<li>Live-view monitors<\/li>\n<li>Easy-clean surfaces<\/li>\n<li>Foot-pedals<\/li>\n<li>&#8230;<\/li>\n<li><\/li>\n<li>Photographers do the photography<\/li>\n<li>Prepare materials separately<\/li>\n<li>Leave loose pages and bindings as they are &#8211; easier to digitise that way<\/li>\n<li>Use existing staff as support<\/li>\n<li>Minimise movement<\/li>\n<li>Plenty of shelving and working space<\/li>\n<li>Find preferred supplier for ad hoc support<\/li>\n<\/ul>\n<p>Upscaling and streamlining digitisation requires a higher level of project management<\/p>\n<p>Goobi <a href=\"http:\/\/www.goobi.org\/\">http:\/\/www.goobi.org\/<\/a>:<br \/>\nWeb-based workflow system<br \/>\nOpen source (core system)<br \/>\nUsed by many libraries in Germany<br \/>\nWellcome use the Intranda version (Intranda is a company that develops Goobi)<\/p>\n<p>Goobi is task-focused, with customisable workflows &#8211; developed specifically by Intranda<br \/>\nUser-specific dashboard<br \/>\nImport\/export and store metadata<br \/>\nEncode data as METS<br \/>\nDisplay progress of tasks, 
stats on activities<br \/>\nTracks projects, batches and units<br \/>\nCan call other systems &#8211; e.g. ingest or OCR<\/p>\n<p>Q: Is Goobi scalable? Can it be used for very big projects?<br \/>\nA: Goobi works well for small institutions &#8211; don&#8217;t need programmers to implement it, and it is relatively cheap. But scalability is probably going to be limited by hardware rather than anything else<\/p>\n<p>Q: How does the Intranda version differ from other versions of Goobi?<br \/>\nA: At least at Wellcome &#8230; e.g. Goobi doesn&#8217;t handle &#8216;batches&#8217; of material &#8211; Intranda added this. Goobi uses Z39.50 to get metadata; Wellcome wanted to get metadata from elsewhere, so Intranda adjusted it to do that<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Wellcome Library &#8211; part of the Wellcome Trust, a charitable foundation which funds research and includes research\/contextualisation etc of medical history Wellcome Library has a lot of unique content &#8211; which is the focus of their digitisation efforts. 
Story so far: Image library created from transparencies\/prints &#8211; and on demand photography &#8211; 300,000 images Journal backfiles [&hellip;]<\/p>\n","protected":false},"author":2,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[79,78],"class_list":["post-1308","post","type-post","status-publish","format-standard","hentry","category-uncategorized","tag-libeb11","tag-liber-eblida"],"_links":{"self":[{"href":"http:\/\/www.meanboyfriend.com\/overdue_ideas\/wp-json\/wp\/v2\/posts\/1308","targetHints":{"allow":["GET"]}}],"collection":[{"href":"http:\/\/www.meanboyfriend.com\/overdue_ideas\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"http:\/\/www.meanboyfriend.com\/overdue_ideas\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"http:\/\/www.meanboyfriend.com\/overdue_ideas\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"http:\/\/www.meanboyfriend.com\/overdue_ideas\/wp-json\/wp\/v2\/comments?post=1308"}],"version-history":[{"count":1,"href":"http:\/\/www.meanboyfriend.com\/overdue_ideas\/wp-json\/wp\/v2\/posts\/1308\/revisions"}],"predecessor-version":[{"id":1313,"href":"http:\/\/www.meanboyfriend.com\/overdue_ideas\/wp-json\/wp\/v2\/posts\/1308\/revisions\/1313"}],"wp:attachment":[{"href":"http:\/\/www.meanboyfriend.com\/overdue_ideas\/wp-json\/wp\/v2\/media?parent=1308"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"http:\/\/www.meanboyfriend.com\/overdue_ideas\/wp-json\/wp\/v2\/categories?post=1308"},{"taxonomy":"post_tag","embeddable":true,"href":"http:\/\/www.meanboyfriend.com\/overdue_ideas\/wp-json\/wp\/v2\/tags?post=1308"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}