{"id":1293,"date":"2011-10-06T08:23:27","date_gmt":"2011-10-06T07:23:27","guid":{"rendered":"http:\/\/www.meanboyfriend.com\/overdue_ideas\/2011\/10\/impact-centre-of-competence-in-text-digitisation\/"},"modified":"2011-10-06T09:00:04","modified_gmt":"2011-10-06T08:00:04","slug":"impact-centre-of-competence-in-text-digitisation","status":"publish","type":"post","link":"http:\/\/www.meanboyfriend.com\/overdue_ideas\/2011\/10\/impact-centre-of-competence-in-text-digitisation\/","title":{"rendered":"IMPACT: Centre of Competence in Text Digitisation"},"content":{"rendered":"<p>For the next two days I&#8217;m at the 3rd LIBER-EBLIDA Workshop on Digitization of Library Material in Europe. I&#8217;m here because I&#8217;m speaking later today about the JISC Guide to Open Bibliographic Data which I co-authored, but around all that there is a very interesting programme.<\/p>\n<p>First up this morning is Hildelies Balk on the IMPACT project &#8211; <a href=\"http:\/\/www.impact-project.eu\/news\/coc\/\">http:\/\/www.impact-project.eu\/news\/coc\/<\/a>. This project is trying to tackle the issues related to OCR of digitised historical texts. The main achievements of IMPAC so far:<\/p>\n<p>Improved commercial OCR (ABBYY &#8216;IMPACT&#8217; Finereader 10 on market)<br \/>\nEffective tool for OCR correction with volunteer involvement (IBM CONCERT) ready for implementation<br \/>\nNovel approaches to preprocessing, OCR and post-correction available<br \/>\nComputer lexica for 9 languages close to delivery<br \/>\nDigitisation Framework with evaluation tools available<br \/>\nFacility to plug in other tools (if you have tools you can integrate)<br \/>\nLarge dataset with sophisticated &#8216;ground truth&#8217; close to final delivery<br \/>\nUnique network of expertise<br \/>\n&#8230;.<\/p>\n<p>Challenges in digitisation of historic material still there &#8211; there is no lak of novel approaches to improve access &#8211; both within IMPACT and many other projects<br \/>\nThe challenge is translating from these novel approaches to real life implementation &#8211; many of the developments do not integrate into library workflows well<br \/>\nWhere next? Direction needed for work &#8211; e.g. should we really be investing in mass re-keying of content?<\/p>\n<p>To sustain IMPACT, they need to have a Business Model which would keep the centre running after the end of the current EU funding. IMPACT have done workshops throughout the project &#8211; covering all levels of staff. Used approach described is <a href=\"http:\/\/www.businessmodelgeneration.com\">http:\/\/www.businessmodelgeneration.com<\/a>.<\/p>\n<p>First questions they tackled &#8211; what is the value proposition and what are the customer segments?<\/p>\n<p>Major customer segment &#8211; the &#8216;service providers&#8217; (presumably companies like Proquest? &#8211; not clear). IMPACT has all major content holders in the consortium &#8211; so clear value proposition &#8211; access to the content holders through single route<\/p>\n<p>Another major customer segment &#8211; the content holders. Ideas proposed included mediating consultancy between content holders and others with expertise.<\/p>\n<p>So these ideas discussed, and of course moved onto other parts of the business model. Often find people move to the &#8216;rational&#8217; side of the model quickly &#8211; e.g. people often focus on costs before other issues sorted out.<\/p>\n<p>Outcomes:<\/p>\n<p>Centre of Competence &#8211; benefits for content holders:<br \/>\nExchange of best practice in ocmmunity of content holders<br \/>\nKnowledgeBank with comprehensive and up to date information and tech watch reports<br \/>\nTraining on demand and online tutorial<br \/>\nOnline support thtrough a helpdesk<br \/>\nSupport in the implementation of the innovateive IMPACT solution for imrpoving access to text<br \/>\nAccess to the IMPACT dataset with &#8216;groudn truth&#8217; and tools for evaluation<br \/>\nDigitisation framework &#8211; guidelines of using the open source workflow management system Tavernana<br \/>\nLanguage resources<br \/>\nand more!<\/p>\n<p>Three levels of membership:<br \/>\nOpen &#8211; access to forum &#8211; part of content<br \/>\nBasic membership (fee) &#8211; access to all facilities, reduced fee for conferences<br \/>\nPremium membership (fee) &#8211; member of the board, privileges such as free entry to conferences<\/p>\n<p>Follow IMPAC on twitter <a href=\"@impactocr\">http:\/\/twitter.com\/impactocr<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>For the next two days I&#8217;m at the 3rd LIBER-EBLIDA Workshop on Digitization of Library Material in Europe. I&#8217;m here because I&#8217;m speaking later today about the JISC Guide to Open Bibliographic Data which I co-authored, but around all that there is a very interesting programme. First up this morning is Hildelies Balk on the [&hellip;]<\/p>\n","protected":false},"author":2,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[79,78],"class_list":["post-1293","post","type-post","status-publish","format-standard","hentry","category-uncategorized","tag-libeb11","tag-liber-eblida"],"_links":{"self":[{"href":"http:\/\/www.meanboyfriend.com\/overdue_ideas\/wp-json\/wp\/v2\/posts\/1293","targetHints":{"allow":["GET"]}}],"collection":[{"href":"http:\/\/www.meanboyfriend.com\/overdue_ideas\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"http:\/\/www.meanboyfriend.com\/overdue_ideas\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"http:\/\/www.meanboyfriend.com\/overdue_ideas\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"http:\/\/www.meanboyfriend.com\/overdue_ideas\/wp-json\/wp\/v2\/comments?post=1293"}],"version-history":[{"count":3,"href":"http:\/\/www.meanboyfriend.com\/overdue_ideas\/wp-json\/wp\/v2\/posts\/1293\/revisions"}],"predecessor-version":[{"id":1296,"href":"http:\/\/www.meanboyfriend.com\/overdue_ideas\/wp-json\/wp\/v2\/posts\/1293\/revisions\/1296"}],"wp:attachment":[{"href":"http:\/\/www.meanboyfriend.com\/overdue_ideas\/wp-json\/wp\/v2\/media?parent=1293"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"http:\/\/www.meanboyfriend.com\/overdue_ideas\/wp-json\/wp\/v2\/categories?post=1293"},{"taxonomy":"post_tag","embeddable":true,"href":"http:\/\/www.meanboyfriend.com\/overdue_ideas\/wp-json\/wp\/v2\/tags?post=1293"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}