{"id":63,"date":"2008-07-08T07:19:52","date_gmt":"2008-07-08T14:19:52","guid":{"rendered":"http:\/\/www.meanboyfriend.com\/overdue_ideas\/?p=63"},"modified":"2008-07-08T07:19:52","modified_gmt":"2008-07-08T14:19:52","slug":"digital-preservation-challenges-planning-and-implementing-solutions-for-scientific-publishing","status":"publish","type":"post","link":"http:\/\/www.meanboyfriend.com\/overdue_ideas\/2008\/07\/digital-preservation-challenges-planning-and-implementing-solutions-for-scientific-publishing\/","title":{"rendered":"Digital Preservation Challenges: planning and implementing solutions for scientific publishing"},"content":{"rendered":"<p>This talk by Dr Andreas Rauber (as an aside, it is great to see some academics here, as opposed to librarians &#8211; although quite a few of them and publishers here as well) from Vienna University of Technology (in the Dept of Software Technology and Interactive systems)<\/p>\n<p>Andreas starting with &#8216;what is digital preservation?&#8217;, then going to cover preservation planning and a tool called &#8216;Plato&#8217; &#8211; a preservation planning tool.<\/p>\n<p>So &#8211; why do we need digital preservation?<\/p>\n<p>Basic issue of &#8216;keeping the bits alive&#8217; &#8211; but this is not really digital preservation. We know a lot about this kind of work, and it can be a lot of work, but a bottom line, can be done.<\/p>\n<p>However, maintaining the bits is just a small part of the problem. Digital Objects require specific environment to be accessible &#8211; files need specific programs, proggrams need specific operating systems, and operating systems need specific hardware components.<\/p>\n<p>Software and Hardware environment is not stable &#8211; you encounter issues where:<\/p>\n<ul>\n<li>Finels cannot be opened anymore\n<li>Embedded objects are not longer accessible\/linked\n<li>Programs won&#8217;t run\n<li>Information in digital form is lost &#8211; usually completely failure rather than gradual degradation<\/li>\n<\/ul>\n<p>Strategies for Digital Preservation (using <a href=\"http:\/\/unesdoc.unesco.org\/images\/0013\/001300.130071e.pdf\">http:\/\/unesdoc.unesco.org\/images\/0013\/001300.130071e.pdf<\/a>) for categories:<\/p>\n<ul>\n<li>Short term\n<li>Medium term\n<li>etc.<\/li>\n<\/ul>\n<p>Andreas going to look at two approaches:<\/p>\n<p><strong>Migration<\/strong><\/p>\n<ul>\n<li>Transformation into different format<\/li>\n<\/ul>\n<p>Usually get some changes in transformation &#8211; if you do this several times, will have &#8216;damage&#8217; to the digital object<\/p>\n<p><strong>Emulation<\/strong><\/p>\n<p>Emulation of h\/w or s\/w<\/p>\n<p>Both advantage and disadvantage that object is rendered identically &#8211; you can access the object, but you may not know how to use the interface.<\/p>\n<p>Looking specifically at Scientific Publishing &#8211; what are you trying to preserve?<\/p>\n<ul>\n<li>The publication\n<li>Context of the publication\n<li>Adjunct material (slides, notes, videos)\n<li>Demos, exercises, interactive elements\n<li>Data sets and simulations\n<li>Community aspects &#8211; discussion etc.\n<li>&#8230;<\/li>\n<\/ul>\n<p>So &#8211; Digital Preservation is <strong><em>complex<\/em><\/strong><\/p>\n<p>You need to under both the object, and its use and context.<\/p>\n<p>So &#8211; &#8216;Preservation Planning&#8217;&#8230;<\/p>\n<p>There are many different strategies &#8211; how do you know which one is most suitable &#8211; and how do you know if you&#8217;ve been successful 10\/20\/50 etc. years later?<\/p>\n<p>As part of the DELOS DP Cluster here was a workflow developed, which has now been refined and integrated within PLANETS. It is based on the &#8216;utility analysis&#8217; approach developed in Vienna.<\/p>\n<p>Plato is a tool which helps with preservation planning &#8211; you need to:<\/p>\n<ul>\n<li>Define requirements (requires detailed analysis of what you want and what is important &#8211; for e.g. for a web page is the appearance of the hyperlinks important, or just the target information; if there is a web counter is it preserved at a specific date, does it count hits on the archived copy, does it continue to count hits on the &#8216;live&#8217; copy? etc.)\n<li>Evaluate alternatives (including not to draw up preservation plan if you want)\n<li>Consider results\n<li>Build preservation plan<\/li>\n<\/ul>\n<p>All this looks interesting but suggests that this is going to be an incredibly expensive process (even to do the preservation planning, nevermind the actual preservation). This drives it home &#8211; we need to be good at deciding what is worth preserving in the medium\/long term &#8211; and only embark on this kind of exercise where we know we want to do the preservation.<\/p>\n<p>Plato is a &#8216;concretization&#8217; (is that a word?) of the OAIS model, which follows recommendations of TRAC and nestor &#8211; it is a pretty generic workflow, so should be easy to integrate it into different settings.<\/p>\n<p>In a case study of electronic theses, found that for these Plain text doesn&#8217;t satisfy several minimum requirements, RTF is weak in Appearance and Structure, and that the deactiviation of scripting and security are knock-out criterium (for PDF)<\/p>\n<p>Andreas stressing the key role of the the &#8216;defining requirements&#8217; stage &#8211; this is the point at which people start identifying what is important, and you can start to see cost vs. benefit<\/p>\n<p><a href=\"http:\/\/www.ifs.tuwien.ac.at\/dp\">http:\/\/www.ifs.tuwien.ac.at\/dp<\/a> <\/p>\n<p><a href=\"http:\/\/www.ifs.tuwien.ac.at\/dp\/plato\">http:\/\/www.ifs.tuwien.ac.at\/dp\/plato<\/a> <\/p>\n<p>Some conferences coming up on Digital Preservation including one at the British Library on 29th July.<\/p>\n<p>Q: Who should take responsibility?<\/p>\n<p>A: Need people from the &#8216;user&#8217; side who at least know what they want, also need skills in IT, and input from Management on cost etc.<\/p>\n<p>Once there are a number of examples of needs analysis of &#8216;type&#8217; of material &#8211; e.g. e-theses, they can consolidate into a shareable template &#8211; however, need a number of studies first to capture wide range of requirements, rather than finding requirements from first study results in others narrowing their view down to whatever the first institution identified.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>This talk by Dr Andreas Rauber (as an aside, it is great to see some academics here, as opposed to librarians &#8211; although quite a few of them and publishers here as well) from Vienna University of Technology (in the Dept of Software Technology and Interactive systems) Andreas starting with &#8216;what is digital preservation?&#8217;, then [&hellip;]<\/p>\n","protected":false},"author":2,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[],"class_list":["post-63","post","type-post","status-publish","format-standard","hentry","category-uncategorized"],"_links":{"self":[{"href":"http:\/\/www.meanboyfriend.com\/overdue_ideas\/wp-json\/wp\/v2\/posts\/63","targetHints":{"allow":["GET"]}}],"collection":[{"href":"http:\/\/www.meanboyfriend.com\/overdue_ideas\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"http:\/\/www.meanboyfriend.com\/overdue_ideas\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"http:\/\/www.meanboyfriend.com\/overdue_ideas\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"http:\/\/www.meanboyfriend.com\/overdue_ideas\/wp-json\/wp\/v2\/comments?post=63"}],"version-history":[{"count":0,"href":"http:\/\/www.meanboyfriend.com\/overdue_ideas\/wp-json\/wp\/v2\/posts\/63\/revisions"}],"wp:attachment":[{"href":"http:\/\/www.meanboyfriend.com\/overdue_ideas\/wp-json\/wp\/v2\/media?parent=63"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"http:\/\/www.meanboyfriend.com\/overdue_ideas\/wp-json\/wp\/v2\/categories?post=63"},{"taxonomy":"post_tag","embeddable":true,"href":"http:\/\/www.meanboyfriend.com\/overdue_ideas\/wp-json\/wp\/v2\/tags?post=63"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}