{"id":1682,"date":"2014-11-03T17:26:40","date_gmt":"2014-11-03T16:26:40","guid":{"rendered":"http:\/\/www.meanboyfriend.com\/overdue_ideas\/?p=1682"},"modified":"2014-11-03T17:48:51","modified_gmt":"2014-11-03T16:48:51","slug":"palimpsest-an-edinburgh-literary-cityscape","status":"publish","type":"post","link":"http:\/\/www.meanboyfriend.com\/overdue_ideas\/2014\/11\/palimpsest-an-edinburgh-literary-cityscape\/","title":{"rendered":"Palimpsest: An Edinburgh Literary Cityscape"},"content":{"rendered":"<p>This blog post was written during a presentation at the <a href=\"http:\/\/britishlibrary.typepad.co.uk\/digital-scholarship\/2014\/10\/british-library-labs-symposium-2014.html\">British Library Labs Symposium<\/a> in November 2014. It is likely full of errors and omissions having been written real-time.<\/p>\n<p>Dr Beatrice Alex, University of Edinburgh<\/p>\n<p>Looking for mentions of places in Edinburgh using data sources including:<br \/>\n* HathiTrust<br \/>\n* British Library Nineteenth Century Books Collection (main source)<br \/>\n* Project Gutenberg<br \/>\n* Oxford Text Archive data<\/p>\n<p>Interested in using EEBO\/ECCO<\/p>\n<p>Workflow:<br \/>\n* Digitised documents from collections above<br \/>\n* Document retrieval and filtering -&gt; to get ranked lists of Edinburgh-specific candidates<br \/>\n* Manual curation &#8211; curation of Edinburgh-specific literature &#8211; need a human in the loop to get the level of detail they desired<br \/>\n* Text mining &#8211; fine-grained location extraction and geo-referencing using the Edinburgh Geoparser<br \/>\n* All data stored in a database that then powers the visualisations etc.<\/p>\n<p>Big data IN -&gt; Small data OUT<\/p>\n<p>All input documents must first be:<br \/>\n* Converted to a common format<br \/>\n* Identified as written English text<br \/>\n* Post-corrected automatically if necessary<br \/>\n* Linguistically pre-processed<\/p>\n<ul>\n<li>Document retrieval. 
The goal is to find all Edinburgh loco-specific items which fit our remit (fiction, autobiography, travel)<\/li>\n<li>Get ranked documents<\/li>\n<li>Assisted Curation is done with the Palimpsest Annotation Tool (developed at St Andrews). Human makes decisions about whether items are &#8216;in or out&#8217; (e.g. poetry marked as such and then excluded for the moment &#8211; may come back to this later)<\/li>\n<\/ul>\n<p>Gazetteer Creation<br \/>\n* Text mining tools use the Edinburgh Geoparser to mark up place names and resolve them to coordinates with a choice of gazetteer as the reference source &#8211; e.g. GeoNames<\/p>\n<p>Not all place matches in the gazetteer are interesting to the project &#8211; e.g. &#8216;Spring&#8217;. Clean these out. Have built the gazetteer and are now building on this &#8211; e.g. want to do further linguistic analysis, building a mobile app so you can explore the literature based on your location<\/p>\n<p>Final outputs will be web-based visualisations and a mobile app &#8211; the aim is to create interfaces for both literary scholars and the general public.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>This blog post was written during a presentation at the British Library Labs Symposium in November 2014. It is likely full of errors and omissions having been written real-time. 
Dr Beatrice Alex, University of Edinburgh Looking for mentions of places in Edinburgh using data sources including: * HathiTrust * British Library Nineteenth Century Books Collection [&hellip;]<\/p>\n","protected":false},"author":2,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[99],"class_list":["post-1682","post","type-post","status-publish","format-standard","hentry","category-uncategorized","tag-bl_labs"],"_links":{"self":[{"href":"http:\/\/www.meanboyfriend.com\/overdue_ideas\/wp-json\/wp\/v2\/posts\/1682","targetHints":{"allow":["GET"]}}],"collection":[{"href":"http:\/\/www.meanboyfriend.com\/overdue_ideas\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"http:\/\/www.meanboyfriend.com\/overdue_ideas\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"http:\/\/www.meanboyfriend.com\/overdue_ideas\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"http:\/\/www.meanboyfriend.com\/overdue_ideas\/wp-json\/wp\/v2\/comments?post=1682"}],"version-history":[{"count":3,"href":"http:\/\/www.meanboyfriend.com\/overdue_ideas\/wp-json\/wp\/v2\/posts\/1682\/revisions"}],"predecessor-version":[{"id":1700,"href":"http:\/\/www.meanboyfriend.com\/overdue_ideas\/wp-json\/wp\/v2\/posts\/1682\/revisions\/1700"}],"wp:attachment":[{"href":"http:\/\/www.meanboyfriend.com\/overdue_ideas\/wp-json\/wp\/v2\/media?parent=1682"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"http:\/\/www.meanboyfriend.com\/overdue_ideas\/wp-json\/wp\/v2\/categories?post=1682"},{"taxonomy":"post_tag","embeddable":true,"href":"http:\/\/www.meanboyfriend.com\/overdue_ideas\/wp-json\/wp\/v2\/tags?post=1682"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}