<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Overdue Ideas</title>
	<atom:link href="http://www.meanboyfriend.com/overdue_ideas/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.meanboyfriend.com/overdue_ideas</link>
	<description>Ideas linking Libraries, Computing, E-learning, and anything else that springs to mind.</description>
	<lastBuildDate>Tue, 28 Feb 2012 11:18:00 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.2</generator>
		<item>
		<title>Mendeley and APIs</title>
		<link>http://www.meanboyfriend.com/overdue_ideas/2012/02/mendeley-and-apis/</link>
		<comments>http://www.meanboyfriend.com/overdue_ideas/2012/02/mendeley-and-apis/#comments</comments>
		<pubDate>Tue, 28 Feb 2012 11:17:36 +0000</pubDate>
		<dc:creator>ostephens</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://www.meanboyfriend.com/overdue_ideas/?p=1408</guid>
		<description><![CDATA[Now Ian Mulvany talking about Mendeley and how they use APIs &#8211; both publishing and consuming. Try to expose all the metadata being added by users via an API &#8211; a &#8220;social catalogue&#8221;. This enables &#8216;discovery&#8217;, but not &#8216;delivery&#8217; &#8211; this is where Mendeley can make use of external APIs &#8211; such as the WorldCat [...]]]></description>
			<content:encoded><![CDATA[<p>Now Ian Mulvany talking about <a href="http://www.mendeley.com/">Mendeley</a> and how they use APIs &#8211; both publishing and consuming.</p>
<p>Try to expose all the metadata being added by users via an API &#8211; a &#8220;social catalogue&#8221;. This enables &#8216;discovery&#8217;, but not &#8216;delivery&#8217; &#8211; this is where Mendeley can make use of external APIs &#8211; such as the WorldCat API.</p>
<p>Mendeley invest in APIs because</p>
<ul>
<li>It helps them extend their product, but integrating data/functionality from other places</li>
<li>It enables others to extend their product &#8211; they don&#8217;t have time to build everything that users are asking for. E.g. Android client built by users, as company didn&#8217;t have the resource</li>
</ul>
<p>Mendeley uses WorldCat registry to find/suggest appropriate OpenURL resolver depending on users location &#8211; as most users won&#8217;t know what an OpenURL resolver is, or what the detail are.</p>
<p>Mendeley uses OAuth &#8211; which means they can integrate with institutional repositories and a users own publications in Mendeley &#8211; going to be live soon (working with JISC, Symplectic and University of Cambridge on this &#8211; http://jisc-dura.blogspot.com/). Learnt a lot about consuming their own APIs in this project &#8211; and uncovered bugs&#8230;</p>
<p>&#8220;We should have built the API first, and the product second&#8221; &#8211; the fact they didn&#8217;t now creating work. Now they are creating a new application for libraries, and building API first. Ian firmly believes this is a better approach.</p>
<p>Ian&#8217;s top 10 tips for API provision:</p>
<ul>
<li>first API, then app</li>
<li>use your own APIs (and he believes Mendeley should do this more)</li>
<li>make an (API) interface you would use yourself</li>
<li>provide lots of example docs &#8211; coders like to do stuff quickly &#8211; if they can get something working from an example quickly, they&#8217;ll then invest</li>
<li>version your API &#8211; backwards compatible</li>
<li>put rate limits in place</li>
<li>work with a 3rd party to provide keys</li>
<li>have clear licensing</li>
<li>engage with your community</li>
<li>promote, promote, promote, promote, promote</li>
</ul>
<p>In terms of consuming:</p>
<ul>
<li>know what you want to do</li>
<li>define the value &#8211; this may be service delivery, or could be development of skills for developers etc.</li>
<li>measure the value &#8211; otherwise difficult to prioritise future developments</li>
<li>understand the SLA</li>
<li>if it&#8217;s important &#8211; have a backup plan &#8211; dependence on 3rd party is a risk which you should manage</li>
<li>don&#8217;t wait on API for page loads &#8211; found that Mendeley homepage was waiting for a response from an API was down, and so the page didn&#8217;t load&#8230;</li>
<li>get on the mailing list/dev group</li>
<li>look for good example code</li>
<li>don&#8217;t be afraid to pay &#8211; if it&#8217;s important, it&#8217;s worth paying for</li>
<li>use Mendeley&#8217;s APIs <img src='http://www.meanboyfriend.com/overdue_ideas/wp-includes/images/smilies/icon_wink.gif' alt=';)' class='wp-smiley' /> </li>
</ul>
<p>&nbsp;</p>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://www.meanboyfriend.com/overdue_ideas/2012/02/mendeley-and-apis/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Citavi and APIs</title>
		<link>http://www.meanboyfriend.com/overdue_ideas/2012/02/citavi-and-apis/</link>
		<comments>http://www.meanboyfriend.com/overdue_ideas/2012/02/citavi-and-apis/#comments</comments>
		<pubDate>Tue, 28 Feb 2012 10:47:45 +0000</pubDate>
		<dc:creator>ostephens</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://www.meanboyfriend.com/overdue_ideas/?p=1405</guid>
		<description><![CDATA[I&#8217;m at the OCLC EMEARC meeting today, talking and hearing about APIs. Having done my bit at the start, now trying to relax into the other presentations before questions and general discussion at the end. Now Antonio Tejada and Hans-Siem Schweiger are talking about Citavi which combines Reference Management and Knowledge Organisation. Citavi designed to help with [...]]]></description>
			<content:encoded><![CDATA[<p>I&#8217;m at the OCLC EMEARC meeting today, talking and hearing about APIs. Having done my bit at the start, now trying to relax into the other presentations before questions and general discussion at the end.</p>
<p>Now Antonio Tejada and Hans-Siem Schweiger are talking about <a href="http://www.citavi.com/">Citavi</a> which combines Reference Management and Knowledge Organisation. Citavi designed to help with searching, retrieving results, acquiring materials &#8211; all of these require interaction with library sources. Citavis supports adding data manually, from file upload, browser extensions and via APIs.</p>
<p>Manual entry is error prone and time consuming</p>
<p>File upload &#8211; uses standard formats (RIS/BibTeX); supported by a wide range of catalogues and databases; but still time consuming</p>
<p>Browser Extension &#8211; e.g. looks for embedded metadata in the page (e.g. COinS) or find standard identifiers in the page (e.g. ISBN) and import data</p>
<p>APIs &#8211; eliminates the browser &#8211; you don&#8217;t need to go to lots of different sources on the web. Fastest mechanism. Direct. Integrates in the workflow much better. However cost of implementation can vary quite a bit &#8211; it all depends on the API &#8211; some very fast (e.g. z39.50 can do in minutes now, but custom APIs can be more difficult)</p>
<p>Citavi Features which use APIs:</p>
<ul>
<li>Online search &#8211; integrated into the Citavi application</li>
<li>Retrieve by identifier (e.g. DOI, ISBN, PubMed ID)</li>
<li>Import formatted bibliography &#8211; can take a bibliography from a word file and Citavi will run a search for each item in the bibliography</li>
<li>Find Library Locations</li>
<li>Find Full Text</li>
<li>Check availability with OpenURL (seems like this actually just pushes user to their local resolver?)</li>
</ul>
<div>Citavi supports a proxy service for some resources, when needed. E.g. for WorldCat API where an API key is required.</div>
<p>Universities can get site license for Citavi &#8211; allows library to create a special settings file with authentication details for databases (that are not IP authenticated)</p>
<p>Challenges for Citavis using APIs:</p>
<ul>
<li>Administrative challenges</li>
<ul>
<li>Some libraries don&#8217;t want to be accessible (at least via a desktop application)</li>
<li>Catalogues that charge by the record for metadata</li>
<li>Inconsistent communication &#8211; e.g. change of settings on library system, don&#8217;t inform Citavi</li>
</ul>
<li>Technical challenges</li>
<ul>
<li>Custom catalog software &#8211; missing or inconsistent standards support and inconsistent field mapping</li>
<li>Legacy data &#8211; not as well-structured; inconsistent data entry</li>
</ul>
</ul>
<p>Going forward:</p>
<ul>
<li>Geographic search (WorldCat Search API)</li>
<li>Enhanced availability search (WorldCat Registry and OpenURL Gateway)</li>
<li>Acquisitions management (WMS Acquisitions) &#8211; Citavi didn&#8217;t anticipate this, but some libraries using Citavi to manage acquisitions processes</li>
<li>Metadata &#8211; looking at authority control (WorldCat Identities); Alternate editions (xISBN); ISSN lookup (xISSN)</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.meanboyfriend.com/overdue_ideas/2012/02/citavi-and-apis/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>CETIS Conference: Learning Registry show and tell</title>
		<link>http://www.meanboyfriend.com/overdue_ideas/2012/02/cetis-conference-learning-registry-show-and-tell/</link>
		<comments>http://www.meanboyfriend.com/overdue_ideas/2012/02/cetis-conference-learning-registry-show-and-tell/#comments</comments>
		<pubDate>Wed, 22 Feb 2012 16:30:48 +0000</pubDate>
		<dc:creator>ostephens</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[cetis12]]></category>
		<category><![CDATA[jlern]]></category>
		<category><![CDATA[learningreg]]></category>

		<guid isPermaLink="false">http://www.meanboyfriend.com/overdue_ideas/?p=1402</guid>
		<description><![CDATA[First up Scott Wilson (@scottbw) describing potential use of learning registry to bring together &#8216;paradata&#8217; (activity/usage data) for &#8216;widgets&#8217; (or apps) across different widget (app) stores &#8211; the idea that you could have the same app in different stores, and want to aggregate the reviews or ratings from each store. Put in bid to JISC [...]]]></description>
			<content:encoded><![CDATA[<p>First up Scott Wilson (@scottbw) describing potential use of learning registry to bring together &#8216;paradata&#8217; (activity/usage data) for &#8216;widgets&#8217; (or apps) across different widget (app) stores &#8211; the idea that you could have the same app in different stores, and want to aggregate the reviews or ratings from each store. Put in bid to JISC under rapid innovation call for a project &#8216;SPAWS&#8217;&#8230;</p>
<p>Terry McAndrew (from JISC TechDis) &#8211; want to network experience with resources &#8211; identify accessible practice/purposes. Terry says most &#8216;OER Problems&#8217; are social not technical. Asks &#8211; can we find learning registry output via Google?</p>
<p>Walt Grata showing tools that he has built on top of Learning Registry &#8230; (on <a href="https://github.com/adlnet">github</a>):</p>
<ul>
<li>&#8216;Landing pages&#8217; for content &#8211; that can be indexed via Google (think this is new, and not up at github yet)</li>
<li><a href="https://github.com/adlnet/lr-data">Harvesting tool</a> &#8211; to grab stuff from a node and put into another storage mechanism &#8211; e.g. couchdb,  postgresql, etc etc</li>
</ul>
<p>Pat Lockley &#8211; <a href="https://docs.google.com/present/view?id=0AeVVe3qVXORUZGY1dm40aHhfMzVya2Y2bmpnZw&amp;pli=1">slides</a></p>
<ul>
<li><a href="https://chrome.google.com/webstore/detail/cegnlobohmoekpdicnplajegemnjbhgp?hl=en-GB">Chrome plugin</a>, code <a href="https://github.com/patlockley/learning_registry_chrome">on github</a></li>
<ul>
<li>No-one will search outside Google &#8211; so take learning registry to Google. Chrome plugin finds all links on the page, and checks each one on the learning registry &#8211; and looks for some common attributes &#8211; like &#8216;title&#8217; or &#8216;description&#8217; etc. &#8211; and can then manipulate browser display to make use of this data.</li>
</ul>
<li><a href="http://wordpress.org/extend/plugins/learning-registry-widget/">WordPress Widget </a> - code <a href="https://github.com/patlockley/Learning_registry_widget">on github</a></li>
<ul>
<li>plugin for WordPress to display content from a learning registry (node) in a wordpress blog</li>
</ul>
</ul>
<p>Steven Cook &#8211; used Cake (PHP framework) to extract and &#8216;slice&#8217; data from learning registry node. Also pulling data from other sources &#8211; like Topsy. <a href="http://github.com/cookiescrumbs/Datasource">Code on github</a>. Talking about how can&#8217;t expect Learning Registry to do the hard work here &#8211; have to expect to pull out data, cache it, etc. Notes learning registry API isn&#8217;t completely RESTful (? not sure what the issues are) .</p>
]]></content:encoded>
			<wfw:commentRss>http://www.meanboyfriend.com/overdue_ideas/2012/02/cetis-conference-learning-registry-show-and-tell/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>CETIS Conference: Capturing Conversations About Learning Resources</title>
		<link>http://www.meanboyfriend.com/overdue_ideas/2012/02/cetis-conference-capturing-conversations-about-learning-resources/</link>
		<comments>http://www.meanboyfriend.com/overdue_ideas/2012/02/cetis-conference-capturing-conversations-about-learning-resources/#comments</comments>
		<pubDate>Wed, 22 Feb 2012 14:36:25 +0000</pubDate>
		<dc:creator>ostephens</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[cetis12]]></category>
		<category><![CDATA[jlern]]></category>
		<category><![CDATA[learningreg]]></category>

		<guid isPermaLink="false">http://www.meanboyfriend.com/overdue_ideas/?p=1392</guid>
		<description><![CDATA[This session is really why I&#8217;ve come to the CETIS Conference (apart from the general opportunity to meet and chat to people which is also great) &#8211; it&#8217;s about &#8220;The Learning Registry&#8221; (@learningreg and http://learningregistry.org). The Learning Registry is not a destination &#8211; it&#8217;s about building infrastructure &#8211; and in some ways has both parallels [...]]]></description>
			<content:encoded><![CDATA[<p>This session is really why I&#8217;ve come to the CETIS Conference (apart from the general opportunity to meet and chat to people which is also great) &#8211; it&#8217;s about &#8220;The Learning Registry&#8221; (<a href="http://twitter.com/learningreg ">@learningreg </a>and <a href="http://learningregistry.org">http://learningregistry.org</a>). The Learning Registry is not a destination &#8211; it&#8217;s about building infrastructure &#8211; and in some ways has both parallels and relevance to the work the <a href="http://discovery.ac.uk">Discovery programme</a> is undertaking (which I&#8217;m involved in).</p>
<p>A simple use case for the learning registry is:</p>
<ul>
<li>Nasa publishes a physics video</li>
<ul>
<li>PBS posts a link to the video</li>
<li>NSDL posts a link to the video</li>
<li>A school uses the video in a course in their Moodle VLE</li>
</ul>
</ul>
<p>Each place/portals where the link is used or published only knows about use of their link or copy of the resource. So Learning Registry aims to support way of sharing this type of activity &#8216;in the open&#8217; &#8211; so that this can be captured and reflected &#8211; the &#8216;social metadata timeline&#8217; &#8211; Learning Registry is to provide infrastructure to support this. Learning Registry describes this type of activity/usage data associated with a resource &#8216;paradata&#8217; &#8211; although learning registry learning registry doesn&#8217;t care what type of data it stores (as long as it can be expressed as JSON)</p>
<p>The learning registry is &#8220;an idea, a research project, an open source community project, a public social metadata distribution network&#8217;&#8230;</p>
<p>The guiding principles: be enabling, capability not solutions, no barrier entry, no single point of failure &#8211; everything distributed&#8230;</p>
<p>Not going to try to blog the technical architecture of.. but summary of APIs</p>
<ul>
<li><strong>Distribute API</strong> - uses http POST. About copying data from one node to another &#8211; i.e. achieving the distributed part of the architecture</li>
<li><strong>Publish API</strong> &#8211; how you get stuff into a Learning Registry node (that is, you, the producer of information, publish it *to* the learning registry node) &#8211; uses http POST. Learning reg also supports SWORD for publishing data into a node</li>
<li><strong>Obtain API</strong> &#8211; getting data out of a learning registry node &#8211; uses http GET</li>
<li><strong>Harvest and OAI-PMH APIs</strong> &#8211; another way of getting stuff out of the node. Harvest returns JSON but supports OAI-PMH type actions. OAI-PMH also supported.</li>
</ul>
<p>It is stressed that this is really a project at the start of its work &#8211; the way to engage and to find out how to do this stuff is to join the community &#8211; join the developer list etc. and raise issues, ask questions &#8211; this is part of the experiment and will inform the development.</p>
<h2>JLern</h2>
<p>JLern is the project to setup an experimental node in the UK &#8211; being run by Mimas.</p>
<p>2 kinds of nodes in the Learning Registry:</p>
<ul>
<li>Common node</li>
<li>Gateway node</li>
</ul>
<p>JLern have setup a &#8216;common node&#8217; &#8211; this can support:</p>
<ul>
<li>Publish services</li>
<li>Access services</li>
<li>Distribution services (JLern now have a 2nd common node up and running to try these)</li>
<li>&#8230;.</li>
</ul>
<p>Common nodes can be part of &#8216;networks&#8217;. Networks can (only) be connected via &#8216;gateway nodes&#8217;</p>
<p>When networks are connected, this is called a &#8216;community&#8217;. A &#8216;network community&#8217; is a collection of interconnected resource distribution networks. A resource network can only be a member of one community.</p>
<p>Now have published the JORUM metadata (via OAI-PMH) &#8211; so about 15k resources. Open University now looking at similar activity. Now Jorum exploring framework for capturing paradata about resources.</p>
<p>Gathering ideas and use cases now &#8211; e.g. see JLern challenge from dev8D <a href="http://dev8d.org/challenges/">http://dev8d.org/challenges/</a></p>
<p>The JLern &#8216;Alpha&#8217; node is at alpha.mimas.ac.uk &#8211; you can authenticate using details given in this blog post <a href="http://jlernexperiment.wordpress.com/2012/02/02/alpha-node/">http://jlernexperiment.wordpress.com/2012/02/02/alpha-node/</a>.</p>
<p>As already mentioned they&#8217;ve harvested JORUM OAI-PMH data and published on JLern alpha node.</p>
<p>They now have &#8216;Beta&#8217; node (this doesn&#8217;t represent a level of development &#8211; just naming convention I think) &#8211; this is running on Windows (Alpha is on Linux). Also planning a &#8216;Gamma&#8217; node running on Amazon EC2.</p>
<p>JLern hackday held in January &#8211; write up at <a href="http://jlernexperiment.wordpress.com/2012/02/21/the-hackday-report-and-reflections/">http://jlernexperiment.wordpress.com/2012/02/21/the-hackday-report-and-reflections/</a>, and also a Java Library for interacting with Learning Registry nodes at <a href="https://github.com/navnorth/LRJavaLib">https://github.com/navnorth/LRJavaLib</a> .</p>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://www.meanboyfriend.com/overdue_ideas/2012/02/cetis-conference-capturing-conversations-about-learning-resources/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>CETIS Conference: Bring on the metaverse</title>
		<link>http://www.meanboyfriend.com/overdue_ideas/2012/02/bring-on-the-metaverse/</link>
		<comments>http://www.meanboyfriend.com/overdue_ideas/2012/02/bring-on-the-metaverse/#comments</comments>
		<pubDate>Wed, 22 Feb 2012 11:48:29 +0000</pubDate>
		<dc:creator>ostephens</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://www.meanboyfriend.com/overdue_ideas/?p=1387</guid>
		<description><![CDATA[Ian Hughes (@epredator) from Feeding Edge ltd. Ian describes himself as a &#8216;metaverse evangelist&#8217;. Ended up presenting a section on &#8216;Cool Stuff&#8217; on CITV - and finding he was talking about exactly the same stuff to talk to children as he&#8217;d been talking to the commercial sector/large corps about. Tries to include mentions of open [...]]]></description>
			<content:encoded><![CDATA[<p>Ian Hughes (<a href="http://twitter.com/epredator">@epredator</a>) from <a href="http://www.feedingedge.co.uk">Feeding Edge ltd</a>. Ian describes himself as a &#8216;<a href="http://en.wikipedia.org/wiki/Metaverse">metaverse</a> evangelist&#8217;. Ended up presenting a section on <a href="http://www.itv.com/citvonline/coolstuffcollective/">&#8216;Cool Stuff&#8217; on CITV </a>- and finding he was talking about exactly the same stuff to talk to children as he&#8217;d been talking to the commercial sector/large corps about. Tries to include mentions of open source, and show that children can get involve and affect stuff &#8230; in the way he did when he was young.</p>
<p>A lot of the stuff Ian talks about comes back to games &#8211; perhaps because about playing and about building &#8211; he went into computing because he wanted to build games. Not just about writing games &#8211; but using toolkits to mod characters, game play etc.</p>
<p>Also interest in animation (involved in a BCS group about animation) &#8211; and this is about art and technical skill &#8211; you need to bring together people with these different skill sets, and each needs to understand what the other has to offer. Things like <a href="http://unity3d.com">http://unity3d.com</a> and <a href="http://opensimulator.org">http://opensimulator.org</a> allow you to write stuff yourself . Tools like <a href="http://evolver.com">http://evolver.com</a> give you easy ways into building characters etc. Also platforms are available <a href="http://smartfoxserver.com">http://smartfoxserver.com</a> &#8211; used to delivered <a href="http://www.clubpenguin.com/">Club Penguin</a> &#8211; and if you have &lt;100 people connecting it is free to run.</p>
<p><a href="http://en.wikipedia.org/wiki/Forza_Motorsport">Forza</a> &#8211; racing game/driving sim &#8211; that allows you to mod the car &#8211; including paintwork etc. &#8211; and then when you race against someone they see <em>your </em>car. Can include things like logos etc&#8230;</p>
<p>Ian now demonstrating how he can run <a href="http://opensimulator.org">OpenSim</a> on his laptop &#8211; his customised avatar wears digital version of the leather jacket he is currently wearing &#8211; identity and links between virtual and real. Can create virtual objects immediately &#8211; and all viewers of shared space see it immediately &#8211; you&#8217;ve distributed just be creating it. Ian talking about how he finds virtual objects as useful cues for talking &#8211; using &#8217;3 dimensional&#8217; cues for what he is going to say (reminds me of &#8216;palace of memories&#8217; type stuff).</p>
<p>Ian says, a shared &#8216;space&#8217; when presenting gives different effects and works well for some people &#8211; you can share presentations in the space, and also place discussion in the space.</p>
<p>Now moving onto <a href="http://www.minecraft.net/">Minecraft</a> &#8211; much more game based, but lots of similarity with virtual worlds. You can run a <a href="http://www.minecraft.net/">Minecraft</a> server yourself, or on the web. Starting to see some use of this in schools &#8211; mentions &#8220;Minecraft Teacher&#8221; <a href="http://minecraftteacher.net/">http://minecraftteacher.net/</a>. Ian describing how his children used Minecraft together for the first time &#8211; collaboration, exploration, building etc. Minecraft also allows building of mechanical devices &#8211; using things like trip switches, trains, etc.</p>
<p>Ian mentions Arduino and 3D printing as things he&#8217;s got onto the Cool Stuff program. Ian is especially enthusiastic about 3d printing &#8211; highlighting possibilities of moving between environments like Skylander &#8211; you could print out your own figures, with RFID chips in them&#8230;.</p>
<p>Finally Ian closes by talking about how engaging children is about presenting this stuff in fun/interesting ways &#8211; but perhaps also about trusting children  will be interested and will learn, if you make it interesting.</p>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://www.meanboyfriend.com/overdue_ideas/2012/02/bring-on-the-metaverse/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Experimenting with British Museum data</title>
		<link>http://www.meanboyfriend.com/overdue_ideas/2011/12/experimenting-with-british-museum-data/</link>
		<comments>http://www.meanboyfriend.com/overdue_ideas/2011/12/experimenting-with-british-museum-data/#comments</comments>
		<pubDate>Mon, 05 Dec 2011 11:59:28 +0000</pubDate>
		<dc:creator>ostephens</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[british museum]]></category>
		<category><![CDATA[linked data]]></category>
		<category><![CDATA[rdf]]></category>
		<category><![CDATA[sparql]]></category>
		<category><![CDATA[ukdiscovery]]></category>

		<guid isPermaLink="false">http://www.meanboyfriend.com/overdue_ideas/?p=1366</guid>
		<description><![CDATA[In September 2011 the British Museum started publishing descriptions of items in its collections as RDF (the data structure that underlies Linked Data). The data is available from http://collection.britishmuseum.org/ where the Museum have made a &#8216;SPARQL Endpoint&#8217; available. SPARQL is a query language for extracting data from RDF stores &#8211; it can be seen as a [...]]]></description>
			<content:encoded><![CDATA[<p>In September 2011 the British Museum started publishing descriptions of items in its collections as RDF (the data structure that underlies Linked Data). The data is available from <a href="http://collection.britishmuseum.org/">http://collection.britishmuseum.org/</a> where the Museum have made a &#8216;SPARQL Endpoint&#8217; available. SPARQL is a query language for extracting data from RDF stores &#8211; it can be seen as a parallel to SQL, which is a query language for extract data from traditional relational databases.</p>
<p>Although I knew what SPARQL was, and what it looked like, I really hadn&#8217;t got to grips with it, and since I&#8217;d just recently purchased &#8220;<a href="http://shop.oreilly.com/product/0636920020547.do">Learning SPARQL</a>&#8221; it seemed like a good opportunity to get familiar with the British Museum data and SPARQL syntax. So I had a play (more below). Skip forward a few months, and I noticed some tweets from a JISC meeting about the <a href="http://pelagios-project.blogspot.com/p/about.html">Pelagios project</a> (which is interested in the creation of linked (geo)data to describe &#8216;ancient places&#8217;), and in particular from <a href="http://twitter.com/mia_out">Mia Ridge</a> and <a href="http://twitter.com/alexsdutton">Alex Dutton</a> which indicated they were experiementing with the British Museum data. My previous experience seemed to gel with the experience they were having, and prompted me to finally get on with a blog post documenting my experience so hopefully others can benefit.</p>
<p>Perhaps one reason I&#8217;ve been a bit reluctant to blog this is that I struggled with the data, and I don&#8217;t want this post to come across as overly critical of the British Museum. The fact they have their data out there at all is amazing &#8211; and I hope other museums (and archives and libraries) follow the lead of the British Museum in releasing data onto the web. So I hope that all comments/criticisms below come across as offering suggestions for improving the Museum data on the web (and offering pointers to others doing similar projects), and of course the opportunity for some dialogue about the issues. There is also no doubt that some of the issues I encountered were down to my own ignorance/stupidity &#8211; so feel free to point out obvious errors.</p>
<p>When you arrive at the British Museum SPARQL endpoint the nice thing is there is a pre-populated query that you can run immediately. It just retrieves 10 results, of any type, from the data &#8211; but it means you aren&#8217;t staring at a blank form, and those ten results give a starting point for exploring the data set. Most URIs in the resulting data are clickable, and give you a nice way of finding what data is in the store, and to start to get a feel for how it is structured.</p>
<p>For example, running the default search now brings back the triple:</p>
<table id="Results">
<tbody>
<tr>
<td width="100px">Subject</td>
<td width="744px"><a href="http://collection.britishmuseum.org/id/object/EAF119772">http://collection.britishmuseum.org/id/object/EAF119772</a></td>
</tr>
<tr>
<td width="100px">Predicate</td>
<td width="744px"><a href="http://collection.britishmuseum.org/id/crm/P3F.has_note">http://collection.britishmuseum.org/id/crm/P3F.has_note</a></td>
</tr>
<tr>
<td width="100px">Object</td>
<td width="744px">Object type :: marriage equipment ::</td>
</tr>
</tbody>
</table>
<p>&nbsp;</p>
<p>Which is intriguing enough to make you want to know more (I am married, and have to admit I don&#8217;t remember any special equipment). Clicking on the URI <a href="http://collection.britishmuseum.org/id/object/EAF119772">http://collection.britishmuseum.org/id/object/EAF119772</a> in a browser takes you to an HTML representation of the resource &#8211; a list of all the triples that make statements about the item in the British Museum identified by that URI.</p>
<p>While I think it would be an exaggeration to say this is &#8216;easily readable&#8217;, sometimes, as with the triple above, there is enough information to guess the basics of what is being said &#8211; for example:</p>
<table id="Results">
<tbody>
<tr>
<td width="100px">Subject</td>
<td width="744px"><a href="http://collection.britishmuseum.org/id/object/EAF119772">http://collection.britishmuseum.org/id/object/EAF119772</a></td>
</tr>
<tr>
<td width="100px">Predicate</td>
<td width="744px"><a href="http://collection.britishmuseum.org/id/crm/P3F.has_note">http://collection.britishmuseum.org/id/crm/P3F.has_note</a></td>
</tr>
<tr>
<td width="100px">Object</td>
<td width="744px">Acquisition date :: 1994 ::</td>
</tr>
</tbody>
</table>
<p>&nbsp;</p>
<p>From this it is perhaps easy enough to see that there is some item (identified by the URI <a href="http://collection.britishmuseum.org/id/object/EAF119772">http://collection.britishmuseum.org/id/object/EAF119772</a>) which has a note related to it stating that it was acquired (presumably by the museum) in 1994.</p>
<p>So far, so good. I&#8217;d got an idea of the kind of information that might be in the database. So the next question I had was &#8220;what kind of queries could I throw at the data that might produce some interesting/useful results?&#8221; Since I&#8217;d recently been playing around with <a href="http://www.meanboyfriend.com/overdue_ideas/2011/07/compose-yourself/">data about composers</a> I thought it might be interesting to see if the British Museum had any objects that were related to a well-known composer &#8211; say Mozart.</p>
<p>This is where I started to hit problems&#8230;. In my initial explorations, while some information was obvious, I&#8217;d also realised that the data was modelled using something called <a href="http://www.cidoc-crm.org/index.html">CIDOC CRM</a>, which is intended to model &#8216;cultural heritage&#8217; data. With some help from Twitter (including staff at the British Museum) I started to read up on CIDOC CRM &#8211; and struggled! Even now I&#8217;m not sure I&#8217;d say I feel completely on top of it, but I now have a bit of a better understanding. Much of the CIDOC model is based around &#8216;events&#8217; &#8211; things that happened at a certain time/in a certain place. This means that often what might seem like a simple piece of information &#8211; such as where a item in the museum originates from &#8211; become complex.</p>
<p>To give a simple example, the &#8216;discovery&#8217; of an item is a kind of event. So to find all the items in the British Museum &#8216;discovered&#8217; in Greenwich you have to first find all the &#8216;discovery&#8217; events that &#8216;took place at&#8217; Greenwich, then link these discovery events back to the items they are a related to:</p>
<p>An item -&gt; was discovered by a discovery event -&gt; which took place at Greenwich</p>
<p>This adds extra complexity to what might seem initially (naively?) a simple query. This example was inspired by discussion at the Pelagios event mentioned earlier &#8211; the full query is:</p>
<pre>SELECT ?greenwichitem WHERE
{
	?s &lt;http://collection.britishmuseum.org/id/crm/P7F.took_place_at&gt; &lt;http://collection.britishmuseum.org/id/thesauri/x34215&gt; .
	?subitem &lt;http://collection.britishmuseum.org/id/crm/bm-extensions/PX.was_discovered_by&gt; ?s .
	?greenwichitem &lt;http://collection.britishmuseum.org/id/crm/P46F.is_composed_of&gt; ?subitem
}</pre>
<pre><span class="Apple-style-span" style="font-family: Georgia, 'Times New Roman', 'Bitstream Charter', Times, serif; font-size: 13px; line-height: 19px; white-space: normal;">and the results can be seen at </span><a style="font-family: Georgia, 'Times New Roman', 'Bitstream Charter', Times, serif; font-size: 13px; line-height: 19px; white-space: normal;" href="http://bit.ly/vojTWq">http://bit.ly/vojTWq</a><span class="Apple-style-span" style="font-family: Georgia, 'Times New Roman', 'Bitstream Charter', Times, serif; font-size: 13px; line-height: 19px; white-space: normal;">.</span></pre>
<p>To make things even more complex the British Museum data seems to describe all items actually as made up of (what I&#8217;m calling) &#8216;sub-items&#8217;. In some cases this makes some sense. If a single item is actually made up of several pieces, all with their own properties and provenance, it clearly makes sense to describe each part separately. Each part of the object will have it&#8217;s own properties and provenance, and it makes sense to describe these separately.</p>
<p>However, the British Museum data describes even single items as made up of &#8216;pieces&#8217; &#8211; just that the single item consists of a single piece &#8211; and it is then that piece that has many of the properties of the item associated with it. To illustrate. A multi-piece item is like:</p>
<p><a href="http://www.meanboyfriend.com/overdue_ideas/wp-content/uploads/2011/12/item-hierarchy.png"><img class="aligncenter size-full wp-image-1369" title="item hierarchy" src="http://www.meanboyfriend.com/overdue_ideas/wp-content/uploads/2011/12/item-hierarchy.png" alt="" width="524" height="208" /></a></p>
<p>Which makes sense to me. But a single piece item is like:</p>
<p>&nbsp;</p>
<p><a href="http://www.meanboyfriend.com/overdue_ideas/wp-content/uploads/2011/12/single-item.png"><img class="aligncenter size-full wp-image-1370" title="single item" src="http://www.meanboyfriend.com/overdue_ideas/wp-content/uploads/2011/12/single-item.png" alt="" width="524" height="100" /></a></p>
<p>Which I found (and continue to find) this confusing. This isn&#8217;t helped in my view by the fact that some properties are attached the the &#8216;parent&#8217; object, and some to the &#8216;child&#8217; object, and I can&#8217;t really work out the logic associated with this. For example it is the &#8216;parent&#8217; object that belongs to a department in the British Museum, while it is the &#8216;child&#8217; object that is made of a specific material. Both the parent and child in this situation are classified as physical objects, and this feels wrong to me.</p>
<p>Thankfully a link from the Pelagios meeting alerted me to some more detailed documentation around the British Museum data (<a href="http://www.researchspace.org/Stage-2-Outputs">http://www.researchspace.org/Stage-2-Outputs</a>), and this suggests that the British Museum are going to move away from this model:</p>
<blockquote><p>Firstly, after much debate we have concluded that preserving the existing modelling relationship as described earlier whereby each object always consists of at least one part is largely nonsense and should not be preserved.</p>
<p>While arguments were put forward earlier for retaining this minimum one part per object scheme, it has now been decided that only objects which are genuinely composed of multiple parts will be shown as having parts.</p></blockquote>
<p>The same document notes that the current modelling &#8220;may be slightly counter-intuitive&#8221; &#8211; I can back up this view!</p>
<p>So &#8211; back to finding stuff related to Mozart&#8230; apart from struggling with the data model, the other issue I encountered was that it was difficult to approach the dataset through anything except a URI for a entity. That is to say, if you knew the URI for &#8216;Wolfgang Amadeus Mozart&#8217; in the museum data set, the query would be easy, but if you only know a name, then it is much more difficult. How could I find the URI for Mozart, to then find all related objects?</p>
<p>Just using SPARQL, there are two approaches that might work. If you know the exact (and I mean exact) form of the name in the data, you can query for a &#8216;literal&#8217; &#8211; i.e. do a SPARQL query for a textual string such as &#8220;Mozart, Wolfgang Amadeus&#8221;. If this is the exact for used in the data, the query will be successful, but if you get this slightly wrong then you&#8217;ll fail to get any result. A working example for the British Museum data is:</p>
<pre>SELECT * WHERE
{
	?s ?p "Mozart, Wolfgang Amadeus"
}</pre>
<p>The second approach you can use is to do a more general query and &#8216;filter&#8217; the result using a regular expression. Regular expressions are ways of looking for patterns in text strings, and are incredibly powerful (supporting things like wildcards, ignoring case etc. etc.). So you can be a lot less precise than searching for an exact string, and for example, you might try to retrieve all the statements about &#8216;people&#8217; and filter for those containing the (case insensitive) word &#8216;mozart&#8217;. While this would get you Leopold Mozart as well as Wolfgang Amadeus if both are present in the data, there are probably a small enough number of mozarts that you would be able to pick out WA Mozart by eye, and get the relevant URI which identifies him.</p>
<p>A possible query of this type is:</p>
<pre>SELECT * WHERE
{
	?s &lt;http://xmlns.com/foaf/0.1/Name&gt; ?o
	FILTER regex(?o, "mozart", "i")
}</pre>
<p>Unfortunately these latter type of &#8216;filter&#8217; queries are pretty inefficient, and the British Museum SPARQL endpoint has some restrictions which mean that if you try to retrieve more than a relatively small amount of data at one time you just get an error. Since this is essentially how &#8216;filter&#8217; queries work (retrieve a largish amount of data first, then filter out the stuff you don&#8217;t want), I couldn&#8217;t get this working. The issue of only being able to retrieve small sets of data was a bit of a frustration overall with the SPARQL endpoint, not helped by the fact that it seemed to be relatively arbitrary in terms of what &#8216;size&#8217; of result set caused an error &#8211; I assume it is something about the overall amount of data retrieved, as it seemed unrelated to the actual number of results retrieved &#8211; for example using:</p>
<pre>SELECT * WHERE
{
	?s ?p ?o
}</pre>
<p>You can retrieve only 123 results before you get an error, while using</p>
<pre>SELECT ?s WHERE
{
	?s ?p ?o
}</pre>
<p>You can retrieve over 300 results without getting an error.</p>
<p>This limitation is an issue in itself (and the British Museum are by no means alone in having performance issues with an RDF triple store), but it is doubly frustrating that the limit is unclear.</p>
<p>The difficulty of exploring the British Museum data from a simple textual string became a real frustration as I explored the data &#8211; it made me realise that while the Linked Data/RDF concept of using URIs and not literals is something I understand and agree with, as people all we know is textual strings that describe things, so to make the data more immediately usable, supporting textual searches (e.g. via a solr index over the literals in the data) might be a good idea.</p>
<p>I got so frustrated that I went looking for ways of compensating. The British Museum data makes extensive use of &#8216;thesauri&#8217; &#8211; lists of terms for describing people, places, times, object types, etc. In theory these thesauri would give the text string entry points into the data, and I found that one of the relevant thesauri (object types) was available on the Collections Link website (<a href="http://www.collectionslink.org.uk/assets/thesaurus/Objintro.htm">http://www.collectionslink.org.uk/assets/thesaurus/Objintro.htm</a>). Each term in this data corresponds to a URI in the British Museum data, and so I wrote a ScraperWiki script which would search for each term in the British Museum data and identify the relevant URI and record both the term and the URI. At the same time a conversation with <a href="http://twitter.com/portableant">@portableant</a> on twitter alerted me to the fact that the &#8216;<a href="http://finds.org.uk/">Portable Antiquities</a>&#8216; site uses a (possibly modified) version of the same thesaurus for classifying objects, so I added in a lookup of the term on this site to start to form connections between the Portable Antiquities data and the British Museum data. This script is available at <a href="https://scraperwiki.com/scrapers/british_museum_object_thesaurus/">https://scraperwiki.com/scrapers/british_museum_object_thesaurus/</a>, but comes with some caveats about the question of how up to date the thesaurus on the Collections Link website is, and the possible imperfections of the matching between the thesaurus and the British Museum data.</p>
<p>Unfortunately it seems that this &#8216;object type&#8217; thesaurus is the only one made publicly available (or at least the only one I could find), while clearly the people and place thesauri would be really interesting, and provide valuable access points into the data. But really ideally these would be built from the British Museum data directly, rather than being separate lists.</p>
<p>So, finally back to Mozart. I discovered another way into the data &#8211; which was via the really excellent British Museum website, which offers the ability to <a href="http://www.britishmuseum.org/research/search_the_collection_database">search the collections via a nice web interface</a>. This is a good search interface, and gives access to the collections &#8211; to be honest already solving problems such as the one I set myself here (of finding all objects related to Mozart) &#8211; but nevermind that now!  If you search this interface and find an object, when the you view the record for the object, you&#8217;ll probably be at a URL something like:</p>
<p><a href="http://www.britishmuseum.org/research/search_the_collection_database/search_object_details.aspx?objectid=3378094&amp;partid=1&amp;searchText=mozart&amp;numpages=10&amp;orig=%2fresearch%2fsearch_the_collection_database.aspx&amp;currentPage=1">http://www.britishmuseum.org/research/search_the_collection_database/search_object_details.aspx?objectid=3378094&amp;partid=1&amp;searchText=mozart&amp;numpages=10&amp;orig=%2fresearch%2fsearch_the_collection_database.aspx&amp;currentPage=1</a></p>
<p>If you extract the &#8220;objectid&#8221; (in this case &#8217;3378094&#8242;) from this, you can use this to look up the RDF representation of the same object using a query like:</p>
<pre>SELECT * WHERE
{
	?s &lt;http://www.w3.org/2002/07/owl#sameAs&gt; &lt;http://collection.britishmuseum.org/id/codex/3378094&gt;
}</pre>
<p>This gives you the URI for the object, which you can then use to find other relevant URIs. So in this case I was able to extract the URI for Wolfgang Amadeus Mozart (<a href="http://collection.britishmuseum.org/id/person-institution/39629">http://collection.britishmuseum.org/id/person-institution/39629</a>) and so create a query like:</p>
<pre>SELECT ?item WHERE
{
	?s ?p &lt;http://collection.britishmuseum.org/id/person-institution/39629&gt; .
	?item &lt;http://collection.britishmuseum.org/id/crm/P46F.is_composed_of&gt; ?s
}</pre>
<p>To find the 9 (as of today) items that are in someway related to Mozart (mostly pictures/engravings of Mozart).</p>
<p>The discussion at the Pelgios meeting identified several &#8216;anti-patterns&#8217; related to the usability of Linked Data &#8211; and some of these jumped out at me as being issues when using the British Museum data:</p>
<p><strong>Anti-patterns</strong></p>
<ul>
<li>homepages that don&#8217;t say where data can be found</li>
<li>not providing info on licences</li>
<li>not providing info on RDF syntaxes</li>
<li>not providing egs of query construction</li>
<li>not providing easy way to get at term lists</li>
<li>no html browsing</li>
<li>complex data models</li>
</ul>
<div>The Pelagios wiki has some more information on &#8216;stumbling blocks&#8217; at <a href="http://pelagios.pbworks.com/w/page/48544935/Stumbling%20Blocks">http://pelagios.pbworks.com/w/page/48544935/Stumbling%20Blocks</a>, and also the group exploring (amongst other things) the British Museum data made notes at <a href="http://pelagios.pbworks.com/w/page/48535503/UK%20Cultural%20Heritage">http://pelagios.pbworks.com/w/page/48535503/UK%20Cultural%20Heritage</a>. Also I know that Dominic Oldman from the British Museum was at the meeting, and was keen to get feedback on how they could improve the data or the way it is made available.</div>
<div>One thing I felt strongly when I was looking at the British Museum data is that it would have been great to be able to &#8216;go&#8217; somewhere that others looking at/using the data would also be to discuss the issues. The British Museum provide an email to send feedback (which I&#8217;ve used), but what I wanted to do was say things like &#8220;am I being stupid?&#8221; and &#8220;anyone else find this?&#8221; etc. As a result of discussion at the Pelagios meeting, and on twitter, <a href="http://twitter.com/mia_out">Mia Ridge</a> has setup <a href="http://museum-api.pbworks.com/w/page/48539475/British%20Museum%20Collections%20discussion%20and%20feedback">a wiki page for just such a discussion</a>.</div>
<div>A final thought. The potential of &#8216;linked data&#8217; is to bring together data from multiple sources, and combine to give something that is more than the sum of it&#8217;s parts. At the moment the British Museum data sits in isolation. How amazing would it be to join up the British Museum &#8216;people&#8217; records such as <a href="http://collection.britishmuseum.org/id/person-institution/39629">http://collection.britishmuseum.org/id/person-institution/39629</a> with the VIAF (<a href="http://viaf.org/viaf/32197206/">http://viaf.org/viaf/32197206/</a>) or Library of Congress (<a href="http://id.loc.gov/authorities/names/n80022788">http://id.loc.gov/authorities/names/n80022788</a>) identifier for the same person, and start to produce searches and results that build on the best of all this data?</div>
]]></content:encoded>
			<wfw:commentRss>http://www.meanboyfriend.com/overdue_ideas/2011/12/experimenting-with-british-museum-data/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>The twelve days of ChrisMash</title>
		<link>http://www.meanboyfriend.com/overdue_ideas/2011/12/the-twelve-days-of-chrismash/</link>
		<comments>http://www.meanboyfriend.com/overdue_ideas/2011/12/the-twelve-days-of-chrismash/#comments</comments>
		<pubDate>Thu, 01 Dec 2011 11:17:33 +0000</pubDate>
		<dc:creator>ostephens</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://www.meanboyfriend.com/overdue_ideas/?p=1360</guid>
		<description><![CDATA[Just a little message for ChrisMash: On the first day of ChrisMash my true love sent to me, some cake and some coffee. On the second day of ChrisMash my true love sent to me, two APIs and some cake and some coffee. On the third day of ChrisMash my true love sent to me, three Google Maps, two APIs and some cake and some coffee. On the fourth [...]]]></description>
			<content:encoded><![CDATA[<p>Just a little <a href="http://chrismash.wordpress.com/2011/11/04/send-us-your-chrismash-messages-mashlib/">message for ChrisMash</a>:</p>
<p>On the first day of ChrisMash my true love sent to me, some <a href="http://www.flickr.com/photos/davepattern/4147602466/">cake</a> and some <a href="http://www.flickr.com/photos/benelwell/5512038526/">coffee</a>.</p>
<p>On the second day of ChrisMash my true love sent to me, two <a href="http://en.wikipedia.org/wiki/Application_programming_interface">APIs</a> and some <a href="http://www.flickr.com/photos/davepattern/4147602466/">cake</a> and some <a href="http://www.flickr.com/photos/benelwell/5512038526/">coffee</a>.</p>
<p>On the third day of ChrisMash my true love sent to me, three <a href="http://maps.google.co.uk/">Google Maps</a>, two <a href="http://en.wikipedia.org/wiki/Application_programming_interface">APIs</a> and some <a href="http://www.flickr.com/photos/davepattern/4147602466/">cake</a> and some <a href="http://www.flickr.com/photos/benelwell/5512038526/">coffee</a>.</p>
<p>On the fourth day of ChrisMash my true love sent to me, four <a href="http://en.wikipedia.org/wiki/Remote_procedure_call">RPCs</a>, three <a href="http://maps.google.co.uk/">Google Maps</a>, two <a href="http://en.wikipedia.org/wiki/Application_programming_interface">APIs</a> and some <a href="http://www.flickr.com/photos/davepattern/4147602466/">cake</a> and some <a href="http://www.flickr.com/photos/benelwell/5512038526/">coffee</a>.</p>
<p>On the fifth day of ChrisMash my true love sent to me, five <a href="http://pipes.yahoo.com/pipes/">Yahoo Pipes</a>. Four <a href="http://en.wikipedia.org/wiki/Remote_procedure_call">RPCs</a>, three <a href="http://maps.google.co.uk/">Google Maps</a>, two <a href="http://en.wikipedia.org/wiki/Application_programming_interface">APIs</a> and some <a href="http://www.flickr.com/photos/davepattern/4147602466/">cake</a> and some <a href="http://www.flickr.com/photos/benelwell/5512038526/">coffee</a>.</p>
<p>On the sixth day of ChrisMash my true love sent to me, six <a href="http://en.wikipedia.org/wiki/RSS">RSS feeds</a>, five <a href="http://pipes.yahoo.com/pipes/">Yahoo Pipes</a>. Four <a href="http://en.wikipedia.org/wiki/Remote_procedure_call">RPCs</a>, three <a href="http://maps.google.co.uk/">Google Maps</a>, two <a href="http://en.wikipedia.org/wiki/Application_programming_interface">APIs</a> and some <a href="http://www.flickr.com/photos/davepattern/4147602466/">cake</a> and some <a href="http://www.flickr.com/photos/benelwell/5512038526/">coffee</a>.</p>
<p>On the seventh day of ChrisMash my true love sent to me, seven <a href="http://www.flickr.com/photos/smilylibrarian/4852718720/">homemade badgers</a>, six <a href="http://en.wikipedia.org/wiki/RSS">RSS feeds</a>, five <a href="http://pipes.yahoo.com/pipes/">Yahoo Pipes</a>. Four <a href="http://en.wikipedia.org/wiki/Remote_procedure_call">RPCs</a>, three <a href="http://maps.google.co.uk/">Google Maps</a>, two <a href="http://en.wikipedia.org/wiki/Application_programming_interface">APIs</a> and some <a href="http://www.flickr.com/photos/davepattern/4147602466/">cake</a> and some <a href="http://www.flickr.com/photos/benelwell/5512038526/">coffee</a>.</p>
<p>On the eighth day of ChrisMash my true love sent to me, eight <a href="http://www.flickr.com/photos/davepattern/3064584142/">mashers mashing</a>, seven <a href="http://www.flickr.com/photos/smilylibrarian/4852718720/">homemade badgers</a>, six <a href="http://en.wikipedia.org/wiki/RSS">RSS feeds</a>, five <a href="http://pipes.yahoo.com/pipes/">Yahoo Pipes</a>. Four <a href="http://en.wikipedia.org/wiki/Remote_procedure_call">RPCs</a>, three <a href="http://maps.google.co.uk/">Google Maps</a>, two <a href="http://en.wikipedia.org/wiki/Application_programming_interface">APIs</a> and some <a href="http://www.flickr.com/photos/davepattern/4147602466/">cake</a> and some <a href="http://www.flickr.com/photos/benelwell/5512038526/">coffee</a>.</p>
<p>On the ninth day of ChrisMash my true love sent to me, nine <a href="http://en.wikipedia.org/wiki/SPARQL">SPARQL queries</a>, eight <a href="http://www.flickr.com/photos/davepattern/3064584142/">mashers mashing</a>, seven <a href="http://www.flickr.com/photos/smilylibrarian/4852718720/">homemade badgers</a>, six <a href="http://en.wikipedia.org/wiki/RSS">RSS feeds</a>, five <a href="http://pipes.yahoo.com/pipes/">Yahoo Pipes</a>. Four <a href="http://en.wikipedia.org/wiki/Remote_procedure_call">RPCs</a>, three <a href="http://maps.google.co.uk/">Google Maps</a>, two <a href="http://en.wikipedia.org/wiki/Application_programming_interface">APIs</a> and some <a href="http://www.flickr.com/photos/davepattern/4147602466/">cake</a> and some <a href="http://www.flickr.com/photos/benelwell/5512038526/">coffee</a>.</p>
<p>On the tenth day of ChrisMash my true love sent to me, ten <a href="http://en.wikipedia.org/wiki/QR_code">QR codes</a>, nine <a href="http://en.wikipedia.org/wiki/SPARQL">SPARQL queries</a>, eight <a href="http://www.flickr.com/photos/davepattern/3064584142/">mashers mashing</a>, seven <a href="http://www.flickr.com/photos/smilylibrarian/4852718720/">homemade badgers</a>, six <a href="http://en.wikipedia.org/wiki/RSS">RSS feeds</a>, five <a href="http://pipes.yahoo.com/pipes/">Yahoo Pipes</a>. Four <a href="http://en.wikipedia.org/wiki/Remote_procedure_call">RPCs</a>, three <a href="http://maps.google.co.uk/">Google Maps</a>, two <a href="http://en.wikipedia.org/wiki/Application_programming_interface">APIs</a> and some <a href="http://www.flickr.com/photos/davepattern/4147602466/">cake</a> and some <a href="http://www.flickr.com/photos/benelwell/5512038526/">coffee</a>.</p>
<p>On the eleventh day of ChrisMash my true love sent to me, eleven <a href="http://www.loc.gov/marc/bibliographic/">MARC records</a>, ten <a href="http://en.wikipedia.org/wiki/QR_code">QR codes</a>, nine <a href="http://en.wikipedia.org/wiki/SPARQL">SPARQL queries</a>, eight <a href="http://www.flickr.com/photos/davepattern/3064584142/">mashers mashing</a>, seven <a href="http://www.flickr.com/photos/smilylibrarian/4852718720/">homemade badgers</a>, six <a href="http://en.wikipedia.org/wiki/RSS">RSS feeds</a>, five <a href="http://pipes.yahoo.com/pipes/">Yahoo Pipes</a>. Four <a href="http://en.wikipedia.org/wiki/Remote_procedure_call">RPCs</a>, three <a href="http://maps.google.co.uk/">Google Maps</a>, two <a href="http://en.wikipedia.org/wiki/Application_programming_interface">APIs</a> and some <a href="http://www.flickr.com/photos/davepattern/4147602466/">cake</a> and some <a href="http://www.flickr.com/photos/benelwell/5512038526/">coffee</a>.</p>
<p>On the twelfth day of ChrisMash my true love sent to me, twelve <a href="http://www.flickr.com/photos/davepattern/3703570181">fancy cocktails</a>, eleven <a href="http://www.loc.gov/marc/bibliographic/">MARC records</a>, ten <a href="http://en.wikipedia.org/wiki/QR_code">QR codes</a>, nine <a href="http://en.wikipedia.org/wiki/SPARQL">SPARQL queries</a>, eight <a href="http://www.flickr.com/photos/davepattern/3064584142/">mashers mashing</a>, seven <a href="http://www.flickr.com/photos/smilylibrarian/4852718720/">homemade badgers</a>, six <a href="http://en.wikipedia.org/wiki/RSS">RSS feeds</a>, five <a href="http://pipes.yahoo.com/pipes/">Yahoo Pipes</a>. Four <a href="http://en.wikipedia.org/wiki/Remote_procedure_call">RPCs</a>, three <a href="http://maps.google.co.uk/">Google Maps</a>, two <a href="http://en.wikipedia.org/wiki/Application_programming_interface">APIs</a> and some <a href="http://www.flickr.com/photos/davepattern/4147602466/">cake</a> and some <a href="http://www.flickr.com/photos/benelwell/5512038526/">coffee</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.meanboyfriend.com/overdue_ideas/2011/12/the-twelve-days-of-chrismash/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>JISC Mobile Infrastructure programme</title>
		<link>http://www.meanboyfriend.com/overdue_ideas/2011/11/jisc-mobile-infrastructure-programme/</link>
		<comments>http://www.meanboyfriend.com/overdue_ideas/2011/11/jisc-mobile-infrastructure-programme/#comments</comments>
		<pubDate>Fri, 18 Nov 2011 11:15:27 +0000</pubDate>
		<dc:creator>ostephens</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[mlibs]]></category>

		<guid isPermaLink="false">http://www.meanboyfriend.com/overdue_ideas/2011/11/jisc-mobile-infrastructure-programme/</guid>
		<description><![CDATA[Today I’m at the programme kick-off meeting for the JISC Mobile Infrastructure. There are 5+1 projects funded in this strand, as detailed by the programme manager, Ben Showers, in this blog post. I’m working with Evidence Base on the ‘+1’ project which is a support project looking to establish a ‘mobile library community’ and ways [...]]]></description>
			<content:encoded><![CDATA[<p>Today I’m at the programme kick-off meeting for the JISC Mobile Infrastructure. There are 5+1 projects funded in this strand, as detailed by the programme manager, Ben Showers, in <a href="http://infteam.jiscinvolve.org/wp/2011/10/11/mobile-infrastructure-for-libraries-new-projects/">this blog post</a>.</p>
<p>I’m working with <a href="http://www.ebase.bcu.ac.uk/">Evidence Base</a> on the ‘+1’ project which is a support project looking to establish a ‘mobile library community’ and ways of supporting projects/libraries/people working in this area. The first step on this path is the m-libraries support website <a href="http://m-libraries.info">http://m-libraries.info</a> where <a href="http://twitter.com/joeyanne">@joeyanne</a> has posted some introductory material outlining what we mean by ‘mobile libraries’ in this context (this is about use of mobile devices by and for library services, rather than about ‘collections on wheels’).</p>
<p>Today is an opportunity to hear about some of the other projects and meet the people involved. However, the aim of our support project is not just to support the current projects, but to start a platform for a growing community.</p>
<p>Some quick introductions to the projects:</p>
<p><strong>Phonebooth</strong><br />
Twitter: <a href="http://twitter.com/jiscphonebooth">@jiscphonebooth</a><br />
Lead: London School of Economics<br />
Partners: Edina<br />
Summary (from <a href="http://infteam.jiscinvolve.org/wp/2011/10/11/mobile-infrastructure-for-libraries-new-projects/">http://infteam.jiscinvolve.org/wp/2011/10/11/mobile-infrastructure-for-libraries-new-projects/</a>): PhoneBooth will repurpose the Charles Booth Maps, Descriptive of London Poverty and selected police notebooks, which record eye-witness descriptions of London street-by-street, for delivery to mobile devices. The project will enhance the current online delivery by enabling content to be delivered directly to the location to which it refers.</p>
<p>Introducing this is Ed Fay (<a href="http://twitter.com/digitalfay">@digitalfay</a>). Existing online resource &#8211; the Charles Booth maps (<a href="http://booth.lse.ac.uk/">http://booth.lse.ac.uk/</a>) which includes maps, classification and notebooks, at a street by street level. They will keep the backend infrastructure, but put a new mobile client interface on it.</p>
<p>Accessing data at a street level is something that already occurs in the teaching of a specific course &#8211; but lots of paper based use at the moment. The mobile delivery fits really well into this teaching. Also expect interest from schools, genealogists etc. Also talked to staff at the Museum of London (who hold some of the Booth maps)</p>
<p>Ed stressed that focus is delivery of library content on mobile &#8211; not delivering a ‘teaching app’</p>
<p>Going to be an open web app &#8211; more sustainable</p>
<p>‘Support of new mobile devices’ is written into new LSE library strategy.</p>
<p><strong>M-Biblio</strong><br />
Hashtag: <a href="http://twitter.com/#!/search/realtime/#mbiblio">#mbiblio</a><br />
Lead: University of Bristol<br />
Summary (from <a href="http://infteam.jiscinvolve.org/wp/2011/10/11/mobile-infrastructure-for-libraries-new-projects/">http://infteam.jiscinvolve.org/wp/2011/10/11/mobile-infrastructure-for-libraries-new-projects/</a>): The project will enhance the learning and research activities of the University of Bristol’s academic community by developing a mobile application that can record and organise references to books, journals and other resources. These references can be added actively by scanning barcodes and QR codes, or passively by automatically recording RFID tags in items being used for study and research.</p>
<p>Mike Jones (<a href="http://www.bris.ac.uk/ilrt/people/mike-a-jones/overview.html">http://www.bris.ac.uk/ilrt/people/mike-a-jones/overview.html</a>) introducing M-Biblio. Want to develop a mobile application &#8211; and with permission collect user activity data</p>
<p>Hope that the library gets useful data &#8211; for resources that might not usually be borrowed &#8211; like journals, theses, and other “reference only” resources<br />
The staff and students get useful tool<br />
Can trail ‘near field communication’ (NFC) capabilities of newer Android phones &#8211; to read RFID tags in books (wonder if this is compatible)<br />
Maybe other technologies &#8230;</p>
<p>Will use a web service as a ‘broker’ which will connect between phone clients and bibliographic sources (e.g. their library catalogue &#8211; Aleph) and stats collection.</p>
<p>Employing two User Experience and User Interface Design experts to help with those aspects, and engaging users &#8211; staff and students &#8211; in the process.</p>
<p><strong>MACON<br />
</strong>Hashtag: <a href="http://twitter.com/#!/search/realtime/#oumacon">#oumacon</a><br />
Lead: The Open University<br />
Partners: EBSCO<br />
Summary (from <a href="http://infteam.jiscinvolve.org/wp/2011/10/11/mobile-infrastructure-for-libraries-new-projects/">http://infteam.jiscinvolve.org/wp/2011/10/11/mobile-infrastructure-for-libraries-new-projects/</a>): MACON will address challenges involved in delivering quality academic content to mobile devices in a seamless and user-friendly manner. The project will work with EBSCO, a major content and systems provider in order to prototype a mobile friendly resource discovery interface which will discover and expose quality academic content from both third party &amp; local collections.</p>
<p>Introduced by Keren Mills from the OU. Aim to create a mobile interface to the EBSCO discovery tool&#8230; &#8211; and interested in whether people use it, and how they want to use it. Do they want to read on the mobile device? Do they want to bookmark or save stuff for later? Probably a mix.</p>
<p>Library users are not necessarily (or usually!) expert searchers &#8211; so need to look at how can take basic queries and still return useful &amp; relevant results.</p>
<p>Already know that authentication can be an issue on mobile devices &#8211; often get bounced around authentication systems via redirects &#8211; and after a certain number of redirects the mobile browser can give up (more quickly than on a desktop). Thinking about ways of storing some local user information (via a bookmarklet?) to shortcut some of this and improve user experience.</p>
<p>Outcomes they are looking for:</p>
<ul style="list-style-type: disc;">
<li>Prototype discovery tool for delivery of academic content to mobile devices</li>
<li>Document detailed user requirements</li>
<li>Report on user feedback and usability, mapped to type of device used</li>
<li>Release the code through a suitable code repository</li>
</ul>
<p>Want to also look at possible delivery of audio-visual content (locally held material)</p>
<p>Want to avoid native apps &#8211; users might well be starting from the web (e.g. via the Open University VLE etc.)</p>
<p>Also want to make sure that if the user clicks on a link to a publisher site, they aren’t suddenly in a non-mobile friendly interface &#8211; that is, use a publisher’s mobile interface where possible.</p>
<p>Two further projects have been funded that aren’t represented here today, both based at City University:</p>
<p><strong>MoPED</strong><br />
Lead: City University, London<br />
Summary (from <a href="http://infteam.jiscinvolve.org/wp/2011/10/11/mobile-infrastructure-for-libraries-new-projects/">http://infteam.jiscinvolve.org/wp/2011/10/11/mobile-infrastructure-for-libraries-new-projects/</a>): The project will develop the MoPED system, which will combine mobile phone interaction with a public display in City University’s Main Library. The aim of the project is to investigate how to encourage the adoption of mobile services through a two-fold strategy: first, a strong, user-centred design process, commencing with an investigation of which mobile services are most likely to be beneficial; second, using an in-situ public display to promote (and assist getting access to) the library’s mobile services and to connect online services to the space of the library itself.</p>
<p><strong>Learnmore</strong><br />
Lead: City University, London<br />
The project will develop the Learnmore Mobile Application using a user-centred design process. Building on the current ‘desktop’ Learnmore content, the interface and content will be tailored to the actual needs of students using mobile devices, with considerations including the preferred media, topic and content size for mobile consumption.</p>
<hr />
<p><strong>Does size matter?</strong><br />
Some discussions of ‘what is mobile’ &#8211; the OU is thinking of devices with smaller than 10” screens &#8211; as with iPads and similar devices the desktop interfaces work OK.</p>
<p>The PhoneBooth app is just not going to be usable below a certain size&#8230;</p>
<p>Suspect that the question of what aspects of a service or device that make it specifically ‘mobile’ is something that will come up again&#8230;</p>
]]></content:encoded>
			<wfw:commentRss>http://www.meanboyfriend.com/overdue_ideas/2011/11/jisc-mobile-infrastructure-programme/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Openly Connect</title>
		<link>http://www.meanboyfriend.com/overdue_ideas/2011/11/openly-connect/</link>
		<comments>http://www.meanboyfriend.com/overdue_ideas/2011/11/openly-connect/#comments</comments>
		<pubDate>Wed, 16 Nov 2011 16:26:41 +0000</pubDate>
		<dc:creator>ostephens</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://www.meanboyfriend.com/overdue_ideas/?p=1339</guid>
		<description><![CDATA[&#8220;Openly Connect&#8221; was the title of a talk I gave at Internet Library International 2011 (tipping my hat slightly to Only Connect, the BBC4 quiz show). I&#8217;ve been wondering about the best way of sharing the presentation online, and decided that really blogging the ideas is much more useful than just dumping the slides somewhere. [...]]]></description>
			<content:encoded><![CDATA[<p>&#8220;Openly Connect&#8221; was the title of a talk I gave at <a href="http://www.internet-librarian.com/2011/programme.php">Internet Library International 2011</a> (tipping my hat slightly to <a href="http://www.bbc.co.uk/programmes/b00lskhg">Only Connect, the BBC4 quiz show</a>). I&#8217;ve been wondering about the best way of sharing the presentation online, and decided that really blogging the ideas is much more useful than just dumping the slides somewhere.</p>
<p>I believe that libraries, museums and archives are not getting the most out of the data about their collections, because they aren&#8217;t publishing in ways that enable or encourage others to take the data and use it in new, innovative (or even boring), ways. I think we need to offer data more &#8216;openly&#8217;.</p>
<h2>Being open</h2>
<p><a title="More mixed messages by withassociates, on Flickr" href="http://www.flickr.com/photos/withassociates/2987550732/"><img src="http://farm4.static.flickr.com/3045/2987550732_6e77d4b739.jpg" alt="More mixed messages" width="500" height="375" /></a><br />
(Image courtesy of <a href="http://www.flickr.com/photos/withassociates/">withassociates</a>, <a href="http://creativecommons.org/licenses/by-sa/2.0/deed.en_GB">CC-BY-SA</a>)</p>
<p>But what does &#8216;open&#8217; mean in this context? For me, this is not a simple binary open/closed&#8230; but rather a continuum. There are a range of factors that affect whether others can easily take, and reuse, your data. But it is easy to focus on a single factor when talking about &#8216;openess&#8217; &#8211; <a href="http://www.opendefinition.org/">especially to focus on &#8216;rights&#8217; to reuse data</a> &#8211; copyright, database rights, licensing, terms and conditions etc. While these are an important factor, they are not the only factor.</p>
<p><a href="http://blog.paulwalk.net/">Paul Walk</a> puts it better than me <a href="http://www.slideshare.net/paulwalk/metadata-aggregation-services">in this slidedeck</a> when he argues we need a ’richer understanding of openness’ which encompasses not just permissive licensing but, more broadly, the ease with which data can be used, taking into consideration aspects such as format and access mechanisms</p>
<h2>Friction</h2>
<p>I&#8217;ve started to think about factors affecting reuse as being causes of friction (an idea I&#8217;m pretty sure I got from <a href="http://ouseful.info">Tony Hirst</a>). This may not be an exhaustive list, but the things I can see that create friction in the reuse of data are:</p>
<ul>
<li>Explicit restrictions on reuse</li>
<li>Uncertainty about possible restrictions on reuse</li>
<li>Unusual or unfamiliar interfaces and formats (if you don&#8217;t work in the library world, you&#8217;ve probably never heard of Z39.50, and yet this is a standard machine to machine interface supported by many library systems)</li>
<li>Lack of information on data and where the data is available</li>
</ul>
<p>Sometimes you might deliberately introduce friction &#8211; perhaps you don&#8217;t want your data to be reused by just anyone, for any purpose. I don&#8217;t see friction as bad per se &#8211; we just need to be aware of it, and especially avoid introducing friction when we don&#8217;t mean to.</p>
<h2>Oiling the wheels</h2>
<p>There are clear steps that a library, archive or museum can take to ensure there is no unwanted &#8216;friction&#8217; in the reuse of their data.</p>
<h3>1. Apply clear licensing or terms on reuse.</h3>
<p>As a signatory of the <a href="http://discovery.ac.uk/businesscase/principles/">Discovery Open Metadata Principles</a>, I believe descriptive metadata, such as that in library catalogue records, should be licensed as &#8216;public domain&#8217; data (using <a href="http://creativecommons.org/choose/zero/">CC0</a> or <a href="http://opendatacommons.org/licenses/pddl/1-0/">ODC-PDDL</a> or equivalent).</p>
<p>However, if reuse is restricted for some reason, be clear about what those restrictions are. Commercial services like<a href="https://dev.twitter.com/terms/api-terms"> Twitter offer clear terms of use on their APIs </a>- these are restrictive, but clear. Similary <a href="http://www.wired.com/rawfile/2011/11/creative-commons/">Wired magazine&#8217;s recent decision to offer images under Creative Commons BY-NC</a>, while falling short of &#8216;open&#8217; offers some level of clarity. In the latter case, the use of the &#8216;NC&#8217; (Non-commercial) clause can lead to uncertainty about rights for reuse &#8211; as noted in <a href="http://www.niemanlab.org/2011/11/wired-releases-images-via-creative-commons-but-reopens-a-debate-on-what-noncommercial-means/">this article</a>.</p>
<p>The <a href="http://obd.jisc.ac.uk">JISC Guide to Open Bibliographic Data</a> might help inform decisions about licensing metadata, as may the <a href="http://discovery.ac.uk/files/pdf/Licensing_Open_Data_A_Practical_Guide.pdf">Discovery licensing guide</a>.</p>
<h3>2. Adopt widely used (machine) interfaces and formats for data</h3>
<p>While any access to machine readable data increases the opportunities for reuse, adopting widely used interfaces and formats &#8211; ones for which a wide range of code libraries and tools will be available, and which the development community will be familiar with. Currently this often boils down to offering an interface that delivers data in <a href="http://www.w3schools.com/xml/xml_whatis.asp">XML</a> or <a href="http://www.json.org/">JSON</a> format over an <a href="http://en.wikipedia.org/wiki/Hypertext_Transfer_Protocol">http</a> interface. Sometimes the term &#8216;RESTful API&#8217; is used to describe this kind of interface, although it should be noted that in reality providing a RESTful interface is a bit more than just xml/json over http. <a href="http://kellabyte.com/2011/09/04/clarifying-rest/">This article tries to explain more specifically what REST is</a>.</p>
<h3>3. Document your APIs and your data</h3>
<p>Whatever interfaces/APIs and data formats you support, leaving them undocumented immediately increases friction on reuse. Many of the systems libraries, museums and archives use provide some API, but these are very rarely clearly documented by the organisations using the systems. Without documentation, it&#8217;s a huge amount of work for a developer to work out how to interface with the system.</p>
<p>For example, my local public library uses the <a href="http://www.serialssolutions.com/discovery/aquabrowser/">Aquabrowser interface</a> to their catalogue, which supports a couple of APIs &#8211; but in order to use these I had to find out the details of the API from the <a href="http://www.lib.cam.ac.uk/api/">University of Cambridge documentation</a>, and then apply the details to the public library system. Even just pointing to documentation held elsewhere helps &#8211; and sends the message &#8216;we want you to use this API&#8217; &#8211; and without this, the API will be left unused.</p>
<p>The data we deal with in libraries, museums and archives is specialist, and often confusing to those not familiar with the details &#8211; therefore not just documenting the APIs available, but also the data available via those APIs (this is also a reason to offer simple representations of data, as well as fuller, more complex, expressions as appropriate).</p>
<p>Finally, data needs to be &#8216;findable&#8217; &#8211; how would a prospective user of your data know what data you have, and where to find an API for it? In Australia the <a href="http://museumex.org/">Museum Metadata Exchange</a> is an interesting model for making this information available, but there are also more general tools/sites like like <a href="http://thedatahub.org/">http://thedatahub.org/</a> and <a href="http://getthedata.org/">http://getthedata.org/</a>.</p>
<h3>4. Use common identifiers</h3>
<p>This probably seems less fundamental than the points above,  for me it is absolutely key. The point here is that if anyone wants to combine data together, common identifiers across data sets are what they will be looking for &#8211; and I&#8217;d argue this is going to be a pretty common use case for your data, or anyone elses, by a third party developer.</p>
<p>While it is possible to write code that tries to match strings like &#8220;Austen, Jane&#8221; in your data to http://viaf.org/viaf/102333412/, this is much more effort and much less precise than if a shared identifier was used from the start. It&#8217;s no surprise that if you look at many mashups created using bibliographic data they rely on the ISBN to match across different data sources (for example, pulling in cover images from Amazon, LibraryThing, Google Books or Open Library).</p>
<h2>Supporting Discovery</h2>
<p>Much of my thinking in this area has been informed by my work with the &#8216;<a href="http://rdtf.jiscinvolve.org/wp/">Resource Discovery Taskforce</a>&#8216; and with the <a href="http://discovery.ac.uk/">Discovery initiative</a>that followed the work of the taskforce. Discovery is an initiative to improve resource discovery by establishing a clear set of principles and practices for the publication and aggregation of <strong>open, reusable, metadata. </strong>So far Discovery has published a set of <a href="http://discovery.ac.uk/businesscase/principles/">Open Metadata Principles</a>, and a set of <a href="http://blog.discovery.ac.uk/2011/11/07/introducing-the-discovery-technical-principles/">draft Technical Principles</a>, as well as running several <a href="http://discovery.ac.uk/news/">events</a> and a <a href="http://discovery.ac.uk/developers/competition/">developer competition</a>.</p>
<p>There will be a lot more coming out of the Discovery initiative over the next few months, and you can follow these via the <a href="http://blog.discovery.ac.uk/">Discovery Blog</a> (which I occaisionally write for).</p>
<h2>Outcomes of Open</h2>
<h3>Examples</h3>
<p><a href="http://rufuspollock.org/">Rufus Pollock</a>, the Director of the <a href="http://okfn.org/">Open Knowledge Foundation</a>, said &#8220;The coolest thing to do with your data will be thought of by someone else&#8221; &#8211; but is this true? Perhaps obviously, it isn&#8217;t a given that anything will happen when you publish your data for reuse. However, there are now plenty of examples of interesting applications being built on data that has been published with reuse in mind. To just pick a few examples:</p>
<p><a href="http://www.meanboyfriend.com/overdue_ideas/wp-content/uploads/2011/11/cam-app.png"><img class="alignnone size-large wp-image-1348" title="cam-app" src="http://www.meanboyfriend.com/overdue_ideas/wp-content/uploads/2011/11/cam-app-1024x758.png" alt="" width="614" height="455" /></a></p>
<p>This <a href="http://itunes.apple.com/us/app/ucam-library-search/id459882806?mt=8">iPhone app to search Cambridge University Library</a> was developed by a postgraduate student &#8211; just because they wanted to learn how to develop an app using JSON, and found the API documentation published by the library.</p>
<p><a href="http://www.meanboyfriend.com/overdue_ideas/wp-content/uploads/2011/11/europeana-hack.png"><img class="alignnone size-large wp-image-1349" title="europeana-hack" src="http://www.meanboyfriend.com/overdue_ideas/wp-content/uploads/2011/11/europeana-hack-1024x605.png" alt="" width="614" height="363" /></a></p>
<p>This app allows the user to take a picture of a work of art using their smartphone, and then retrieves information about the item from Europeana &#8211; it was built as part of a <a href="http://www.version1.europeana.eu/web/api/hack4europe">&#8216;hackday&#8217; for Europeana</a>.</p>
<p><a href="http://www.meanboyfriend.com/overdue_ideas/wp-content/uploads/2011/11/timeline.png"><img class="alignnone size-large wp-image-1350" title="timeline" src="http://www.meanboyfriend.com/overdue_ideas/wp-content/uploads/2011/11/timeline-1024x697.png" alt="" width="614" height="418" /></a></p>
<p>This <a href="http://www.3zanders.co.uk/Timeline/">novel interface to pictures from the National Archive</a> was built as part of the Discovery Developer competition.</p>
<p><a href="http://www.meanboyfriend.com/overdue_ideas/wp-content/uploads/2011/11/whats-about.png"><img class="alignnone size-large wp-image-1351" title="whats-about" src="http://www.meanboyfriend.com/overdue_ideas/wp-content/uploads/2011/11/whats-about-1024x860.png" alt="" width="614" height="516" /></a></p>
<p>This <a href="http://whatsabout.lboro.ac.uk/whatsAbout/">map brings together information from English Heritage and the British National Bibliography to display location specific information</a>.</p>
<p>&#8230; and finally to blow my own trumpet, this <a href="http://www.meanboyfriend.com/overdue_ideas/2011/07/compose-yourself/">bookmarklet I&#8217;ve already written about</a></p>
<p><a href="http://www.meanboyfriend.com/overdue_ideas/wp-content/uploads/2011/07/composed-eg.jpg"><img class="alignnone size-large wp-image-1257" title="composed-eg" src="http://www.meanboyfriend.com/overdue_ideas/wp-content/uploads/2011/07/composed-eg-1024x612.jpg" alt="" width="614" height="367" /></a></p>
<h3>Supporting developments</h3>
<p>Something I don&#8217;t feel I really understand yet is how data suppliers can best engage with developers who might build on their data. Emma Mulqueeny (<a href="http://twitter.com/hubmum">@hubmum</a>) has <a href="http://mulqueeny.wordpress.com/2010/11/18/developers/">written eloquently about engaging developers</a>, but I&#8217;m still not sure I fully understand the best way that an organisation such as a museum, library or archive can engage with the development community.</p>
<p>Except the Cambridge University Library iPhone app, all the examples above are the results of some explicit stimulus &#8211; a competition or hackday. I don&#8217;t think any of them can be described as &#8216;production level&#8217; &#8211; they are, in general, proof of concept. If publishing data is going to result in sustainable developments, we need to consider how this is supported &#8211; should organisations &#8216;adopt&#8217; applications or developers? Should they work with relevant organisations to realise some commercial benefit to the developers? Are there other approaches?</p>
<p>I&#8217;d say at the least provide somewhere for developers, and potential developers, to talk to you, ask you questions, get permission to try stuff out &#8211; that dialogue is at least the first step to something more sustainable.</p>
<h2>Take action</h2>
<p>After my presentation at ILI 2011, which covered much of the same ground as this blog post, I felt that perhaps I&#8217;d missed a key point, and an opportunity while I had an audience &#8211; the question of what they should do in light of what I was saying. So, not wanting to make the same mistake again, I would encourage, even exhort, you to take the following actions:</p>
<ol>
<li>Explicitly license your data &#8211; whatever it is, put a license on it, be clear about what people can or can&#8217;t do with the data, and publish those details on your website</li>
<li>Find out about, and document, any APIs you already have to your data &#8211; it might be z39.50, it might be SRU/SRW, it might be some RSS feeds &#8211; whatever it is, write a short page that says where the API/data can be accessed, some basic instructions on how to use it. Be clear what you expect from people interacting with your data (both in terms of licensing &#8211; point 1 &#8211; and anything else like &#8220;please don&#8217;t kill our servers&#8221;)</li>
<li>Create a place for developers to communicate with you (or hang out somewhere that you can communicate them)</li>
</ol>
<p>If you can&#8217;t do any of these things yourself, find out who can answer the questions, or make this happen &#8211; find out if they are interested, and if not, why not and what the barriers are (and then let me know!)</p>
]]></content:encoded>
			<wfw:commentRss>http://www.meanboyfriend.com/overdue_ideas/2011/11/openly-connect/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Overcoming information overload</title>
		<link>http://www.meanboyfriend.com/overdue_ideas/2011/10/overcoming-information-overload/</link>
		<comments>http://www.meanboyfriend.com/overdue_ideas/2011/10/overcoming-information-overload/#comments</comments>
		<pubDate>Fri, 28 Oct 2011 07:53:00 +0000</pubDate>
		<dc:creator>ostephens</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://www.meanboyfriend.com/overdue_ideas/2011/10/overcoming-information-overload/</guid>
		<description><![CDATA[The keynote this morning from Kevin Anderson (@kevglobal) and Suw Charman-Anderson (@suw) &#8211; journalists and technologists (http://charman-anderson.com/). Kevin kicks off: Journalists and librarians dealing with many of the same issues &#8211; helping people navigate, interpret and understand information. Going to talk about some of the challenges in this area. First playing Xerox video on ‘information [...]]]></description>
			<content:encoded><![CDATA[<p>The keynote this morning from Kevin Anderson (<a href="http://twitter.com/kevglobal">@kevglobal</a>) and Suw Charman-Anderson (<a href="http://twitter.com/suw">@suw</a>) &#8211; journalists and technologists (<a href="http://charman-anderson.com/">http://charman-anderson.com/</a>).</p>
<p>Kevin kicks off: Journalists and librarians dealing with many of the same issues &#8211; helping people navigate, interpret and understand information. Going to talk about some of the challenges in this area. First playing Xerox video on ‘information overload’ &#8211; <a href="http://www.youtube.com/watch?v=CXFEBbPIEOI">http://www.youtube.com/watch?v=CXFEBbPIEOI</a></p>
<p>Eric Schmidt noted that we are now creating huge amounts of information (5 exabytes every 2 days is the quote, but see disagreement with this figure at <a href="http://www.readwriteweb.com/cloud/2011/02/are-we-really-creating-as-much.php">http://www.readwriteweb.com/cloud/2011/02/are-we-really-creating-as-much.php</a>)</p>
<p>Amount of time people spend on Facebook massively more than they spend on Newspaper web sites. Evidence that people are having problems moving to conclusions on complex stories &#8211; people move to simple narratives instead &#8211; Kevins says this equals “car crashes and celebreties”</p>
<p>Social media offers opportunity to re-engage people and help them navigate information.</p>
<p>We are moving from “mass” to “relevance” &#8211; e.g. not about how many followers you have on twitter, but about the relevance of what you post. Try to move from information overload (a ‘mass’ problem) and have filtered relevant information (a ‘relevance’ solution)</p>
<p>Social media provides a way of filtering information. But social media has to be ‘social’ &#8211; you need people at the heart of this.</p>
<p>Examples of crowdsourcing &#8211; Guardian analysis of MP expenses (<a href="http://mps-expenses.guardian.co.uk/">http://mps-expenses.guardian.co.uk/</a>), Ushahidi crowdsourcing crisis information (<a href="http://www.ushahidi.com/">http://www.ushahidi.com/</a>).</p>
<p>Kevin also mentions ‘entity extraction’ &#8211; uses Calais as an example..<br />
Dewey D. &#8211; iphone app to manage ‘reading list’ (not in academic sense) and pulls in stories from the New York Times.</p>
<p>Poligraft &#8211; analyses funding of politicial campaigns &#8211; you can post URLs (of political stories) to Poligraft &#8211; it goes through and identifies politicians and organisations and shows you how politicians get campaign funding etc. Tells you about the major industries funding politicians etc &#8211; gives context to political story and help make sense of it.</p>
<p>We (journalists &amp; librarians) have hundreds of years of doing things in a certain way &#8211; changing culture is incredibly difficult. If you have more than 5 people in the room, inertia hits &#8230;</p>
<p>Now Suw taking the floor&#8230; to talk crowdsourcing &#8211; breaking large tasks into smaller chunks that individuals can do. Suitable tasks &#8211; computational tasks and ‘human’ tasks.</p>
<p>Computational tasks = large datasets of computation that can be split into smaller datasets or computations &#8211; e.g. SETI@Home &#8211; this is about ‘spare cycles’ from individual’s computers they can contribute to computing power.</p>
<p>Human tasks = tasks that humans find easy but computers find difficult; brain driven; uses participants spare time; individual errors are average away by having the same task completed by many people.</p>
<p>Type of human tasks:</p>
<ul style="list-style-type: disc;">
<li>Recognising and describing things in images</li>
<li>Reading and transcribing writing</li>
<li>Applying expertise to identify, sort and catalogue</li>
<li>Collecting data</li>
<li>Manipulating models</li>
</ul>
<p>Examples &#8230;</p>
<p><strong>PCF oil paintings tagger &#8211; <a href="http://tagger.thepcf.org.uk/">http://tagger.thepcf.org.uk/</a></strong></p>
<ul style="list-style-type: disc;">
<li>Public catalogue foundation, BBC</li>
<li>Digitising pictures</li>
<li>Getting people to tag content with metadata &#8211; describe what is in the painting</li>
</ul>
<p>“You don’t have to be an expert to take part”</p>
<p><strong>Old Weather &#8211; <a href="http://www.oldweather.org/">http://www.oldweather.org/</a></strong><br />
Transcribing ships logs &#8211; contributes to historic data on climate, as well as other historical background</p>
<p><strong>Ancient Lives &#8211; <a href="http://ancientlives.org/">http://ancientlives.org/</a></strong><br />
Papyrus fragments &#8211; transcribe, measure, etc.</p>
<p>Multiple people doing each task gives you confidence when agreement across results</p>
<p><strong>Herbaria@Home &#8211; <a href="http://herbariaunited.org/atHome/">http://herbariaunited.org/atHome/</a></strong></p>
<p>What’s the Score &#8211; <a href="http://www.bodleian.ox.ac.uk/bodley/library/specialcollections/projects/whats-the-score">http://www.bodleian.ox.ac.uk/bodley/library/specialcollections/projects/whats-the-score</a><br />
Digitised musical score collection from the Bodleian &#8211; will be starting crowdsourcing part of project soon</p>
<p><strong>Why crowdsource?<br />
</strong>Provide opportunities for education and knowledge maintenance<br />
Most projects don’t require prior knowledge but people often enjoy learning more about a subject<br />
Improve accessibility through addition of new metadata or improvement of existing metadata &#8211; create data for research<br />
Even when digitised, collections are hard to search/comprehend</p>
<p>Galaxy Zoo shows public were as good, or better, than professionals at classifying galaxies<br />
FoldIt found gamers could solve the structure of a protein that causes AIDs in rhesus monkeys in three weeks</p>
<p>Are your projects suitable?</p>
<ul style="list-style-type: disc;">
<li>Can the original material be digitised?</li>
<li>Can task be broken down into small chunks?</li>
<li>Can those chunks be done by humans or their computers?</li>
</ul>
<p>It also helps if&#8230;</p>
<ul style="list-style-type: disc;">
<li>There is a benefit for the public &#8211; example of Google buying out a image tagging game, which then died</li>
<li>People feel part of a community</li>
<li>There are measurable goals and targets</li>
</ul>
<p>Zooniverse are crowdsourcing gurus..<br />
Citizen Science Alliance &#8211; “Science” doesn’t just mean science &#8211; looking for projects at the moment&#8230;<br />
Events &#8211; e.g. Citizen Cyberscience Summit</p>
<p>Q &amp; A:<br />
Failure of crowdsourcing &#8211; NASA mapping craters on Mars &#8211; mid 80s. But failed to collect data in useful way.<br />
In terms of issues around the data<br />
Wikitorial &#8211; not enough community &#8211; hurdles to participation not a bad thing</p>
]]></content:encoded>
			<wfw:commentRss>http://www.meanboyfriend.com/overdue_ideas/2011/10/overcoming-information-overload/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>

<!-- Dynamic page generated in 0.985 seconds. -->
<!-- Cached page generated by WP-Super-Cache on 2012-05-17 19:44:36 -->

