Some slightly sketchy notes of Paul Walk’s talk
Paul says: the real challenges are around:
Business case
IPR
etc.
Technical issues not trivial, but insignificant compared to other challenges
We aren’t building a system here – but thinking about an environment … although probably will need to build systems on top of this at some point
‘The purple triangle of interoperability’!:
- Shared Principles
 - Technical Standards
 - Community/Domain Conventions
 
Standards are not the whole story
- Use (open) technical standards
 - Require standards only where necessary
 - Avoid pushing standards to create adoption
 - Establish and understand high-level principles and ‘explain the working out’ – support deeper understanding
 
Paul suggests some ‘safe bets’ in terms of approaches/principles:
- Use Resource Oriented Architecture
 - identify persistently – global and public identities to your high-order entities (e.g. metadata records, actual resources)
- URLs (or http URIs) is a sensible default for us (although not the only game in town)
 
 - use HTTP and REST
 
Aggregation is a corner-stone of RDTF vision – so make your resources a target for aggregation:
- use persistent identifiers for everything
 - adopt appropriate licensing
 - ‘Share alike’ maybe easier than ‘attribution’
 
Paul still a little sceptical of ‘Linked Data’ – it’s been the future for a long time. Tools for Linked Data still not good enough – can be real challenge for developers. However, we should be a
Quote Tom Coates: “Build for normal users, developers and machines” – and if possible, build the same interface for all three [hint, a simple dump of RDF data isn’t going to achieve this!]
Expect and enable users to filter – give them ‘feeds’ (e.g. RSS/Atom) – concentrate on making your resources available
Paul sees slight risk we embrace ‘open at the expense of ‘usability’ – being open is an important first step – but need to invest in making things useful and usable
Developer friendly formats:
- XML has a lot going for it, but also quite a few issues
- well understood
 - lots of tools available
 - validation is a pain
 - very verbose
 - not everything is a tree
 
 - JSON has gained rapid adoption
- less verbose – simple
 - enables client side manipulation
 
 
Character encodings – huge number of XML records from UK IRs are invalid do to character encoding issues
Technical Foundations:
- Work going on now – Will be a website ETA June 2011
 - JISC Observatory will gather evidence of ‘good use’ of technical standards etc
 - Need to understand federated aggregation better
 
Questions for data providers:Do you want to provide a data service, or just provide data?
One thought on “Opening Data – Opening Doors: Technical Standards”