Opening Data – Opening Doors: Technical Standards

Some slightly sketchy notes of Paul Walk’s talk

Paul says: the real challenges are around:
Business case
IPR
etc.

Technical issues not trivial, but insignificant compared to other challenges

We aren’t building a system here – but thinking about an environment … although probably will need to build systems on top of this at some point

‘The purple triangle of interoperability’!:

  • Shared Principles
  • Technical Standards
  • Community/Domain Conventions

Standards are not the whole story

  • Use (open) technical standards
  • Require standards only where necessary
  • Avoid pushing standards to create adoption
  • Establish and understand high-level principles and ‘explain the working out’ – support deeper understanding

Paul suggests some ‘safe bets’ in terms of approaches/principles:

  • Use Resource Oriented Architecture
  • identify persistently – global and public identities to your high-order entities (e.g. metadata records, actual resources)
    • URLs (or http URIs) is a sensible default for us (although not the only game in town)
  • use HTTP and REST

Aggregation is a corner-stone of RDTF vision – so make your resources a target for aggregation:

  • use persistent identifiers for everything
  • adopt appropriate licensing
  • ‘Share alike’ maybe easier than ‘attribution’

Paul still a little sceptical of ‘Linked Data’ – it’s been the future for a long time. Tools for Linked Data still not good enough – can be real challenge for developers. However, we should be a
Quote Tom Coates: “Build for normal users, developers and machines” – and if possible, build the same interface for all three [hint, a simple dump of RDF data isn’t going to achieve this!]

Expect and enable users to filter – give them ‘feeds’ (e.g. RSS/Atom) – concentrate on making your resources available

Paul sees slight risk we embrace ‘open at the expense of ‘usability’ – being open is an important first step – but need to invest in making things useful and usable

Developer friendly formats:

  • XML has a lot going for it, but also quite a few issues
    • well understood
    • lots of tools available
    • validation is a pain
    • very verbose
    • not everything is a tree
  • JSON has gained rapid adoption
    • less verbose – simple
    • enables client side manipulation

Character encodings – huge number of XML records from UK IRs are invalid do to character encoding issues

Technical Foundations:

  • Work going on now – Will be a website ETA June 2011
  • JISC Observatory will gather evidence of ‘good use’ of technical standards etc
  • Need to understand federated aggregation better

Questions for data providers:Do you want to provide a data service, or just provide data?

One thought on “Opening Data – Opening Doors: Technical Standards

Leave a Reply

Your email address will not be published. Required fields are marked *

This site uses Akismet to reduce spam. Learn how your comment data is processed.