RSS
 

MusicNet URI scheme and Linked Data hosting

19 Jan

MusicNet’s key contribution is the minting of authoritative URIs for musical composers, that link to records for those composers in different scholarly and commercial catalogues and collections. MusicNet claims authority because the alignment across the sources has been performed by scholars in musicology. The alignment tool and the progress to date has been detailed previously. In this post I will overview our methodology for publishing our work, in terms of the decisions made in choosing our URI scheme and how we model the information using RDF in the exposed Linked Data. I will then describe the architecture for generating the linked data, which has been designed to be easily deployed and maintained, so that it can be hosted centrally in perpetuity by a typical higher education computer science department.

URI Scheme

The URI scheme is designed to expose minimal structural information, for example, the URI for Franz Schubert is currently (see below for a volatility note):

http://musicnet.mspace.fm/person/7ca5e11353f11c7d625d9aabb27a6174#id

It is comprised of the domain name (musicnet.mspace.fm), an abstract type (person), an ID taken from the musicSpace hash of the composer (7ca5e11353f11c7d625d9aabb27a6174) and a fragment to differentiate the document from the person (#id).

We have chosen a hash rather than a human-readable label because we want to avoid people using the URI because they think that it refers to a composer when it might refer to a different composer. This is important in this domain because there are a number of composers with the same or similar names. Part of the alignment process has musicologists make this distinction. By forcing people to resolve the URI and check that it is the person they are referring to, we aim to avoid incorrect references being made. In addition it gives us the freedom to alter the canonical label for a composer after we have minted the URI, so that we don’t have a label-based URI with a different label in its metadata.

Domain Name

We intend for the domain name to change soon from one which isn’t explicitly tied to mSpace – this is in place right now for convenience to us. In particular our requirements are a domain that will not cost us anything to re-register in future, will remain in our control (i.e. not get domain parked if someone forgets to renew), and will not dissuade people from using it for any partisan or political reasons. The closest we might reasonably get is musicnet.data.ac.uk, although this is still unconfirmed at this point in time, and we may have to instead use musicnet.soton.ac.uk or musicnet.ecs.soton.ac.uk, which are not preferred, since they might give the impression that the data is a Southampton-centric view of the information, which it is not. For a more in depth discussion of a proposed solutions see our previous posts (data.ac.uk proposal & data.ac.uk revisited)

Ontological Constructs

In addition to the scheme for the URI, we also had to determine the best way to expose the data in terms of the ontological constructs (specifically the class types and predicates) used in the published RDF. We are fortunate that an excellent set of linked data in the musical composer domain already exists, in the form of the BBC /music linked data. For example, the BBC /music site exposes Franz Schubert with the URI:

http://www.bbc.co.uk/music/artists/f91e3a88-24ee-4563-8963-fab73d2765ed#artist

The BBC’s data uses the Music Ontology heavily, as well as other ontologies such as SKOS, Open Vocab and FOAF. Since we are publishing similar data, it makes sense for us to use the same terms and predicates as they do where possible, which is what we have done.

We are still in the process of finalising how we will model the different labels of composers. In the figure below we offer two possible methods, the first is to create a URI for each composer for every catalogue that they are listed in, publishing the label from that catalogue under the new catalogue-based URI, and use owl:sameAs to link it to our canonical MusicNet one. The second method is to “flatten” all labels as simple skos:altLabel links, although this method loses provenance. Currently we do both, and we’ve not finalised whether this is necessary or useful.

 

RDF model for MusicNet alternative labels

RDF model for MusicNet alternative labels

 

 

Content Negotiation & Best Practice

Similarly, we also follow the BBC /music model of using HTTP 303 content negotiation to serve machine-readable RDF and human-readable HTML from the same URI. Specifically, the model we’ve borrowed is to append “.rdf” when forwarding to the RDF view of the data, and to append “.html” when forwarding to the human readable view of the data. This is now implemented, and you can try this out yourself with the above URIs, which you can turn into the following:

http://musicnet.mspace.fm/person/7ca5e11353f11c7d625d9aabb27a6174.rdf
http://musicnet.mspace.fm/person/7ca5e11353f11c7d625d9aabb27a6174.html
http://www.bbc.co.uk/music/artists/f91e3a88-24ee-4563-8963-fab73d2765ed.rdf
http://www.bbc.co.uk/music/artists/f91e3a88-24ee-4563-8963-fab73d2765ed.html

There are several other offerings from the MusicNet site, some of which have been detailed before. First, the MusicNet Codex, which is the human search engine for MusicNet. In addition we have also created a (draft!) VoiD document that describes the MusicNet data set, available here:

http://musicnet.mspace.fm/void#

The perceptive among you will notice that the VoiD document links to an RDF dump of all of the individual linked data files, available here (14MB at time of writing):

http://musicnet.mspace.fm/dump.rdf#

Simple Deployment & Hosting

As noted above, our requirements state that our deployment must be as simple as possible to maintain by typical higher education computer science department web admins. In our bid we stated that we will work with the Southampton ECS Web Team to tweak our solution. As such, in order to keep our deployment simple, we have adopted an architecture where all RDF (including the individual Linked Data files for each composer) are generated once and hosted statically. The content negotiation method (mentioned above) makes serving static RDF files simple and easy to understand by web admins that might not know much about the Semantic Web. Similarly, the VoiD document and RDF dump get generated at the same time. The content negotiation is handled by a simple PHP script and some Apache URL rewriting.

Benefits of Linked Data

One of the benefits of using Linked Data is that we can easily integrate metadata from different sources. One of the ways in which we use this is using the aforementioned BBC /music linked data. Specifically, we enrich our Linked Data offering through the use of MusicBrainz. One of the sources of metadata that we have aligned is musicbrainz, based on a data dump we were given by the LinkedBrainz project team. The BBC also have aligned their data to Musicbrainz, and thus we have been able to automatically cross-reference the composers at the BBC with the composers in MusicNet. Thus, we can link directly to the BBC, which offers a number of benefits. Firstly, it means that users can access BBC content, such as recently radio and television recordings that feature those composers (see the Franz Schubert link above, for examples), but also that we can harvest some of the BBC’s outward links in order to enrich our own Linked Data offering. Specifically, we have harvested links that the BBC make to pages on IMDB, DBPedia, Wikipedia, among others, which we now re-publish.

The data flow from the raw data sources to linked data serving is illustrated in the figure below.

MusicNet Architecture Data Flow Diagram

MusicNet Data Flow Diagram

Future Work

The following tasks remain in this area of the project:

  1. Acquire control of a long-term domain name (preferably musicnet.data.ac.uk, see above).
  2. Discuss our RDF model with experts in Linked Data, Ontological Modelling and Provenance.
  3. Determine if we will offer a SPARQL endpoint in future. If we decide not to ourselves (because it might not be sustainable once our hosting is passed over to the department), it might be desirable to put the data on the Data Incubator SPARQL host.

This post documents Work Package 3 from the MusicNet project deliverables. MusicNet is funded through the JISCEXPO programme.

 
33 Comments

Posted by Daniel Alexander Smith in Documentation, Software

 

Tags: , , , , , , , , , , , , , , , , , , , , , , , ,

Leave a Reply

 

 
  1. Tweets that mention MusicNet URI scheme and Linked Data hosting « MusicNet Blog -- Topsy.com

    January 19, 2011 at 8:10 pm

    [...] This post was mentioned on Twitter by Daniel A Smith, MusicNet Project. MusicNet Project said: MusicNet URI scheme and Linked Data hosting http://bit.ly/fNyVAu #jiscexpo [...]

     
  2. Adrian Stevenson

    February 1, 2011 at 4:45 pm

    Hi Daniel

    Really comprehensive post, thanks. Most of the other JiscEXPO projects have been thinking about the issues of domains and URI patterns as well. There was some discussion on the Sails project blog at http://sailsproject.cerch.kcl.ac.uk/2010/12/where-should-the-data-go/. data.blah.ac.uk seems to have established itself as a convention. On Locah we’re trying to follow the govt’s URI patterns guidance (http://www.cabinetoffice.gov.uk/sites/default/files/resources/designing-URI-sets-uk-public-sector.pdf), i.e. using root/id/ for the ‘thing’ URIs 303 redirected to root/doc/ for the documents about the things which is then content negotiated to /doc.html, /doc.rdf etc as per the guidelines (see http://blogs.ukoln.ac.uk/locah/2010/11/16/identifying-the-things-uri-patterns-for-the-hub-linked-data/ for more in interested).

    Thanks for the info on benefits too. This is all very important for the synthesis.

    Cheers, Adrian

    Adrian Stevenson
    JiscEXPO Synthesis Liaison

     
  3. Progress Update « MusicNet Blog

    March 10, 2011 at 4:35 pm

    [...] One of the most important features we’ve added since January is content-negotiation. This enables our Codex to serve up the most appropriate content dependant on the ‘Accept’ header received in the HTTP request. For a more detailed writup see Dan’s blog post on the MusicNet URI Scheme. [...]

     
  4. Final Product Post: MusicNet & The Alignment Tool « MusicNet Blog

    June 29, 2011 at 9:07 am

    [...] MusicNet URI scheme and Linked Data hosting [...]

     
  5. Mickey

    November 27, 2011 at 7:54 pm

    I literally knew about nearly all of this, but in spite of this, I still thought it had been valuable. Very good job!

     
  6. รับทำเว็บ

    December 29, 2011 at 12:12 pm

    With a good health care

     
  7. Drusilla

    January 21, 2012 at 3:51 am

    Hell if I know.

     
  8. Alvin Morang

    March 5, 2012 at 10:39 am

    This internet site MusicNet URI scheme and Linked Data hosting MusicNet Blog is usually a walk-through for every one of the info anyone wanted relating to this and didn’t recognize who to ask. Glimpse the following, and you’ll surely discover it.

     
    • Sahil

      June 27, 2015 at 1:43 am

      First let me apologize for being craictil. I’m currently benefiting a bit from using Picard and MB and apreciate the work that’s gone into them.However, I have to say that after spending quite a few hours trying to figure out how exactly this all works I’m still puzzled.What I’d like to do:1. Tag a rather large collection of MP3 s. Most are ripped from CDs and therefore pretty well tagged from just using the iTunes lookup feature. What I really want help tagging are the odd things downloads from artist web sites and things from some minor label CDs I’ve collected.2. Help build the MB/Picard system by adding data from things I’ve spent hours tagging manually via lookups on Amazon, AMG, etc.3. Understand the process. I’m not an average web user. I’m pretty technical. I have a degree in MIS, can code C, Perl, Javascript, and am very web savvy. Even so, after looking at the MB site and the seemingly redundant Wiki I can’t get a good picture of HOW THIS ALL WORKS. What is the process?Every explanation I find is full of explanations like the one for CD Stubs: it’s highly technical (XML, RDF, etc.) and has a lot of ifs, buts and other conditional language. (e.g. if you’re logged in you can’t submit a stub)4. I’d like to contribute quickly and easily. I’m not looking for a new committment. I don’t want to join the MB community I already belong to enough volunteer efforts BUT if I could push a button or two and contribute things that I’ve spent hours on for my own benefit I’d happily do so (and I’d love to benefit from others doing the same). I just spent at least 15 minutes trying to find out how to submit PUIDs using the Submit PUIDs button in Picard, only to find out that Picard can’t generate PUIDs along with a whole bunch of very technical explanations of why that is and coming away frustrate by a button that seems useless.If I can suggest some ways to help MB move forward they’d be the following:1. Create a page that describes how the MB ecosystem works, how to A) use it and B) contribute to it (at different levels).2. Get some press. Talk to a technology magazine reporter. They can help you COMMUNICATE TO THE MASSES. Not everyone who cares about music (and tagging it) is an OSS developer.3. Try to keep things simple both in the interfaces and in the documentation.Consider this advice a contribution to the MB efforts. Yes, it may sound craictil, but the reason I bothered is because I think you’re on to something here and have a lot of value you can provide in the future.Best,Croc

       
  9. Dee Joerger

    March 14, 2012 at 5:18 pm

    Don’t simply retire from something; have something to retire to.
    Every couple of seconds it changes Up an eighth, down an eighth -it’s like playing a video slot. I lose $20 million, I gain $20 million.

     
  10. Tyson

    May 27, 2012 at 4:19 am

    hi!,I really like your writing very much! proportion we be in contact more about your article on AOL? I need a specialist on this area to unravel my problem. Maybe that is you! Taking a look ahead to look you.

     
    • Colek

      February 18, 2016 at 12:17 am

      Webservice tries to serve regular resleeas before CDStubs (and don’t return boths)And if you don’t return CDStubs, we can’t act as FreeDB and users won’t have any benefit in submitting CDStubs.I really like this new system. It’s a improvement for both MB editors & users (ability to import CDStubs with DiscID rather than resleeas from FreeDB), and for anonymous user that will have an easy way to submit data.

       
  11. Sherly Colmer

    June 18, 2012 at 10:09 pm

    World News: A list celebrity Tom Truong wants all of the Jesus Christ fans to move and live in the Chicagoland area. Chicago land area is now the kingdom of God on Earth. God’s supreme law: Where ever The King of Kings and Lord of Lords is, that land becomes the devine holy land.

     
  12. Edgar

    July 21, 2012 at 2:52 pm

    Useful information. Lucky me I found your site accidentally, and I am surprised why this accident did not happened in advance! I bookmarked it.

     
  13. Giesela

    July 27, 2012 at 12:50 am

    Bless you for trying to describe the terminlogy for the rookies!

     
  14. Monty

    July 30, 2012 at 9:06 pm

    I have been surfing online more than three hours today, yet I never found any interesting article like yours. It’s pretty worth enough for me. In my opinion, if all web owners and bloggers made good content as you did, the internet will be much more useful than ever before.

     
  15. Jarrod

    August 4, 2012 at 12:13 pm

    Jeder Topf hat einen Deckel.

     
  16. spiritfulness

    August 2, 2013 at 7:40 pm

    Wow, this piece of writing is nice, my sister is analyzing such things, therefore I am going to inform her.

     
  17. bestdatingsitesnow.com

    September 30, 2015 at 10:01 am

    You’re so cool! I don’t suppose I’ve truly read something like that before.
    So wonderful to discover somebody with some genuine thoughts on this topic.
    Really.. thank you for starting this up. This site
    is something that’s needed on the web, someone with a bit of originality!

     
  18. quick way to make money legally augusta ga

    October 1, 2015 at 7:34 pm

    Hi there, just became alert to your blog through Google, and found
    that it’s really informative. I am going to watch out for brussels.
    I’ll appreciate if you continue this in future.
    Numerous people will be benefited from your writing.
    Cheers!

     
  19. Birgit Waltemeyer

    November 5, 2015 at 12:20 pm

    I happen to be commenting to let you know what a brilliant discovery my princess enjoyed reading through your web page. She figured out so many pieces, most notably what it is like to possess a wonderful helping spirit to let many more without hassle learn about a number of complicated subject matter. You undoubtedly surpassed our expected results. Thanks for imparting such important, dependable, informative and easy guidance on your topic to Janet.

     
  20. www.youtube.com

    November 9, 2015 at 4:03 pm

    If you are concerned with buying an older canine by way of breeders, ask them about purchasing a
    retired show dog or in the event that they know of an grownup dog who wants a new home.

     
  21. znajdziesz to na stronie

    November 25, 2015 at 12:05 pm

    Great post moidisys made here. One thing I would really like to say is always that most professional areas consider the Bachelors Degree as the entry level standard for an online course. Whilst Associate Qualifications are a great way to start, completing your current Bachelors opens up many doors to various employment opportunities, there are numerous on-line Bachelor Diploma Programs available coming from institutions like The University of Phoenix, Intercontinental University Online and Kaplan. Another concern is that many brick and mortar institutions give Online variants of their degree programs but generally for a greatly higher amount of money than the corporations that specialize in online degree plans.

     
  22. Egas Grijalva

    December 1, 2015 at 8:49 am

    Gracias

     
  23. Jeric Bas

    December 1, 2015 at 4:43 pm

    The next time I learn a blog, I hope that it doesnt disappoint me as much as this one. I mean, I know it was my choice to learn, but I really thought youd have something attention-grabbing to say. All I hear is a bunch of whining about one thing that you could fix when you werent too busy on the lookout for consideration.:visit our web site: http://jericbas.com

     
  24. Vitaldent Giulia

    December 3, 2015 at 11:13 am

    Grazie per l’articolo, ben fatto saluti

     
  25. Going Here

    December 25, 2015 at 7:42 am

    Great post, and great website. Thanks for the information!

     
  26. lokale do wynajecia

    January 4, 2016 at 4:03 pm

    Thanks for gipelris this glorious article. Yet another thing to mention is that most digital cameras can come equipped with some sort of zoom lens that allows more or less of your scene to generally be included by simply ‘zooming’ in and out. These types of changes in target length tend to be reflected within the viewfinder and on large display screen at the back of your camera.

     
  27. porno

    June 9, 2016 at 10:56 am

     
  28. seo small business package

    July 5, 2016 at 10:56 pm

    Thanks for this excellent article. Also a thing is that many digital cameras come equipped with some sort of zoom lens that allows more or less of that scene to get included by ‘zooming’ in and out. These kinds of changes in focusing length will be reflected while in the viewfinder and on substantial display screen right on the back of this camera.

     
  29. Hershel Lowthorp

    July 7, 2016 at 12:53 pm

    Admiring the hard work you put into your website and detailed information you present. It’s awesome to come across a blog every once in a while that isn’t the same out of date rehashed information. Great read! I’ve saved your site and I’m adding your RSS feeds to my Google account.

     
  30. ElbertIFrist

    August 21, 2016 at 1:23 am

    I have got read countless articles or reviews regarding the blogger lovers however this bit of writing is in fact a pleasant paragraph, ensure
    that is stays up.

     
  31. indian best bodybuilding supplement

    July 9, 2017 at 4:59 pm

    I do not even understand how I ended up right here, however I believed this submit used to be great. I do not understand who you’re but definitely you are going to a famous blogger in the event you aren’t already ;) Cheers!