RSS
 

Posts Tagged ‘dissemination’

Add MusicNet data to COPAC with ‘Composed’ bookmarklet

03 Oct

Here’s a great example of how the MusicNet data can be used to enhance existing sites. The ‘Composed’ bookmarklet decorates an existing COPAC composer record with all the extra information that MusicNet contains about that person.

Head on over to this blog post for more details. Incidentally this was created as an entry to the UK Discovery competition we blogged about earlier in the year.

So, who’s going to turn this into a GreaseMonkey script so that the bookmarklet isn’t need?

 
 

UK Discovery Developer Competition features the MusicNet dataset

11 Jul

The MusicNet dataset has been included as part of the UK Discovery global developer competition. The rules of the competition are simple, build an app/tool that makes use of at least one of the 10 featured datasets.

UK Discovery is working with libraries, archives and museums to open up data about their resources for free re-use and aggregation. DevCSIis working with developers in the education sector, many of who will have innovative ideas about how to exploit this open data in new applications.

This Developer Competition runs throughout July 2011. It starts on Monday 4 July – Independence Day, a good day for liberating data – and closes on Monday 1 August. It’s open to anyone anywhere in the world.

For more information about the competition see http://discovery.ac.uk/developers/competition/. Prizes are available for the best entrants, competition ends Monday 1 August 2011.

 
 

Tweets from Music Linked Data Workshop (#MLDW)

24 May

Here are the archived tweets from the Music Linked Data Workshop we held at JISC London earlier this month. Slides from the event can be found here.

Read the rest of this entry »

 

MLDW Slides

22 May

We are pleased to post below the slides from the presentations at the Music Linked Data Workshop (JISC, London, 12 May 2011). Thank you to our presenters for providing their slides.

 

MLDW Programme & Abstracts

10 May

Music Linked Data Workshop, JISC, London, 12 May 2011

Programme

10:30 – Welcome

Morning Session: Research Papers
Chaired by Richard Polfreman (Music, University of Southampton)

10:35 – MusicNet: Aligning Musicology’s Metadata
David Bretherton, Daniel Alexander Smith, Joe Lambert and mc schraefel (Music, and Electronics and Computer Science, University of Southampton)

11:05 – Towards Web-Scale Analysis of Musical Structure
J. Stephen Downie (Graduate School of Library and Information Science, University of Illinois at Urbana-Champaign), David De Roure (Oxford e-Research Centre, University of Oxford) and Kevin Page (Oxford e-Research Centre, University of Oxford)

11:35 – LinkedBrainz Live
Simon Dixon, Cedric Mesnage and Barry Norton (Centre for Digital Music, Queen Mary University of London)

12:05 – BBC Music – Using the Web as our Content Management System
Nicholas Humfrey (BBC)

12:35 – Lunch

Afternoon Session: Funding & Project Presentations
Chaired by David Bretherton (Music, University of Southampton)

13:30 – JISC Funding Roadmap for 2011-12
David Flanders (JISC)

13:45 – Early Music Online: Opening up the British Library’s 16th-Century Music Books
Sandra Tuppen (British Library)

14:00 – Musonto – A Semantic Search Engine Dedicated to Music and Musicians
Jean-Philippe Fauconnier (Université Catholique de Louvain, Belgium) and Joseph Roumier (CETIC, Belgium)

14:15 – Listening to Movies – Creating a User-Centred Catalogue of Music for Films
Charlie Inskip (freelance music consultant)

14:30 – Q & A and Discussion Session
Chaired by Geraint Wiggins (Department of Computing, Goldsmiths, University of London)

16:00 – End

Abstracts for Morning Research Papers

MusicNet: Aligning Musicology’s Metadata

David Bretherton, Daniel Alexander Smith, Joe Lambert and mc schraefel (Music, and Electronics and Computer Science, University of Southampton)

As more resources are published as Linked Data, data from multiple heterogeneous sources should be more rapidly discoverable and automatically integrable, enabling it to be reused in contexts beyond those originally envisaged. But Linked Data is not of itself a complete solution. One of the key challenges of Linked Data is that its strength is also a weakness: anyone can publish anything. So in music, for instance, 17 sources may independently publish data about ‘Schubert’, but there is no de facto way to know that any of these Schuberts are the same, because the sources are not aligned. Alignment is a prerequisite for usable Linked Data, without which resources are effectively stranded rather than integrated. To begin to address this, the MusicNet project has minted URIs for composers, and has published as RDF basic biographical data and – crucially – alignment information for several leading providers of musicological data.

Towards Web-Scale Analysis of Musical Structure

J. Stephen Downie (Graduate School of Library and Information Science, University of Illinois at Urbana-Champaign), David De Roure (Oxford e-Research Centre, University of Oxford) and Kevin Page (Oxford e-Research Centre, University of Oxford)

SALAMI (Structural Analysis of Large Amounts of Music Information) is an ambitious computational musicology project which applies a computational approach to the huge volume of digital recordings now available from such sources as the Internet Archive. It aims to deliver a very substantive corpus of musical analyses in a common framework for use by music scholars, students and beyond, and to establish a web-based methodology and tooling which will enable others to add to this in the future. In its first phase the project has conducted a significant exercise in ground truth collection with 1000 recordings analysed by music students and shortly to be published as open Linked Data.

LinkedBrainz Live

Simon Dixon, Cedric Mesnage and Barry Norton (Centre for Digital Music, Queen Mary University of London)

The MusicBrainz dataset is a large open (openly-licensed and open to contribution) collection of metadata about music, containing information on artists, their recorded works, and acoustic fingerprints. The LinkedBrainz project aims at making MusicBrainz Linked Data compliant. Linked Data principles require that the data is made available using an RDF serialisation over HTTP, and that this is interlinked with existing datasets. Linked Data Best Practice encourages an endpoint where queries can be made using the SPARQL query language for RDF. The LinkedBrainz project is rolling out an RDFa annotation of the relevant MusicBrainz pages, and is preparing a SPARQL endpoint and RDF-based dereferencing.  In this talk we will give further details on progress and future work, and will show the utility of the dataset as Linked Data by demonstrating the ease with which ‘mash-ups’ can be formed, based on interlinkage with resources such as DBPedia and BBC Music Reviews.

BBC Music – Using the Web as our Content  Management System

Nicholas Humfrey (BBC)

The BBC Music site provides a page for every artist played on the BBC. These pages use persistent web identifiers for each artist, which serve as an aggregation point for all content and information. By reusing structured data available elsewhere on the Web – the Web becomes our Content Management System. The core metadata is then enhanced with content, such as videos and reviews, from the BBC, thereby providing a compelling audience proposition and also making the BBC content re-aggregatable by other websites, thus contributing to the web as a whole.

 

Progress Update

10 Mar

Its time for a short update on how the project is progressing. We’ve had an incrementally feature-full prototype of our Codex available on our project web since January and we’ve been working hard to improve it. If you haven’t already then head on over to http://musicnet.mspace.fm/codex and search for a composer.

What have we added since January?

Content Negotiation

One of the most important features we’ve added since January is content-negotiation. This enables our Codex to serve up the most appropriate content dependant on the ‘Accept’ header received in the HTTP request. For a more detailed writup see Dan’s blog post on the MusicNet URI Scheme.

A simple example would be:

Franz Schuberts URI is: http://musicnet.mspace.fm/person/7ca5e11353f11c7d625d9aabb27a6174

If we request this from a regular web browser we are dereferenced to the HTML content at: http://musicnet.mspace.fm/person/7ca5e11353f11c7d625d9aabb27a6174.html

However, if we request this URI from a semantic web browser we are dereferenced the RDF content at: http://musicnet.mspace.fm/person/7ca5e11353f11c7d625d9aabb27a6174.rdf

Data Enrichment

We have also been working hard to leverage the data we’ve aligned over the last year to enrich the information provided by our various data partners. Last year we met with the LinkedBrainz team and they provided us with a small set of composer data from MusicBrainz for us to align against. This has allowed us to draw additional information from other open data sources such as the BBC, Wikipedia/DBPedia, IMDB and even the New York Times to provide a more complete representation of the data available about a composer.

This data is available in both the RDF and the HTML representation of the Codex.

e.g. Schubert, Franz (HTML | RDF)

Alignment Progress

Alignment is moving on well and we’re currently at 89%.

What is left to do?

One of the discussions the MusicNet team has been involved in since the start of the project has been data.ac.uk and the need for in perpetuity hosting of URI’s minted by JISC projects.

We’re currently in discussions to be one of the first projects to be able to make use of this domain and hope that by the end of the project we’ll be able to move our Codex and URI’s over to a suitable domain such as musicnet.data.ac.uk. This will ensure that the data we’ve exposed will be available after the project’s end.

MusicNet Workshop

We’re also hosting a small workshop on the 12th May at JISC HQ to try and expose more people to the potential of the MusicNet URI’s. The workshop will also be looking more broadly at the current Music & Linked Data landscape & should cater to a broad audience. It’s filling up very quickly so if you’re interested and haven’t yet made contact please do so soon.

For more details see our announcement

 
 

Strata 2011 – Big Data

07 Mar

The internet exerts an unprecedented equalizing force in bringing access to information to everyone on the planet. More information is available (and mainly for free) now than ever before, and yet it is becoming clear that access to information is not enough. The infrastructure to store and share data within sectors is a vital part of the ecosystem, and yet it is often treated as an afterthought. We need a radical change in the way we develop infrastructure in the higher education sector, to ensure that services consumed and funded by the public can do their job as efficiently as possible and at the best possible price.

http://cottagelabs.com/strata-2011-review/

This is key, we’ve found that having data isn’t enough (although its a great start!). Making that data available in some meaningful way is more of a challenge. With MusicNet we’re striving to make readily available Musicology information available in a single place under a single search. We’re leveraging Linked Data technologies to better allow others to integrate their own data with our project outputs.

 
 

MusicNet at SDH 2010

12 Nov

Austrian Parliament Building

Last month I was very pleased to be able to present the work of the musicSpace and MusicNet research teams at the Supporting the Digital Humanities 2010 conference (Vienna, 19-20 October 2010), which was jointly organized by CLARIN and DARIAH. The musicology session was convened by PhD student Richard Lewis, and also featured presentations by Alan Marsden and Frans Wiering. Our paper explained how the motivation for MusicNet came out of our previous work on the musicSpace project.

Please download the slides from our presentation below, and take a look at the other presentations from the conference at http://www.dariah.eu/index.php?option=com_docman&Itemid=200.

 
 

MusicNet at AHM 2010

13 Oct
City Hall, Cardiff

City Hall, Cardiff

Last month at the UK e-Science All Hands Meeting 2010 at City Hall in Cardiff (13-16 September 2010), we gave our first conference paper about the MusicNet project. Thank you to everyone that came to our session and asked questions. It was informative to learn that many other delegates have encountered datasets (across a range of subjects, from geography to chess!) in which synonymous entities are not aligned; precisely the problem which the alignment tool we are building for MusicNet aims to address.

A short abstract of our paper is given below, but please also take a look at the extended abstract and our presentation slides:

Thank you to the organising committee and administrators for making the event run so smoothly.

ABSTRACT: The MusicNet Composer URI Project
Daniel Alexander Smith, David Bretherton, Joe Lambert, and mc schraefel

In any domain, a key activity of researchers is to search for and synthesize data from multiple sources in order to create new knowledge. In many cases this process is laborious, to the point of making certain questions nearly intractable because the cost of the searches outstrips the time available to complete the research. As more resources are published as Linked Data, data from multiple heterogeneous sources should be more rapidly discoverable and automatically integrable, enabling previously intractable queries to be explored, and standard queries to be significantly accelerated for more rapid knowledge discovery. But Linked Data is not of itself a complete solution. One of the key challenges of Linked Data is that its strength is also a weakness: anyone can publish anything. So in classical music, for instance, 17 sources may publish data about ‘Schubert’, but there is no de facto way to know that any of these Schuberts are the same, because the sources are not aligned. Without alignment, much of the benefit of Linked Data is diminished: resources can effectively be stranded rather than discovered, or tangled nets of only guessed at associations in a particular dataset can end up costing more than their value to untangle.

The MusicNet project, which emerged out of Southampton’s musicSpace project, is set to address the challenge just outlined by “minting” URIs for key musicology assets to provide a framework for the effective exploration of Linked Data about classical music. Unique URIs will be minted for each composer that exists in our data partners’ datasets. Basic biographical data will also be exposed, as well as name variants in different sources to allow for compatibility with legacy data. Crucially, this information will be curated by domain experts so that MusicNet will become a reliable source of data about the names of classical music composers. However, the real benefit of this work is that it will align identifiers across data sources, which is a prerequisite for the creation of Linked Data classical music and musicology resources, if such resources are to be optimally useful and usable.

The establishment of authoritative URIs for composers, and moreover the disambiguation of composers in online data sources that will flow from this, is an essential first step in the provision of Linked Data services for classical music and musicology. Our work will provide a model and tools that can usefully be employed elsewhere.