Monday 31 October 2011

Back with more data!

As announced by Jim Michalko on the OCLC research blog, we've launched another dataset!

Its yet more bib data, this time comprising over 600,000 records originating from Worldcat as RDF triples. We've also loaded most of this into our triplestore. OCLC have enhanced this data with links to the FAST and VIAF authority services.

Even better, the previous two datasets we released have also been enhanced with the same links. There are still some things that could be better, especially our vocab choices around VIAF expression, but the data is there.

This data is licensed under an ODC-by Attribution License and is one of the first to make use of OCLC's newly updated community norms (details here), their preference for licensing Worldcat data for reuse. http://www.blogger.com/img/blank.gif

This is slightly in contrast to the pain free PDDL we've managed to provide so far, but we and OCLC are interested to see what users will make of this. The attribution is handled at a dataset level and should be relatively easy to implement and maintain

Dealing with attribution stacking was a major problem we encountered with COMET. That was partly due to Marc21s' inability to manage multiple record identifiers well and necessitated complex decision making regarding record ownership. Hopefully, the clear attribution policy set out here should be much easier to handle than the 'hobo stew' we encountered in our catalogue (as Jim puts it)!

I'd like to thank various folk at OCLC (especially our lead contact Eric Childress) for their support and patience over the past few months whilst we worked through a number of technical, administrative and legal points. They were voluntary partners on COMET but have given us a lot of time and assistance.

Next up, (when I find the time), will be enhanced links to Library of Congress subject headings and the recently released Name Authority File for everything in out triplestore.

14 comments:

  1. I really like you post good blog,Thanks for your sharing.

    ดูหนังเกาหลี

    ReplyDelete
  2. Thanks Chamberlain for this post. I have read your post very carefully. You talk about more data back. I found more information about it. To know about please check it.

    ReplyDelete
  3. it was an great writing about Cambridge open meta data , previously i read some article about it but after reading this i think i found new something useful site to know more.

    ReplyDelete
  4. Technology makes our life so more easy and enjoyable. We can use now big data which is very important for us. I Hope every one are like to use this. visit the site if want to know more about the writing of your academic papers, research papers and thesis editing.

    ReplyDelete
  5. My suggestion is to invite many people as you can to read that article,
    because it is helpful for others thanks from useful link

    ReplyDelete
  6. Project releted information is always effective for me. After reading out this post I inspired about my new project. In this JISC Comet progect I got some new ideas as well. Thanks. visit website and it'll help you to know more about the academic papers writing.

    ReplyDelete
  7. It is really good to know that you come back with more information to fed us. SOmetimes you need more information to complete the whole things. https://www.literaturereviewwritingservice.com/buy-literature-review-online/ You can find more info here

    ReplyDelete
  8. Dealing with data is complicated thing but still it is possible of we take it as a big challenge. You must need to know more about data and it's types. So below given site can help you a lot in this content.
    http://www.essayrevision.net/creative-essay-title-generator/

    ReplyDelete
  9. The content of your blog is exactly what I need, I like your blog
    I sincerely hope your blog has a rapid increase in traffic density.
    That helps promote your blog and we hope your blog is being updated.
    wordpress
    ufa88kh.blogspot
    youtube
    មាន់ជល់





    ReplyDelete