Thursday, 14 June 2012

Databib - An Online, Community-driven, Annotated Bibliography & Registry of Research Data Repository



A partnership between Purdue and Penn State Universities proposes the creation of a new resource called Databib that will provide a “spark” to help engage librarians in data services by providing them with an online, community-driven, annotated bibliography and registry of research data repositories. In addition to being an important reference resource to librarians, data users, data producers, publishers, and research funding agencies, the Databib platform will challenge the concept of the traditional bibliography by serving and integrating bibliographic content in Web 1.0, Web 2.0, and Web 3.0 environments in order to help overcome some shortcomings or perceptions of traditional bibliographies. One technology in particular, Linked Data, shows a great deal of promise for delivering a “web of data” (i.e., the Semantic Web) and giving librarians a new toolkit for describing and classifying data in a relational manner.
An expanded role for research libraries in digital data stewardship was forecasted by an Association of Research Libraries (ARL) workshop report to the NSF in 2006 [1]. This forecast was substantiated in August 2010 by a survey of 57 ARL libraries, of which 21 libraries reported that they currently provide infrastructure or support services for e-Science, and an additional 23 libraries are in planning stages [2]. A number of academic and research libraries are beginning to take a more active role in data management on their campuses, applying library science principles to help address the data deluge. This includes a wide range of activities such as helping researchers formulate funder-required data plans, adapting library practice to help organize and describe research datasets, developing data collections and data repositories, digital preservation, and data literacy.
Librarians are in a good position to provide these services; unfortunately, there is currently no framework in place to support the organization and discovery of data repositories. Many funding agencies are requiring their sponsored researchers to submit their data to repositories without giving further instructions to them. What repositories are appropriate for a researcher to submit his or her data to? How do potential users find appropriate data repositories and discover datasets that meet their needs? How can librarians help patrons who are looking for data find and integrate it into the patrons’ research, learning, or teaching? Databib will begin to address these needs for librarians, data users, data producers, publishers, and funding agencies.
The deliverables of this nine-month project will be 1) a functional and useful Databib platform as described in the project design; 2) the original description and annotation of primary repositories of research data represented by records in Databib; 3) a rubric for evaluating new repositories for inclusion in Databib; 4) documentation and supporting activities to catalyze a community of bibliographers; and 5) a white paper written for IMLS that describes the project design and provides an analysis of the project's results in terms of meeting these outcomes. A small panel of advisers will provide guidance throughout the project as well as to periodically review progress and give input to maximize the effectiveness of the project. The project’s design and evaluation plan establish measurable goals and outcomes for software development, the creation of new Databib records by both project personnel and community bibliographers, the number of integrations accomplished, usage statistics, and user feedback. All source code, content, and data will be sharing and dedicated to the public domain using Creative Commons Zero 1.0.
Databib is supported in part by a Sparks! Ignition National Leadership Grant (LG-46-11-0091-11) from the Institute of Museum and Library Services.

No comments:

Post a Comment