OCLC Research Will Harvest DSpace Metadata

DUBLIN, Ohio, USA, April 9, 2004—OCLC Research will periodically harvest OAI-compliant metadata from the institutional repositories of interested DSpace users. OCLC Research will convert the harvested metadata into a format suitable for re-harvesting by non-OAI services.

Harvesters miss much of the scholarly material on the Web, including the metadata held in OAI-PMH repositories such as those DSpace provides. Google has several problems harvesting OAI repositories because they differ from standard Web pages.

A standard DSpace installation identifies items with the Handle system, which (purposely) masks the identity of the host, making harvesting difficult to schedule. The OAI protocol links pages of metadata with URLs that may not be persistent, which also interferes with standard methods of harvesting.
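
To illustrate the paging problem, here is a minimal Python sketch of how an OAI-PMH harvester moves from one page of results to the next. Each response may carry a resumptionToken that has to be sent back to the repository to request the following page, so the "next page" link only exists for a harvester that speaks the protocol. The sample response, the repository address, and the function names are hypothetical and are not taken from OCLC's or DSpace's code.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlencode

# Namespace used by OAI-PMH 2.0 responses.
OAI_NS = "{http://www.openarchives.org/OAI/2.0/}"

# Hypothetical, abbreviated ListIdentifiers response (illustration only).
SAMPLE_RESPONSE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListIdentifiers>
    <header><identifier>oai:repository.example.edu:1234.5/1</identifier></header>
    <header><identifier>oai:repository.example.edu:1234.5/2</identifier></header>
    <resumptionToken>token-page-2</resumptionToken>
  </ListIdentifiers>
</OAI-PMH>"""


def parse_page(xml_text):
    """Return (identifiers, resumption_token) for one page of results.

    The token is None when the repository signals that the list is complete
    (no resumptionToken element, or an empty one).
    """
    root = ET.fromstring(xml_text)
    identifiers = [el.text for el in root.iter(OAI_NS + "identifier")]
    token_el = root.find(".//" + OAI_NS + "resumptionToken")
    token = token_el.text if token_el is not None and token_el.text else None
    return identifiers, token


def next_page_url(base_url, token):
    """Build the request for the next page from a resumption token."""
    # When a resumptionToken is sent, it is the only argument besides the verb.
    return base_url + "?" + urlencode(
        {"verb": "ListIdentifiers", "resumptionToken": token}
    )


ids, token = parse_page(SAMPLE_RESPONSE)
print(ids)
print(next_page_url("http://repository.example.edu/oai/request", token))
```

Because the token is issued by the repository and may expire, a general-purpose Web crawler cannot rely on the resulting URL staying valid between visits.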

OCLC Research is working with Google and MIT to periodically harvest interested DSpace users' metadata and transform it into a harvest-friendly format, resolve the handles so that institutions can be identified, and make the resulting URLs harvestable by search services such as Google.
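
The transformation step might look something like the following Python sketch, which turns harvested records into a plain HTML page of links that an ordinary crawler can follow. This is an illustration under stated assumptions, not OCLC Research's actual pipeline: the record layout, the handle values, the `render_link_page` function, and the dictionary standing in for handle resolution are all hypothetical.

```python
from html import escape


def render_link_page(records, resolve):
    """Render harvested records as a static HTML list of links.

    records: iterable of (title, handle) pairs taken from harvested metadata.
    resolve: callable mapping a handle to the repository URL it points to,
             so that the hosting institution is visible in the link.
    """
    items = []
    for title, handle in records:
        url = resolve(handle)
        items.append(
            '<li><a href="%s">%s</a></li>' % (escape(url, quote=True), escape(title))
        )
    return "<html><body><ul>\n" + "\n".join(items) + "\n</ul></body></html>"


# Hypothetical lookup table standing in for a real handle-resolution step.
RESOLVED = {
    "1234.5/1": "http://repository.example.edu/handle/1234.5/1",
    "1234.5/2": "http://repository.example.edu/handle/1234.5/2",
}

page = render_link_page(
    [("Thesis on A & B", "1234.5/1"), ("Technical report", "1234.5/2")],
    RESOLVED.__getitem__,
)
print(page)
```

Passing the resolver in as a function keeps the rendering step independent of how handles are actually resolved, which in practice would involve a network lookup.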

Posted by Ray Matthews on April 12, 2004 at 04:12 PM