Thursday, July 15, 2010

2010/07/13-15 - ACES Voyager merge pain

I've been working on the tool that decorates the Dublin Core record for each PDF in the ACES scanning project with metadata from our Voyager catalog. I've got code running that extracts each record's metadata from the METS files the scanning contractor delivered, and I also have the list of Voyager IDs, so now I need a strategy to match A with B and glue it all together.
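To make the matching problem concrete, here is a rough sketch of one possible approach, not the code I'm actually running. It assumes the Dublin Core elements sit somewhere inside each METS file and that one of the dc:identifier values corresponds to a Voyager ID once leading zeros are ignored. Both of those are guesses about the data, and the function names (extract_dc, normalize, match_records) are made up for illustration.

```python
import xml.etree.ElementTree as ET

# Standard Dublin Core element namespace, in ElementTree's {uri} form.
DC_NS = "{http://purl.org/dc/elements/1.1/}"


def extract_dc(mets_xml):
    """Collect every Dublin Core element found anywhere in a METS document."""
    root = ET.fromstring(mets_xml)
    record = {}
    for el in root.iter():
        if el.tag.startswith(DC_NS) and el.text and el.text.strip():
            name = el.tag[len(DC_NS):]
            record.setdefault(name, []).append(el.text.strip())
    return record


def normalize(identifier):
    """Assumed normalization: trim whitespace and drop leading zeros."""
    return identifier.strip().lstrip("0")


def match_records(mets_records, voyager_ids):
    """Pair each METS record with a Voyager ID via dc:identifier (an assumption).

    mets_records maps a file name to the dict returned by extract_dc.
    Returns (matched, unmatched): a file-name-to-ID dict and a list of
    file names that need manual review.
    """
    known = {normalize(v): v for v in voyager_ids if normalize(v)}
    matched, unmatched = {}, []
    for name, record in mets_records.items():
        candidates = (normalize(i) for i in record.get("identifier", []))
        hit = next((known[c] for c in candidates if c in known), None)
        if hit is None:
            unmatched.append(name)
        else:
            matched[name] = hit
    return matched, unmatched
```

Keeping the unmatched list separate matters more than the matching itself: those are the records someone will have to reconcile by hand, or by falling back to a fuzzier key such as the title.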

I'm making the best of this ACES project by taking the opportunity to update some shared code and try some new things, but I believe that the index of the PDFs' full text makes the metadata from the catalog superfluous for discovery - especially after Google and Bing scan our server. The librarians refuse to believe that though!
