[OPEN-ILS-DEV] Deduping, 856's and located URIs

Justin Hopkins justin at mobiusconsortium.org
Thu Dec 6 16:12:29 EST 2012


Has anyone developed a method to dedupe while preserving/merging 856's?

We're looking at doing a dedupe process using the sclends dedupe sql 
script that's available in the migration-tools, but we're concerned 
about a couple of things having to do with electronic resources.

We have been using located uri's for a while now to show bibs for 
e-audiobooks, and other online resources in the OPAC despite their lack 
of volumes/items. This has worked well for the most part but now that we 
are waiting to dedupe we've found that the sclends sql has problems 
handling this.

Finding duplicates that aren't: The 245 is used by the script to 
identify potential duplicates, but it doesn't look at the $h which 
according to good cataloging standards would be a reasonable place to 
put something like [Overdrive downloadable e-audiobook]. The result is 
that bibs with physical items attached can chosen as subordinate bibs 
with the bib from the electronic resource being picked as the lead.

Handling of multiple and different 856's: The sclends script doesn't 
handle 856's at all as far as I can tell in my tests. Ideally it would 
create multiple 856's - adding the 856 from the subordinate bib onto the 
lead bib, or even better, identify duplicate 856's based on identical 
uri's and then just append any $9's from the sub to the lead.

There are other things that come to mind but this is what's really 
holding us up. We're starting to get a whole lot of duplicates in 
Missouri Evergreen so we need to dedupe but since we do have quite a few 
electronic resources with $9's we can't afford to clobber them.

I'd appreciate any suggestions.

-- 
Justin Hopkins
Manager Information Technology
573-808-2309



More information about the Open-ils-dev mailing list