[OPEN-ILS-DEV] Book covers and encoding issues

Linda Jansova skolkova at chello.cz
Thu Aug 4 15:27:37 EDT 2016


Hi all,

Back in March 2016 we were awarded a grant to create a Perl module for 
Czech book covers (and related added content like tables of contents, 
summaries or book ratings) from obalkyknih.cz.

We have already created a working version of the module and implemented 
it on our test server (currently running Evergreen 2.10.2) at 
http://eg-test.osvobozena-knihovna.cz/.

A sample catalog record with added content from obalkyknih.cz can be 
found here:

http://www.jabok.cuni.cz/eg/opac/record/13907?locg=102

However, we are struggling to *get the encoding right*. This affects 
*all text-based added content*, such as summaries or tables of 
contents in text format. Our programmer has verified that the content 
received from obalkyknih.cz (the provider) is correctly encoded.

We have also *tested Open Library added content in Evergreen*; Open 
Library is a default added content provider (we haven't tested the other 
modules, such as Content Cafe, as these are subscription-based).

To make sure we have a book that is both in our system and in Open 
Library, we have added an appropriate table of contents to this book:

https://openlibrary.org/works/OL577950W/U%CC%81stavni%CC%81_pe%CC%81c%CC%8Ce
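
(A side note on that URL: the title in it is percent-encoded in decomposed
form, i.e. each accented letter is a base letter followed by a separate
combining accent. We do not know whether this has anything to do with the
corruption described below. The following minimal Python sketch is not
Evergreen code; it only takes the slug copied from the URL, shows what it
decodes to and how it can be normalized to precomposed letters.)

```python
import unicodedata
from urllib.parse import unquote

# Title part of the Open Library URL, copied verbatim.
slug = "U%CC%81stavni%CC%81_pe%CC%81c%CC%8Ce"

# unquote() interprets the percent-escapes as UTF-8 bytes by default.
decoded = unquote(slug)

# NFC normalization merges each base letter with its combining accent.
composed = unicodedata.normalize("NFC", decoded)

print(len(decoded))   # 16 code points (decomposed form)
print(len(composed))  # 12 code points (precomposed form)
print(composed)
```

Both forms look the same on screen, but they are different sequences of
code points, so software that compares or re-encodes them may treat them
differently.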

When we check how it looks in our production system (Evergreen 
2.8.3), we *get corrupted letters*, as you can see at:

http://www.jabok.cuni.cz/eg/opac/record/13907?locg=102 (please click on 
Additional Content in the lower part of the page)

We have also *tested a 2.10.5 instance with Open Library data* and the 
problem has remained the same (due to the lack of free public IP 
addresses for this Evergreen instance, I am unable to send you a working link).

Therefore we assume *the problem might not be in our module (as the Open 
Library module shares the same issue) but maybe in AddedContent.pm*? Or 
somewhere else?
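
To illustrate the kind of corruption we mean, here is a minimal Python
sketch. It is not Evergreen code, and it rests on an assumption we have not
confirmed: that somewhere along the way correct UTF-8 bytes are read as if
they were Latin-1. The function names are made up for this example.

```python
# Illustration only: shows how correct UTF-8 text turns into corrupted
# letters when its bytes are decoded with the wrong charset (Latin-1 here,
# which is an assumption, not a confirmed diagnosis).

def corrupt(text: str) -> str:
    """Encode as UTF-8, then wrongly decode the bytes as Latin-1."""
    return text.encode("utf-8").decode("latin-1")

def repair(garbled: str) -> str:
    """Reverse the mistake: recover the raw bytes, decode them as UTF-8."""
    return garbled.encode("latin-1").decode("utf-8")

original = "Ústavní péče"
garbled = corrupt(original)

print(len(original), len(garbled))  # the garbled text is longer
print(repair(garbled) == original)  # True: the damage is reversible
```

If the corrupted text in the catalog can be repaired this way, that would
suggest a wrong decoding step rather than data lost at the source.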

I am not a programmer myself but I have had a look at Evergreen Git and 
have come across the following possible hints (I may be thoroughly 
mistaken, though):

  * AddedContent.pm
    (http://git.evergreen-ils.org/?p=Evergreen.git;a=blob;f=Open-ILS/src/perlmods/lib/OpenILS/WWW/AddedContent.pm;h=254d790cf630242e6a07bc57286e56ec1a7afb34;hb=HEAD)
    mentions Base64 encoding (see line 340).
  * Table of contents is mentioned in the files at
    http://git.evergreen-ils.org/?p=Evergreen.git;a=blob;f=Open-ILS/xsl/MARC21slim2MODS32.xsl
    and at
    http://git.evergreen-ils.org/?p=Evergreen.git;a=blob;f=Open-ILS/xsl/MARC21slim2MODS33.xsl.
    (We already experienced a problem with these templates but at least
    for Zotero it has been - thanks to Dan Scott - resolved:
    https://bugs.launchpad.net/evergreen/+bug/1442276.)
  * Feed.pm module
    (http://git.evergreen-ils.org/?p=Evergreen.git;a=blob;f=Open-ILS/src/perlmods/lib/OpenILS/WWW/SuperCat/Feed.pm,
    line 796) and some .sql files seem to use output_string, which is
    probably deprecated, as Dan pointed out in connection with Zotero
    encoding issues. The list of files which mention output_string can
    be found here:
    http://git.evergreen-ils.org/?p=Evergreen.git&a=search&h=HEAD&st=grep&s=output_string.
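
Regarding the Base64 hint above: we have not checked what AddedContent.pm
actually does at that line, so the following Python sketch is only a general
illustration with made-up function names. Base64 carries bytes, not
characters, so the text survives only if both sides agree on the character
encoding of those bytes.

```python
import base64

def to_base64(text: str) -> str:
    """Encode text as UTF-8 bytes, then wrap the bytes in Base64."""
    return base64.b64encode(text.encode("utf-8")).decode("ascii")

def from_base64(payload: str, charset: str = "utf-8") -> str:
    """Unwrap Base64 and decode the bytes with the given charset."""
    return base64.b64decode(payload).decode(charset)

text = "Ústavní péče"
payload = to_base64(text)

print(from_base64(payload) == text)             # True: same charset on both sides
print(from_base64(payload, "latin-1") == text)  # False: charset mismatch
```

The Base64 step itself does not damage anything; the corruption appears
only when the decoded bytes are interpreted with a different charset than
the one used to produce them.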

(BTW, once our module is polished and ready to share, which should 
happen rather soon, we will of course work with you so that it can be 
included in Evergreen.)

Thank you in advance for your help in our programming endeavor!

Linda