[OPEN-ILS-DEV] Yet another function for uescaping UTF-8

Dan Scott denials at gmail.com
Sun Nov 30 00:13:05 EST 2008


2008/11/28 Scott McKellar <mck9 at swbell.net>:
> --- On Fri, 11/28/08, Dan Scott <dan at coffeecode.net> wrote:
>
> <snip -- about some new code for generating surrogate pairs>
>
>> I started generating some examples for you using Python;
>> maybe the
>> attached script will be helpful to you in generating other
>> ranges, but
>> here's a snippet of what the script generates for the
>> Ancient Greek
>> Numbers range
>> (http://www.utf8-chartable.de/unicode-utf8-table.pl gives
>> lots of alternate representations):
>
> <snip -- 95 translations from code points to surrogate pairs>
>
> My code yields the same surrogate pairs as are in the list you provided.
> I may play around with your Python script looking for corner cases, but
> so far as I can tell we're good to go.

I would agree. I successfully ran Evergreen on top of OpenSRF with
your patch and suffered no ill effects with the sample data I had at
hand (nothing fancy - just a handful of French records).

I think this is ready for integration. Thanks Scott!

-- 
Dan Scott
Laurentian University


More information about the Open-ils-dev mailing list