Re: A rough roadmap for internationalization fixes

From: Peter Eisentraut <peter_e(at)gmx(dot)net>
To: Tatsuo Ishii <t-ishii(at)sra(dot)co(dot)jp>
Cc: pgsql-hackers(at)postgresql(dot)org
Subject: Re: A rough roadmap for internationalization fixes
Date: 2003-11-25 10:11:29
Message-ID: Pine.LNX.4.44.0311251103370.10715-100000@peter.localdomain
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-hackers

Tatsuo Ishii writes:

> > 3. Implement Unicode collation algorithm and character classification
> > routines that are aware of 1. Use that in place of system locale
> > routines.
>
> I don't see a relationship between Unicode and the one you are going
> to replace the system locale routines. If you are going to the
> direction for an "Unicode central" implementation, I will object.

The Unicode collation algorithm works for any character set, not only for
Unicode. It just happens to be published by the Unicode consortium. So
basically this is just a concrete alternative to making up our own out of
thin air. Also, the Unicode collation algorithm gives us the flexibility
to define customizations of collations that users frequently want, such as
ignoring or not ignoring punctuation.

Actually, what will more likely happen is that we'll define a collation as
a collection of one or more support functions, the equivalents of
strxfrm() and possibly a few more. Then it will be up to those functions
to define the collation order. The server will provide utility functions
that will facilitate implementing a collation order that follows the
Unicode collation algorithm, but you could just as well implement one
using memcmp() or whatever you like.

--
Peter Eisentraut peter_e(at)gmx(dot)net

In response to

Responses

Browse pgsql-hackers by date

  From Date Subject
Next Message Peter Eisentraut 2003-11-25 10:12:20 Re: A rough roadmap for internationalization fixes
Previous Message Dennis Bjorklund 2003-11-25 09:43:54 Re: A rough roadmap for internationalization fixes