Re: [COMMITTERS] pgsql: Re-allow UTF8 encodings on win32.

From: mha(at)postgresql(dot)org (Magnus Hagander)
To: pgsql-committers(at)postgresql(dot)org
Subject: pgsql: Re-allow UTF8 encodings on win32.
Date: 2007-10-16 11:30:16
Message-ID: 20071016113016.638E9754229@cvs.postgresql.org
Lists: pgsql-committers pgsql-hackers

Log Message:
-----------
Re-allow UTF8 encodings on win32. Since UTF8 is converted to
UTF16 before being used, all (valid) locales will work for this.

Modified Files:
--------------
pgsql/src/backend/commands:
dbcommands.c (r1.201 -> r1.202)
(http://developer.postgresql.org/cvsweb.cgi/pgsql/src/backend/commands/dbcommands.c?r1=1.201&r2=1.202)
pgsql/src/bin/initdb:
initdb.c (r1.146 -> r1.147)
(http://developer.postgresql.org/cvsweb.cgi/pgsql/src/bin/initdb/initdb.c?r1=1.146&r2=1.147)


From: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To: Magnus Hagander <magnus(at)hagander(dot)net>, Dave Page <dpage(at)postgresql(dot)org>
Cc: pgsql-hackers(at)postgresql(dot)org
Subject: Re: [COMMITTERS] pgsql: Re-allow UTF8 encodings on win32.
Date: 2007-10-16 17:23:37
Message-ID: 20614.1192555417@sss.pgh.pa.us
Lists: pgsql-committers pgsql-hackers

mha(at)postgresql(dot)org (Magnus Hagander) writes:
> Re-allow UTF8 encodings on win32. Since UTF8 is converted to
> UTF16 before being used, all (valid) locales will work for this.

So where do we stand on the Windows locale/encoding business --- are
we happy with the behavior now, or does it still need work?

regards, tom lane


From: Magnus Hagander <magnus(at)hagander(dot)net>
To: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
Cc: Dave Page <dpage(at)postgresql(dot)org>, pgsql-hackers(at)postgresql(dot)org, Hiroshi Saito <z-saito(at)guitar(dot)ocn(dot)ne(dot)jp>
Subject: Re: [COMMITTERS] pgsql: Re-allow UTF8 encodings on win32.
Date: 2007-10-16 17:29:20
Message-ID: 4714F4F0.5000505@hagander.net
Lists: pgsql-committers pgsql-hackers

Tom Lane wrote:
> mha(at)postgresql(dot)org (Magnus Hagander) writes:
>> Re-allow UTF8 encodings on win32. Since UTF8 is converted to
>> UTF16 before being used, all (valid) locales will work for this.
>
> So where do we stand on the Windows locale/encoding business --- are
> we happy with the behavior now, or does it still need work?

I think we're good. But I'd like to hear some verification from somebody
else. Specifically, I'd like to hear a signoff from someone who can
actually do "real tests" on a locale that's not US and not Swedish.
Also, I'd like to hear from the Japanese people (Hiroshi? Can you do
this?) that we didn't break it for them. I don't think we did, but I
want to be sure :)

Hiroshi, and whoever else can help to test, this is only testing the
backend, not the installer. The installer may need a few minor tweaks
still once the backend is considered fixed. And what needs to be tested
is CVS HEAD as of today.

//Magnus


From: "Hiroshi Saito" <z-saito(at)guitar(dot)ocn(dot)ne(dot)jp>
To: "Magnus Hagander" <magnus(at)hagander(dot)net>, "Tom Lane" <tgl(at)sss(dot)pgh(dot)pa(dot)us>
Cc: "Dave Page" <dpage(at)postgresql(dot)org>, <pgsql-hackers(at)postgresql(dot)org>
Subject: Re: [COMMITTERS] pgsql: Re-allow UTF8 encodings on win32.
Date: 2007-10-16 18:11:30
Message-ID: 08ad01c8101f$fa4733a0$c601a8c0@HP22720319231
Lists: pgsql-committers pgsql-hackers

Hi.

Um, It seems that it only passed the strict check of chklocale.c. Probably, It may
enable mistaken selection...However, I will clarify a problem by the test.

Regards,
Hiroshi Saito

From: "Magnus Hagander" <magnus(at)hagander(dot)net>

> Tom Lane wrote:
>> mha(at)postgresql(dot)org (Magnus Hagander) writes:
>>> Re-allow UTF8 encodings on win32. Since UTF8 is converted to
>>> UTF16 before being used, all (valid) locales will work for this.
>>
>> So where do we stand on the Windows locale/encoding business --- are
>> we happy with the behavior now, or does it still need work?
>
> I think we're good. But I'd like to hear some verification from somebody
> else. Specifically, I'd like to hear a signoff from someone who can
> actually do "real tests" on a locale that's not US and not Swedish.
> Also, I'd like to hear from the Japanese people (Hiroshi? Can you do
> this?) that we didn't break it for them. I don't think we did, but I
> want to be sure :)
>
> Hiroshi, and whoever else can help to test, this is only testing the
> backend, not the installer. The installer may need a few minor tweaks
> still once the backend is considered fixed. And what needs to be tested
> is CVS HEAD as of today.
>
> //Magnus


From: "Pavel Stehule" <pavel(dot)stehule(at)gmail(dot)com>
To: "Magnus Hagander" <magnus(at)hagander(dot)net>
Cc: "Tom Lane" <tgl(at)sss(dot)pgh(dot)pa(dot)us>, "Dave Page" <dpage(at)postgresql(dot)org>, pgsql-hackers(at)postgresql(dot)org, "Hiroshi Saito" <z-saito(at)guitar(dot)ocn(dot)ne(dot)jp>
Subject: Re: [COMMITTERS] pgsql: Re-allow UTF8 encodings on win32.
Date: 2007-10-16 18:17:25
Message-ID: 162867790710161117u7cd63bcci917bda69b098dbbb@mail.gmail.com
Lists: pgsql-committers pgsql-hackers

2007/10/16, Magnus Hagander <magnus(at)hagander(dot)net>:
> Tom Lane wrote:
> > mha(at)postgresql(dot)org (Magnus Hagander) writes:
> >> Re-allow UTF8 encodings on win32. Since UTF8 is converted to
> >> UTF16 before being used, all (valid) locales will work for this.
> >
> > So where do we stand on the Windows locale/encoding business --- are
> > we happy with the behavior now, or does it still need work?
>
> I think we're good. But I'd like to hear some verification from somebody
> else. Specifically, I'd like to hear a signoff from someone who can
> actually do "real tests" on a locale that's not US and not Swedish.
> Also, I'd like to hear from the Japanese people (Hiroshi? Can you do
> this?) that we didn't break it for them. I don't think we did, but I
> want to be sure :)
>
> Hiroshi, and whoever else can help to test, this is only testing the
> backend, not the installer. The installer may need a few minor tweaks
> still once the backend is considered fixed. And what needs to be tested
> is CVS HEAD as of today.
>
> //Magnus

I can test it with czech locale. Can I download binaries anywhere?

Pavel


From: "Hiroshi Saito" <z-saito(at)guitar(dot)ocn(dot)ne(dot)jp>
To: "Pavel Stehule" <pavel(dot)stehule(at)gmail(dot)com>, "Magnus Hagander" <magnus(at)hagander(dot)net>
Cc: "Tom Lane" <tgl(at)sss(dot)pgh(dot)pa(dot)us>, "Dave Page" <dpage(at)postgresql(dot)org>, <pgsql-hackers(at)postgresql(dot)org>
Subject: Re: [COMMITTERS] pgsql: Re-allow UTF8 encodings on win32.
Date: 2007-10-16 18:37:46
Message-ID: 08c401c81023$a5596c60$c601a8c0@HP22720319231
Lists: pgsql-committers pgsql-hackers

Hi.

> I can test it with czech locale. Can I download binaries anywhere?
http://winpg.jp/~saito/pg83/postgresql-8.3beta-cvs.tgz
It is a thing after regression test.(MinGW+gcc)

Regards,
Hiroshi Saito


From: "Hiroshi Saito" <z-saito(at)guitar(dot)ocn(dot)ne(dot)jp>
To: "Hiroshi Saito" <z-saito(at)guitar(dot)ocn(dot)ne(dot)jp>, "Magnus Hagander" <magnus(at)hagander(dot)net>, "Tom Lane" <tgl(at)sss(dot)pgh(dot)pa(dot)us>
Cc: "Dave Page" <dpage(at)postgresql(dot)org>, <pgsql-hackers(at)postgresql(dot)org>
Subject: Re: [COMMITTERS] pgsql: Re-allow UTF8 encodings on win32.
Date: 2007-10-16 18:39:48
Message-ID: 08c901c81023$ee8d0c20$c601a8c0@HP22720319231
Lists: pgsql-committers pgsql-hackers

Hi.

> Um, It seems that it only passed the strict check of chklocale.c. Probably, It may
> enable mistaken selection...However, I will clarify a problem by the test.

First, it is one problem....
http://winpg.jp/~saito/pg83/pg83b1-err.txt

And a test continues....


From: "Hiroshi Saito" <z-saito(at)guitar(dot)ocn(dot)ne(dot)jp>
To: "Hiroshi Saito" <z-saito(at)guitar(dot)ocn(dot)ne(dot)jp>, "Magnus Hagander" <magnus(at)hagander(dot)net>, "Tom Lane" <tgl(at)sss(dot)pgh(dot)pa(dot)us>
Cc: "Dave Page" <dpage(at)postgresql(dot)org>, <pgsql-hackers(at)postgresql(dot)org>
Subject: Re: [COMMITTERS] pgsql: Re-allow UTF8 encodings on win32.
Date: 2007-10-16 18:54:25
Message-ID: 091d01c81025$f8f2df80$c601a8c0@HP22720319231
Lists: pgsql-committers pgsql-hackers

Hi.

Second, it is big problem....
http://winpg.jp/~saito/pg83/pg83b1-err2.txt
It is text search config error.
However, It passes initdb.(locale=Japanese_Japan.932 ... This is ShiftJIS locale)

And a test continues....

Regards,
Hiroshi Saito


From: Magnus Hagander <magnus(at)hagander(dot)net>
To: Hiroshi Saito <z-saito(at)guitar(dot)ocn(dot)ne(dot)jp>
Cc: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>, Dave Page <dpage(at)postgresql(dot)org>, pgsql-hackers(at)postgresql(dot)org
Subject: Re: [COMMITTERS] pgsql: Re-allow UTF8 encodings on win32.
Date: 2007-10-16 19:01:42
Message-ID: 47150A96.9090705@hagander.net
Lists: pgsql-committers pgsql-hackers

Hiroshi Saito wrote:
> Hi.
>
>> Um, It seems that it only passed the strict check of chklocale.c.
>> Probably, It may enable mistaken selection...However, I will clarify a
>> problem by the test.
>
> First, it is one problem....
> http://winpg.jp/~saito/pg83/pg83b1-err.txt
>
> And a test continues....

But SJIS isn't supposed to work, no?

//Magnus


From: Magnus Hagander <magnus(at)hagander(dot)net>
To: Hiroshi Saito <z-saito(at)guitar(dot)ocn(dot)ne(dot)jp>
Cc: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>, Dave Page <dpage(at)postgresql(dot)org>, pgsql-hackers(at)postgresql(dot)org
Subject: Re: [COMMITTERS] pgsql: Re-allow UTF8 encodings on win32.
Date: 2007-10-16 19:02:11
Message-ID: 47150AB3.5010203@hagander.net
Lists: pgsql-committers pgsql-hackers

Hiroshi Saito wrote:
> Hi.
>
> Second, it is big problem....
> http://winpg.jp/~saito/pg83/pg83b1-err2.txt
> It is text search config error.
> However, It passes initdb.(locale=Japanese_Japan.932 ... This is
> ShiftJIS locale)
>
> And a test continues....

What text search config would you expect?

//Magnus


From: "Hiroshi Saito" <z-saito(at)guitar(dot)ocn(dot)ne(dot)jp>
To: "Magnus Hagander" <magnus(at)hagander(dot)net>
Cc: "Tom Lane" <tgl(at)sss(dot)pgh(dot)pa(dot)us>, "Dave Page" <dpage(at)postgresql(dot)org>, <pgsql-hackers(at)postgresql(dot)org>
Subject: Re: [COMMITTERS] pgsql: Re-allow UTF8 encodings on win32.
Date: 2007-10-16 19:07:41
Message-ID: 092c01c81027$d3a4c8e0$c601a8c0@HP22720319231
Lists: pgsql-committers pgsql-hackers

Hi.

> Hiroshi Saito wrote:
>> Hi.
>>
>> Second, it is big problem....
>> http://winpg.jp/~saito/pg83/pg83b1-err2.txt
>> It is text search config error.
>> However, It passes initdb.(locale=Japanese_Japan.932 ... This is
>> ShiftJIS locale)
>>
>> And a test continues....
>
> What text search config would you expect?

The problem here is that initdb passes with locale Japanese_Japan.932.

Regards,
Hiroshi Saito


From: Dave Page <dpage(at)postgresql(dot)org>
To: Hiroshi Saito <z-saito(at)guitar(dot)ocn(dot)ne(dot)jp>
Cc: Magnus Hagander <magnus(at)hagander(dot)net>, Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>, pgsql-hackers(at)postgresql(dot)org
Subject: Re: [COMMITTERS] pgsql: Re-allow UTF8 encodings on win32.
Date: 2007-10-16 19:15:14
Message-ID: 47150DC2.9070909@postgresql.org
Lists: pgsql-committers pgsql-hackers

Hiroshi Saito wrote:
> Hi.
>
> Second, it is big problem....
> http://winpg.jp/~saito/pg83/pg83b1-err2.txt
> It is text search config error.
> However, It passes initdb.(locale=Japanese_Japan.932 ... This is
> ShiftJIS locale)
>
> And a test continues....

The changes that were made were only to re-enable UTF-8.

SJIS wasn't ever supported as a server encoding
(http://www.postgresql.org/docs/8.2/interactive/multibyte.html). The
fact that initdb continues if you use Japanese_Japan.932 is an
inconsistency I reported previously but has yet to be fixed.

/D


From: "Hiroshi Saito" <z-saito(at)guitar(dot)ocn(dot)ne(dot)jp>
To: "Dave Page" <dpage(at)postgresql(dot)org>
Cc: "Magnus Hagander" <magnus(at)hagander(dot)net>, "Tom Lane" <tgl(at)sss(dot)pgh(dot)pa(dot)us>, <pgsql-hackers(at)postgresql(dot)org>
Subject: Re: [COMMITTERS] pgsql: Re-allow UTF8 encodings on win32.
Date: 2007-10-16 19:48:44
Message-ID: 096701c8102d$8f470c70$c601a8c0@HP22720319231
Lists: pgsql-committers pgsql-hackers

From: "Dave Page" <dpage(at)postgresql(dot)org>

> Hiroshi Saito wrote:
>> Hi.
>>
>> Second, it is big problem....
>> http://winpg.jp/~saito/pg83/pg83b1-err2.txt
>> It is text search config error.
>> However, It passes initdb.(locale=Japanese_Japan.932 ... This is
>> ShiftJIS locale)
>>
>> And a test continues....
>
> The changes that were made were only to re-enable UTF-8.

Yes, Please see,
http://winpg.jp/~saito/pg83/pg83b1-err2.txt
Is that initdb is successful a problem as for this?

>
> SJIS wasn't ever supported as a server encoding
> (http://www.postgresql.org/docs/8.2/interactive/multibyte.html). The
> fact that initdb continues if you use Japanese_Japan.932 is an
> inconsistency I reported previously but has yet to be fixed.

Yes. However, encoding and locale are not equivalent.

Regards,
Hiroshi Saito


From: Dave Page <dpage(at)postgresql(dot)org>
To: Hiroshi Saito <z-saito(at)guitar(dot)ocn(dot)ne(dot)jp>
Cc: Magnus Hagander <magnus(at)hagander(dot)net>, Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>, pgsql-hackers(at)postgresql(dot)org
Subject: Re: [COMMITTERS] pgsql: Re-allow UTF8 encodings on win32.
Date: 2007-10-16 19:56:17
Message-ID: 47151761.9080400@postgresql.org
Lists: pgsql-committers pgsql-hackers

Hiroshi Saito wrote:
> From: "Dave Page" <dpage(at)postgresql(dot)org>
>
>> Hiroshi Saito wrote:
>>> Hi.
>>>
>>> Second, it is big problem....
>>> http://winpg.jp/~saito/pg83/pg83b1-err2.txt
>>> It is text search config error.
>>> However, It passes initdb.(locale=Japanese_Japan.932 ... This is
>>> ShiftJIS locale)
>>>
>>> And a test continues....
>>
>> The changes that were made were only to re-enable UTF-8.
>
> Yes, Please see,
> http://winpg.jp/~saito/pg83/pg83b1-err2.txt
> Is that initdb is successful a problem as for this?

Oh, sorry - misread that. I chatted with Magnus about that. It is
correct, but misleading. pg_control will say Japanese_Japan.932 as well
iirc, even though it is really Japanese_Japan.65001.

Regards, Dave


From: Magnus Hagander <magnus(at)hagander(dot)net>
To: Dave Page <dpage(at)postgresql(dot)org>
Cc: Hiroshi Saito <z-saito(at)guitar(dot)ocn(dot)ne(dot)jp>, Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>, pgsql-hackers(at)postgresql(dot)org
Subject: Re: [COMMITTERS] pgsql: Re-allow UTF8 encodings on win32.
Date: 2007-10-16 19:56:29
Message-ID: 4715176D.3000603@hagander.net
Lists: pgsql-committers pgsql-hackers

Dave Page wrote:
> Hiroshi Saito wrote:
>> Hi.
>>
>> Second, it is big problem....
>> http://winpg.jp/~saito/pg83/pg83b1-err2.txt
>> It is text search config error.
>> However, It passes initdb.(locale=Japanese_Japan.932 ... This is
>> ShiftJIS locale)
>>
>> And a test continues....
>
> The changes that were made were only to re-enable UTF-8.
>
> SJIS wasn't ever supported as a server encoding
> (http://www.postgresql.org/docs/8.2/interactive/multibyte.html). The
> fact that initdb continues if you use Japanese_Japan.932 is an
> inconsistency I reported previously but has yet to be fixed.

That is a good point, if unrelated to this very discussion. Do we want
to change that thing to an exit instead of complain-and-continue? I
think yes?

//Magnus


From: Magnus Hagander <magnus(at)hagander(dot)net>
To: Dave Page <dpage(at)postgresql(dot)org>
Cc: Hiroshi Saito <z-saito(at)guitar(dot)ocn(dot)ne(dot)jp>, Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>, pgsql-hackers(at)postgresql(dot)org
Subject: Re: [COMMITTERS] pgsql: Re-allow UTF8 encodings on win32.
Date: 2007-10-16 19:57:13
Message-ID: 47151799.9030004@hagander.net
Lists: pgsql-committers pgsql-hackers

Dave Page wrote:
> Hiroshi Saito wrote:
>> From: "Dave Page" <dpage(at)postgresql(dot)org>
>>
>>> Hiroshi Saito wrote:
>>>> Hi.
>>>>
>>>> Second, it is big problem....
>>>> http://winpg.jp/~saito/pg83/pg83b1-err2.txt
>>>> It is text search config error.
>>>> However, It passes initdb.(locale=Japanese_Japan.932 ... This is
>>>> ShiftJIS locale)
>>>>
>>>> And a test continues....
>>> The changes that were made were only to re-enable UTF-8.
>> Yes, Please see,
>> http://winpg.jp/~saito/pg83/pg83b1-err2.txt
>> Is that initdb is successful a problem as for this?
>
> Oh, sorry - misread that. I chatted with Magnus about that. It is
> correct, but misleading. pg_control will say Japanese_Japan.932 as well
> iirc, even though it is really Japanese_Japan.65001.

Not so. The locale is Japanese_Japan, really. That's the only part
that's relevant for UTF16 encodings, which is what we use to do UTF8. We
specifically *don't* try to use Japanese_Japan.65001.

//Magnus
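
The naming convention behind this point can be sketched as follows: a Windows locale string such as Japanese_Japan.932 is a language/country name followed by an optional code page, and per the message above only the name part is relevant when the database encoding is UTF8. This is a minimal illustration, not PostgreSQL code; the helper name `split_windows_locale` is hypothetical.

```python
def split_windows_locale(locale_name):
    """Split 'Language_Country.codepage' into (name, codepage).

    Hypothetical helper for illustration only. The code page is
    returned as a string, or None when the locale has no suffix.
    """
    name, dot, codepage = locale_name.rpartition(".")
    if not dot:
        # No '.' present: the whole string is the locale name.
        return locale_name, None
    return name, codepage


# The suffix differs, but the name part is the same in both cases.
print(split_windows_locale("Japanese_Japan.932"))    # ('Japanese_Japan', '932')
print(split_windows_locale("Japanese_Japan.65001"))  # ('Japanese_Japan', '65001')
print(split_windows_locale("Japanese_Japan"))        # ('Japanese_Japan', None)
```

Note that this only shows how the string decomposes; as discussed in this thread, explicitly requesting the .65001 suffix is still expected to fail.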


From: "Hiroshi Saito" <z-saito(at)guitar(dot)ocn(dot)ne(dot)jp>
To: "Dave Page" <dpage(at)postgresql(dot)org>
Cc: "Magnus Hagander" <magnus(at)hagander(dot)net>, "Tom Lane" <tgl(at)sss(dot)pgh(dot)pa(dot)us>, <pgsql-hackers(at)postgresql(dot)org>
Subject: Re: [COMMITTERS] pgsql: Re-allow UTF8 encodings on win32.
Date: 2007-10-16 20:02:19
Message-ID: 098801c8102f$757972e0$c601a8c0@HP22720319231
Lists: pgsql-committers pgsql-hackers

Hi.

From: "Dave Page" <dpage(at)postgresql(dot)org>
>> Yes, Please see,
>> http://winpg.jp/~saito/pg83/pg83b1-err2.txt
>> Is that initdb is successful a problem as for this?
>
> Oh, sorry - misread that. I chatted with Magnus about that. It is
> correct, but misleading. pg_control will say Japanese_Japan.932 as well
> iirc, even though it is really Japanese_Japan.65001.

But, Please see.
http://winpg.jp/~saito/pg83/pg83b1-err3.txt
Japanese_Japan.65001 is error...
Japanese_Japan is true.

Regards,
Hiroshi Saito


From: Magnus Hagander <magnus(at)hagander(dot)net>
To: Hiroshi Saito <z-saito(at)guitar(dot)ocn(dot)ne(dot)jp>
Cc: Dave Page <dpage(at)postgresql(dot)org>, Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>, pgsql-hackers(at)postgresql(dot)org
Subject: Re: [COMMITTERS] pgsql: Re-allow UTF8 encodings on win32.
Date: 2007-10-16 20:15:45
Message-ID: 47151BF1.2040008@hagander.net
Lists: pgsql-committers pgsql-hackers

Hiroshi Saito wrote:
> Hi.
>
> From: "Dave Page" <dpage(at)postgresql(dot)org>
>>> Yes, Please see,
>>> http://winpg.jp/~saito/pg83/pg83b1-err2.txt
>>> Is that initdb is successful a problem as for this?
>>
>> Oh, sorry - misread that. I chatted with Magnus about that. It is
>> correct, but misleading. pg_control will say Japanese_Japan.932 as well
>> iirc, even though it is really Japanese_Japan.65001.
>
> But, Please see.
> http://winpg.jp/~saito/pg83/pg83b1-err3.txt
> Japanese_Japan.65001 is error...
> Japanese_Japan is true.

Yes, that is expected. If you explicitly ask for the .65001 locale it
will try the one that doesn't have the proper NLS files, and that
shouldn't work. If you just put in Japanese_Japan, it will use the UTF16
locale.

//Magnus


From: Dave Page <dpage(at)postgresql(dot)org>
To: Magnus Hagander <magnus(at)hagander(dot)net>
Cc: Hiroshi Saito <z-saito(at)guitar(dot)ocn(dot)ne(dot)jp>, Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>, pgsql-hackers(at)postgresql(dot)org
Subject: Re: [COMMITTERS] pgsql: Re-allow UTF8 encodings on win32.
Date: 2007-10-16 20:15:51
Message-ID: 47151BF7.7030303@postgresql.org
Lists: pgsql-committers pgsql-hackers

Magnus Hagander wrote:
> Not so. The locale is Japanese_Japan, really. That's the only part
> that's relevant for UTF16 encodings, which is what we use to do UTF8. We
> specifically *don't* try to use Japanese_Japan.65001.

That's not what I mean. From a *usability* perspective, Hiroshi should
see Japanese_Japan.65001 because he's selected UTF-8 in Japanese_Japan.
He shouldn't see Japanese_Japan.932 because that definitely isn't what
he selected.

/D


From: "Hiroshi Saito" <z-saito(at)guitar(dot)ocn(dot)ne(dot)jp>
To: "Hiroshi Saito" <z-saito(at)guitar(dot)ocn(dot)ne(dot)jp>, "Dave Page" <dpage(at)postgresql(dot)org>
Cc: "Magnus Hagander" <magnus(at)hagander(dot)net>, "Tom Lane" <tgl(at)sss(dot)pgh(dot)pa(dot)us>, <pgsql-hackers(at)postgresql(dot)org>
Subject: Re: [COMMITTERS] pgsql: Re-allow UTF8 encodings on win32.
Date: 2007-10-16 20:16:38
Message-ID: 09b701c81031$751ff650$c601a8c0@HP22720319231
Lists: pgsql-committers pgsql-hackers

> But, Please see.
> http://winpg.jp/~saito/pg83/pg83b1-err3.txt
> Japanese_Japan.65001 is error...
> Japanese_Japan is true.

However, The test of this state is continued.
But but but, Sorry, I face to a bed...

Regards,
Hiroshi Saito


From: Magnus Hagander <magnus(at)hagander(dot)net>
To: Dave Page <dpage(at)postgresql(dot)org>
Cc: Hiroshi Saito <z-saito(at)guitar(dot)ocn(dot)ne(dot)jp>, Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>, pgsql-hackers(at)postgresql(dot)org
Subject: Re: [COMMITTERS] pgsql: Re-allow UTF8 encodings on win32.
Date: 2007-10-16 20:17:17
Message-ID: 47151C4D.2070704@hagander.net
Lists: pgsql-committers pgsql-hackers

Dave Page wrote:
> Magnus Hagander wrote:
>> Not so. The locale is Japanese_Japan, really. That's the only part
>> that's relevant for UTF16 encodings, which is what we use to do UTF8. We
>> specifically *don't* try to use Japanese_Japan.65001.
>
> That's not what I mean. From a *usability* perspective, Hiroshi should
> see Japanese_Japan.65001 because he's selected UTF-8 in Japanese_Japan.
> He shouldn't see Japanese_Japan.932 because that definitely isn't what
> he selected.

I'll grant you that from a usability perspective, he should see
Japanese_Japan. Not the .65001 part, though.

//Magnus


From: Dave Page <dpage(at)postgresql(dot)org>
To: Hiroshi Saito <z-saito(at)guitar(dot)ocn(dot)ne(dot)jp>
Cc: Magnus Hagander <magnus(at)hagander(dot)net>, Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>, pgsql-hackers(at)postgresql(dot)org
Subject: Re: [COMMITTERS] pgsql: Re-allow UTF8 encodings on win32.
Date: 2007-10-16 20:18:22
Message-ID: 47151C8E.5020208@postgresql.org
Lists: pgsql-committers pgsql-hackers

Hiroshi Saito wrote:
> Hi.
>
> From: "Dave Page" <dpage(at)postgresql(dot)org>
>>> Yes, Please see,
>>> http://winpg.jp/~saito/pg83/pg83b1-err2.txt
>>> Is that initdb is successful a problem as for this?
>>
>> Oh, sorry - misread that. I chatted with Magnus about that. It is
>> correct, but misleading. pg_control will say Japanese_Japan.932 as well
>> iirc, even though it is really Japanese_Japan.65001.
>
> But, Please see.
> http://winpg.jp/~saito/pg83/pg83b1-err3.txt
> Japanese_Japan.65001 is error...
> Japanese_Japan is true.

Yes, we're faking utf-8 support using utf-16. Specifying it as you have
there bypasses the workaround and tries to use the 65001 codepage which
then fails because LC_CTYPE cannot be set to .65001 in any locale.

/D
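
The conversion step behind this workaround can be sketched as follows. This is a minimal illustration that assumes nothing about the backend beyond what the thread states (UTF8 is converted to UTF16 before being used); the real implementation is C code and is not reproduced here, and the function name `utf8_to_utf16_units` is hypothetical.

```python
def utf8_to_utf16_units(data):
    """Convert UTF-8 bytes to a list of UTF-16 code units.

    Hypothetical helper for illustration only. Invalid UTF-8 input
    raises UnicodeDecodeError instead of being converted.
    """
    text = data.decode("utf-8")
    raw = text.encode("utf-16-le")  # little-endian, no byte-order mark
    return [int.from_bytes(raw[i:i + 2], "little") for i in range(0, len(raw), 2)]


# Two Japanese characters: 6 bytes in UTF-8, 2 code units in UTF-16.
units = utf8_to_utf16_units("日本".encode("utf-8"))
print([hex(u) for u in units])  # ['0x65e5', '0x672c']
```

Once the text is in UTF-16, the code page suffix of the locale no longer plays a role, which is why the .932 or .65001 part is not what matters for a UTF8 database.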


From: Dave Page <dpage(at)postgresql(dot)org>
To: Magnus Hagander <magnus(at)hagander(dot)net>
Cc: Hiroshi Saito <z-saito(at)guitar(dot)ocn(dot)ne(dot)jp>, Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>, pgsql-hackers(at)postgresql(dot)org
Subject: Re: [COMMITTERS] pgsql: Re-allow UTF8 encodings on win32.
Date: 2007-10-16 20:20:53
Message-ID: 47151D25.60804@postgresql.org
Lists: pgsql-committers pgsql-hackers

Magnus Hagander wrote:
> Dave Page wrote:
>> Magnus Hagander wrote:
>>> Not so. The locale is Japanese_Japan, really. That's the only part
>>> that's relevant for UTF16 encodings, which is what we use to do UTF8. We
>>> specifically *don't* try to use Japanese_Japan.65001.
>> That's not what I mean. From a *usability* perspective, Hiroshi should
>> see Japanese_Japan.65001 because he's selected UTF-8 in Japanese_Japan.
>> He shouldn't see Japanese_Japan.932 because that definitely isn't what
>> he selected.
>
> I'll grant you that from a usability perspective, he should see
> Japanese_Japan. Not the .65001 part, though.

Well, that depends on whether we care that we're actually faking the
utf-8 support and/or we want to keep the message consistent with what
you'd see in other locales.

/D


From: "Pavel Stehule" <pavel(dot)stehule(at)gmail(dot)com>
To: "Hiroshi Saito" <z-saito(at)guitar(dot)ocn(dot)ne(dot)jp>
Cc: "Magnus Hagander" <magnus(at)hagander(dot)net>, "Tom Lane" <tgl(at)sss(dot)pgh(dot)pa(dot)us>, "Dave Page" <dpage(at)postgresql(dot)org>, pgsql-hackers(at)postgresql(dot)org
Subject: Re: [COMMITTERS] pgsql: Re-allow UTF8 encodings on win32.
Date: 2007-10-16 20:37:14
Message-ID: 162867790710161337r2c3db7bbu9cfe794d0f700efd@mail.gmail.com
Lists: pgsql-committers pgsql-hackers

2007/10/16, Hiroshi Saito <z-saito(at)guitar(dot)ocn(dot)ne(dot)jp>:
> Hi.
>
> > I can test it with czech locale. Can I download binaries anywhere?
> http://winpg.jp/~saito/pg83/postgresql-8.3beta-cvs.tgz
> It is a thing after regression test.(MinGW+gcc)
>

I have problem, there isn't libintl-2.dll

Pavel


From: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To: Magnus Hagander <magnus(at)hagander(dot)net>
Cc: Dave Page <dpage(at)postgresql(dot)org>, Hiroshi Saito <z-saito(at)guitar(dot)ocn(dot)ne(dot)jp>, pgsql-hackers(at)postgresql(dot)org
Subject: Re: [COMMITTERS] pgsql: Re-allow UTF8 encodings on win32.
Date: 2007-10-16 21:07:18
Message-ID: 23597.1192568838@sss.pgh.pa.us
Lists: pgsql-committers pgsql-hackers

Magnus Hagander <magnus(at)hagander(dot)net> writes:
> Dave Page wrote:
>> SJIS wasn't ever supported as a server encoding
>> (http://www.postgresql.org/docs/8.2/interactive/multibyte.html). The
>> fact that initdb continues if you use Japanese_Japan.932 is an
>> inconsistency I reported previously but has yet to be fixed.

> That is a good point, if unrelated to this very discussion. Do we want
> to change that thing to an exit instead of complain-and-continue? I
> think yes?

Yeah, I thought we'd agreed to that a few days ago.

regards, tom lane
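
The difference between "complain-and-continue" and "exit" can be sketched as follows. This is a minimal illustration under stated assumptions, not initdb's code: the function name `check_server_encoding` and the `strict` flag are hypothetical, and the set of client-only encodings is an illustrative subset based on the multibyte documentation linked in this thread.

```python
import sys

# Illustrative subset of encodings that cannot be used server-side.
CLIENT_ONLY_ENCODINGS = frozenset({"SJIS", "BIG5", "GBK", "UHC", "GB18030"})


def check_server_encoding(encoding, strict=True):
    """Return the normalized encoding name if it may be used server-side.

    Hypothetical helper. strict=True models "exit": the problem is
    raised as an error that the caller would treat as fatal.
    strict=False models "complain-and-continue": a warning is printed
    and processing goes on with the unusable encoding.
    """
    name = encoding.upper()
    if name in CLIENT_ONLY_ENCODINGS:
        message = 'encoding "%s" is not allowed as a server-side encoding' % name
        if strict:
            raise ValueError(message)
        print("warning: " + message, file=sys.stderr)
    return name
```

With `strict=True`, a locale that implies SJIS stops the run immediately instead of producing a cluster whose reported settings are misleading.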


From: "Hiroshi Saito" <z-saito(at)guitar(dot)ocn(dot)ne(dot)jp>
To: "Pavel Stehule" <pavel(dot)stehule(at)gmail(dot)com>
Cc: "Magnus Hagander" <magnus(at)hagander(dot)net>, "Tom Lane" <tgl(at)sss(dot)pgh(dot)pa(dot)us>, "Dave Page" <dpage(at)postgresql(dot)org>, <pgsql-hackers(at)postgresql(dot)org>
Subject: Re: [COMMITTERS] pgsql: Re-allow UTF8 encodings on win32.
Date: 2007-10-16 23:56:38
Message-ID: 0ac601c81050$314a9a60$c601a8c0@HP22720319231
Lists: pgsql-committers pgsql-hackers

Hi.

From: "Pavel Stehule" <pavel(dot)stehule(at)gmail(dot)com>
>> > I can test it with czech locale. Can I download binaries anywhere?
>> http://winpg.jp/~saito/pg83/postgresql-8.3beta-cvs.tgz
>> It is a thing after regression test.(MinGW+gcc)
>>
>
> I have problem, there isn't libintl-2.dll

Ooops, sorry, it is full-build.
Please, this is minimum composition
http://winpg.jp/~saito/pg83/postgresql-8.3beta-cvs-minbin.tgz
Thanks.

Regards,
Hiroshi Saito


From: "Hiroshi Saito" <z-saito(at)guitar(dot)ocn(dot)ne(dot)jp>
To: "Magnus Hagander" <magnus(at)hagander(dot)net>
Cc: "Dave Page" <dpage(at)postgresql(dot)org>, "Tom Lane" <tgl(at)sss(dot)pgh(dot)pa(dot)us>, <pgsql-hackers(at)postgresql(dot)org>
Subject: Re: [COMMITTERS] pgsql: Re-allow UTF8 encodings on win32.
Date: 2007-10-17 00:29:10
Message-ID: 0af101c81054$bcb58890$c601a8c0@HP22720319231
Lists: pgsql-committers pgsql-hackers

Hi.

From: "Magnus Hagander" <magnus(at)hagander(dot)net>

>> But, Please see.
>> http://winpg.jp/~saito/pg83/pg83b1-err3.txt
>> Japanese_Japan.65001 is error...
>> Japanese_Japan is true.
>
> Yes, that is expected. If you explicitly ask for the .65001 locale it
> will try the one that doesn't have the proper NLS files, and that
> shouldn't work. If you just put in Japanese_Japan, it will use the UTF16
> locale.

Umm, As for result ...
initdb -E UTF8 --locale=Japanese_Japan -D../data
http://winpg.jp/~saito/pg83/pg83b1-err4.txt
It seems that it is only complemented.

Regards,
Hiroshi Saito


From: Dave Page <dpage(at)postgresql(dot)org>
To: Hiroshi Saito <z-saito(at)guitar(dot)ocn(dot)ne(dot)jp>
Cc: Magnus Hagander <magnus(at)hagander(dot)net>, Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>, pgsql-hackers(at)postgresql(dot)org
Subject: Re: [COMMITTERS] pgsql: Re-allow UTF8 encodings on win32.
Date: 2007-10-17 08:52:41
Message-ID: 4715CD59.5040802@postgresql.org
Lists: pgsql-committers pgsql-hackers

Hiroshi Saito wrote:
> Hi.
>
> From: "Magnus Hagander" <magnus(at)hagander(dot)net>
>
>>> But, Please see.
>>> http://winpg.jp/~saito/pg83/pg83b1-err3.txt
>>> Japanese_Japan.65001 is error...
>>> Japanese_Japan is true.
>>
>> Yes, that is expected. If you explicitly ask for the .65001 locale it
>> will try the one that doesn't have the proper NLS files, and that
>> shouldn't work. If you just put in Japanese_Japan, it will use the UTF16
>> locale.
>
> Umm, As for result ... initdb -E UTF8 --locale=Japanese_Japan -D../data
> http://winpg.jp/~saito/pg83/pg83b1-err4.txt
> It seems that it is only complemented.

Yes, that is expected, though not entirely to my tastes. The cluster
should still actually be in utf-8 however.

/D


From: "Pavel Stehule" <pavel(dot)stehule(at)gmail(dot)com>
To: "Dave Page" <dpage(at)postgresql(dot)org>
Cc: "Hiroshi Saito" <z-saito(at)guitar(dot)ocn(dot)ne(dot)jp>, "Magnus Hagander" <magnus(at)hagander(dot)net>, "Tom Lane" <tgl(at)sss(dot)pgh(dot)pa(dot)us>, pgsql-hackers(at)postgresql(dot)org
Subject: Re: [COMMITTERS] pgsql: Re-allow UTF8 encodings on win32.
Date: 2007-10-17 14:47:17
Message-ID: 162867790710170747i1c329a93s59e4450b19de5c7d@mail.gmail.com
Lists: pgsql-committers pgsql-hackers

I did some tests, but without success.

Pavel

I have Win2003 Server with Czech locale support.

I:\PGSQL\BIN>initdb -D ../data -L i:\pgsql\share
The files belonging to this database system will be owned by user "postgres".
This user must also own the server process.
The database cluster will be initialized with locale Czech_Czech Republic.1250.
could not determine encoding for locale "Czech_Czech Republic.1250": codeset is "CP1250"
INITDB: could not find suitable encoding for locale Czech_Czech Republic.1250
Rerun INITDB with the -E option.
Try "INITDB --help" for more information.

I:\PGSQL\BIN>initdb -E UTF-8 -D ../data -L i:\pgsql\share
The files belonging to this database system will be owned by user "postgres".
This user must also own the server process.
The database cluster will be initialized with locale Czech_Czech Republic.1250.
could not determine encoding for locale "Czech_Czech Republic.1250": codeset is "CP1250"
INITDB: could not find suitable text search configuration for locale Czech_Czech Republic.1250
The default text search configuration will be set to "simple".
fixing permissions on existing directory ../data ... ok
creating subdirectories ... ok
selecting default max_connections ... 10
selecting default shared_buffers/max_fsm_pages ... 400kB/20000
creating configuration files ... ok
creating template1 database in ../data/base/1 ... FATAL: could not select a suitable default timezone
DETAIL: It appears that your GMT time zone uses leap seconds. PostgreSQL does not support leap seconds.
child process exited with exit code 1
INITDB: removing contents of data directory "../data"

I:\PGSQL\BIN>initdb -E win1250 --locale="Czech_Czech Republic.1250" -D ../data -L i:\pgsql\share
The files belonging to this database system will be owned by user "postgres".
This user must also own the server process.
The database cluster will be initialized with locale Czech_Czech Republic.1250.
could not determine encoding for locale "Czech_Czech Republic.1250": codeset is "CP1250"
INITDB: could not find suitable text search configuration for locale Czech_Czech Republic.1250
The default text search configuration will be set to "simple".
fixing permissions on existing directory ../data ... ok
creating subdirectories ... ok
selecting default max_connections ... 10
selecting default shared_buffers/max_fsm_pages ... 400kB/20000
creating configuration files ... ok
creating template1 database in ../data/base/1 ... FATAL: could not select a suitable default timezone
DETAIL: It appears that your GMT time zone uses leap seconds. PostgreSQL does not support leap seconds.
child process exited with exit code 1
INITDB: removing contents of data directory "../data"


From: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To: "Pavel Stehule" <pavel(dot)stehule(at)gmail(dot)com>
Cc: "Dave Page" <dpage(at)postgresql(dot)org>, "Hiroshi Saito" <z-saito(at)guitar(dot)ocn(dot)ne(dot)jp>, "Magnus Hagander" <magnus(at)hagander(dot)net>, pgsql-hackers(at)postgresql(dot)org
Subject: Re: [COMMITTERS] pgsql: Re-allow UTF8 encodings on win32.
Date: 2007-10-17 15:24:50
Message-ID: 6384.1192634690@sss.pgh.pa.us
Lists: pgsql-committers pgsql-hackers

"Pavel Stehule" <pavel(dot)stehule(at)gmail(dot)com> writes:
> could not determine encoding for locale "Czech_Czech Republic.1250": codeset is "CP1250"

Hm, we seem to have missed an entry for PG_WIN1250. Fixed.

regards, tom lane
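
The kind of lookup that fails when an entry is missing can be sketched as follows. This is a minimal illustration, not the contents of chklocale.c: the table is a small hypothetical subset, and the names `codeset_from_locale` and `encoding_for_locale` are assumptions made for the example.

```python
# Illustrative subset only; the real table is larger and lives in C.
CODESET_TO_ENCODING = {
    "CP1250": "WIN1250",
    "CP1251": "WIN1251",
    "CP1252": "WIN1252",
}


def codeset_from_locale(locale_name):
    """Derive a codeset name such as 'CP1250' from a Windows locale string."""
    _, dot, codepage = locale_name.rpartition(".")
    if dot and codepage.isdigit():
        return "CP" + codepage
    return None


def encoding_for_locale(locale_name, table=CODESET_TO_ENCODING):
    """Return the encoding name for a locale, or None when no entry matches."""
    return table.get(codeset_from_locale(locale_name))


# With the entry present, the lookup succeeds.
print(encoding_for_locale("Czech_Czech Republic.1250"))  # WIN1250

# Without it, the lookup finds nothing, which corresponds to the
# "could not determine encoding" message in the transcript above.
without_1250 = {k: v for k, v in CODESET_TO_ENCODING.items() if k != "CP1250"}
print(encoding_for_locale("Czech_Czech Republic.1250", without_1250))  # None
```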