Lists: | pgsql-hackers |
---|
From: | "John Hansen" <john(at)geeknet(dot)com(dot)au> |
---|---|
To: | "Oleg Bartunov" <oleg(at)sai(dot)msu(dot)su> |
Cc: | "Teodor Sigaev" <teodor(at)stack(dot)net>, "Pgsql Hackers" <pgsql-hackers(at)postgresql(dot)org> |
Subject: | Re: ltree PostgreSQL Module |
Date: | 2004-11-11 09:56:13 |
Message-ID: | 5066E5A966339E42AA04BA10BA706AE56205@rodrick.geeknet.com.au |
Oleg,
For example:
The DMOZ topic: Top/World/Español/Países/México/Referencia/Bibliotecas/Nacionales
select text2ltree(replace('Top/World/Español/Países/México/Referencia/Bibliotecas/Nacionales','/','.'));
ERROR: syntax error at position 14 near "Ã"
I've also found that topics can contain commas (','), as in the DMOZ topic: Top/Arts/Music/Composition/Composers/F/Fasch,_Johann_Friedrich
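To illustrate why the error is reported at position 14, here is a minimal Python sketch of a byte-wise label check. It is not the ltree source: it assumes that a label byte is accepted only if it is an ASCII letter, digit, or underscore, and the name `first_bad_byte` is hypothetical.

```python
import string

# Assumption: only ASCII letters, digits and '_' are accepted inside a label.
ALLOWED = set((string.ascii_letters + string.digits + "_").encode("ascii"))


def first_bad_byte(path, sep="."):
    """Return (position, byte) of the first rejected byte, or None if all pass.

    The path is examined byte by byte in its UTF-8 encoding, so every
    multi-byte character is rejected at its first byte.
    """
    data = path.encode("utf-8")
    for pos, byte in enumerate(data):
        if byte == ord(sep):
            continue  # the separator itself is always allowed
        if byte not in ALLOWED:
            return pos, byte
    return None


topic = "Top/World/Español/Países/México/Referencia/Bibliotecas/Nacionales"
print(first_bad_byte(topic.replace("/", ".")))  # (14, 195), 195 == 0xC3
```

Under that assumption the check stops at byte offset 14, the first byte (0xC3) of the UTF-8 encoding of "ñ", and the comma in Fasch,_Johann_Friedrich is rejected in the same way.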
Kind regards
John
> -----Original Message-----
> From: Oleg Bartunov [mailto:oleg(at)sai(dot)msu(dot)su]
> Sent: Thursday, November 11, 2004 8:48 PM
> To: John Hansen
> Cc: Teodor Sigaev; Pgsql Hackers
> Subject: Re: ltree PostgreSQL Module
>
> John,
>
> On Thu, 11 Nov 2004, John Hansen wrote:
>
> > Hello folks,
> >
> > It seems that the ltree module does not support the data for which it
> > was (apparently) created.
> >
> > The DMOZ data is encoded in UTF8, but ISALNUM() does not support
> > unicode characters...
> > In fact, it does not support any database encoding.
> >
> > Are there any plans to change this?
>
> It's very difficult to see your problem without any examples!
> Many people are happy users of the ltree module, so you probably
> have some other problem. Any test demonstrating your problem
> would be very useful.
> Also, posting to the mailing lists would help other people help you.
>
> >
> > Kind Regards,
> >
> > John Hansen
> >
>
> Regards,
> Oleg
> _____________________________________________________________
> Oleg Bartunov, sci.researcher, hostmaster of AstroNet,
> Sternberg Astronomical Institute, Moscow University (Russia)
> Internet: oleg(at)sai(dot)msu(dot)su, http://www.sai.msu.su/~megera/
> phone: +007(095)939-16-83, +007(095)939-23-83
>
>
From: | Oleg Bartunov <oleg(at)sai(dot)msu(dot)su> |
---|---|
To: | John Hansen <john(at)geeknet(dot)com(dot)au> |
Cc: | Teodor Sigaev <teodor(at)stack(dot)net>, Pgsql Hackers <pgsql-hackers(at)postgresql(dot)org> |
Subject: | Re: ltree PostgreSQL Module |
Date: | 2004-11-11 13:35:16 |
Message-ID: | Pine.GSO.4.61.0411111631560.23066@ra.sai.msu.su |
Hmm,
I don't see that the error comes from the ltree module rather than from the 'replace' function.
Also, are you sure your PostgreSQL setup is OK (this could be a locale issue)?
Regards,
Oleg
_____________________________________________________________
Oleg Bartunov, sci.researcher, hostmaster of AstroNet,
Sternberg Astronomical Institute, Moscow University (Russia)
Internet: oleg(at)sai(dot)msu(dot)su, http://www.sai.msu.su/~megera/
phone: +007(095)939-16-83, +007(095)939-23-83
From: | Oleg Bartunov <oleg(at)sai(dot)msu(dot)su> |
---|---|
To: | John Hansen <john(at)geeknet(dot)com(dot)au> |
Cc: | Teodor Sigaev <teodor(at)stack(dot)net>, Pgsql Hackers <pgsql-hackers(at)postgresql(dot)org> |
Subject: | Re: ltree PostgreSQL Module |
Date: | 2004-11-11 14:42:01 |
Message-ID: | Pine.GSO.4.61.0411111740550.25413@ra.sai.msu.su |
John,
I missed the part about UTF-8 :) ltree doesn't support UTF-8 yet.
Oleg
Regards,
Oleg
_____________________________________________________________
Oleg Bartunov, sci.researcher, hostmaster of AstroNet,
Sternberg Astronomical Institute, Moscow University (Russia)
Internet: oleg(at)sai(dot)msu(dot)su, http://www.sai.msu.su/~megera/
phone: +007(095)939-16-83, +007(095)939-23-83
From: | John Hansen <john(at)geeknet(dot)com(dot)au> |
---|---|
To: | Oleg Bartunov <oleg(at)sai(dot)msu(dot)su> |
Cc: | Teodor Sigaev <teodor(at)stack(dot)net>, Pgsql Hackers <pgsql-hackers(at)postgresql(dot)org> |
Subject: | Re: ltree PostgreSQL Module |
Date: | 2004-11-11 21:15:35 |
Message-ID: | 1100207735.20649.7.camel@localhost.localdomain |
> I missed the part about UTF-8 :) ltree doesn't support UTF-8 yet.
OK, how about all the 'other' characters from US-ASCII:
,.?!@#$%^&*()_+-=[]{}\|'"?><`~
These 'should' all be valid for the ltxtquery, ltree, and ltree[] types,
except maybe for '.', which is used as the separator (and maybe '.' should be
valid too, if preceded by a '\', just as you would do in a regex to
make the next character a literal).
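The escaping idea can be sketched in a few lines of Python. This only illustrates the proposal, not anything ltree does: the function `split_labels` and its parameters are hypothetical, and a backslash is assumed to make the following character literal.

```python
def split_labels(path, sep=".", esc="\\"):
    """Split a path on unescaped separators.

    A character preceded by the escape character is copied literally,
    so an escaped '.' stays inside its label.
    """
    labels, current = [], []
    chars = iter(path)
    for ch in chars:
        if ch == esc:
            # Copy the next character verbatim; a trailing escape is kept as-is.
            current.append(next(chars, esc))
        elif ch == sep:
            labels.append("".join(current))
            current = []
        else:
            current.append(ch)
    labels.append("".join(current))
    return labels


print(split_labels(r"Top.Fasch,_Johann_Friedrich.v1\.0"))
# ['Top', 'Fasch,_Johann_Friedrich', 'v1.0']
```

With such a rule every other US-ASCII character could appear in a label unchanged, and only the separator and the escape character itself would need a backslash.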
... John
From: | Oleg Bartunov <oleg(at)sai(dot)msu(dot)su> |
---|---|
To: | John Hansen <john(at)geeknet(dot)com(dot)au> |
Cc: | Teodor Sigaev <teodor(at)stack(dot)net>, Pgsql Hackers <pgsql-hackers(at)postgresql(dot)org> |
Subject: | Re: ltree PostgreSQL Module |
Date: | 2004-11-12 05:34:49 |
Message-ID: | Pine.GSO.4.61.0411120833260.25413@ra.sai.msu.su |
On Fri, 12 Nov 2004, John Hansen wrote:
>> I missed the part about UTF-8 :) ltree doesn't support UTF-8 yet.
>
> OK, how about all the 'other' characters from US-ASCII:
> ,.?!@#$%^&*()_+-=[]{}\|'"?><`~
>
> These 'should' all be valid for the ltxtquery, ltree, and ltree[] types,
> except maybe for '.', which is used as the separator (and maybe '.' should be
> valid too, if preceded by a '\', just as you would do in a regex to
> make the next character a literal).
I agree with you, but unfortunately it's not easy, and we have no time
to work on ltree right now.
>
> ... John
>
>
Regards,
Oleg
_____________________________________________________________
Oleg Bartunov, sci.researcher, hostmaster of AstroNet,
Sternberg Astronomical Institute, Moscow University (Russia)
Internet: oleg(at)sai(dot)msu(dot)su, http://www.sai.msu.su/~megera/
phone: +007(095)939-16-83, +007(095)939-23-83