Re: ltree PostgreSQL Module

Lists: pgsql-hackers
From: "John Hansen" <john(at)geeknet(dot)com(dot)au>
To: "Oleg Bartunov" <oleg(at)sai(dot)msu(dot)su>
Cc: "Teodor Sigaev" <teodor(at)stack(dot)net>, "Pgsql Hackers" <pgsql-hackers(at)postgresql(dot)org>
Subject: Re: ltree PostgreSQL Module
Date: 2004-11-11 09:56:13
Message-ID: 5066E5A966339E42AA04BA10BA706AE56205@rodrick.geeknet.com.au

Oleg,

For example:

The DMOZ topic: Top/World/Español/Países/México/Referencia/Bibliotecas/Nacionales

select text2ltree(replace('Top/World/Español/Países/México/Referencia/Bibliotecas/Nacionales','/','.'));
ERROR: syntax error at position 14 near "Ã"

I've also found that topics contain commas (,), as in the DMOZ topic: Top/Arts/Music/Composition/Composers/F/Fasch,_Johann_Friedrich
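
Both failures above are what a byte-wise label check would produce. The following is only a minimal sketch, assuming the parser accepts ASCII letters, digits and '_' inside a label and '.' as the separator. The function names are hypothetical and this is not the ltree source. Under that assumption, the first byte of the UTF-8 encoding of "ñ" (0xC3, which displays as "Ã" in Latin-1) is rejected at 0-based byte offset 14, which would be consistent with the position reported in the error.

```c
#include <ctype.h>
#include <stddef.h>

/* Assumed label rule (not taken from the ltree source): ASCII letters,
 * digits and '_' only. The c < 0x80 guard keeps the check independent
 * of the process locale. */
static int is_label_byte(unsigned char c)
{
    return c < 0x80 && (isalnum(c) || c == '_');
}

/* Returns the 0-based byte offset of the first byte that is neither a
 * label byte nor the '.' separator, or -1 if every byte is accepted.
 * Empty labels (consecutive dots) are not checked in this sketch. */
long first_invalid_byte(const char *path)
{
    for (size_t i = 0; path[i] != '\0'; i++) {
        unsigned char c = (unsigned char) path[i];

        if (c != '.' && !is_label_byte(c))
            return (long) i;
    }
    return -1;
}
```

Called on the UTF-8 bytes of "Top.World.Español", this returns 14. A path containing a comma, such as "Top.Fasch,_Johann", is rejected at the comma for the same reason.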

Kind regards

John

> -----Original Message-----
> From: Oleg Bartunov [mailto:oleg(at)sai(dot)msu(dot)su]
> Sent: Thursday, November 11, 2004 8:48 PM
> To: John Hansen
> Cc: Teodor Sigaev; Pgsql Hackers
> Subject: Re: ltree PostgreSQL Module
>
> John,
>
> On Thu, 11 Nov 2004, John Hansen wrote:
>
> > Hello folks,
> >
> > It seems that the ltree module does not support the data for which it
> > was (apparently) created.
> >
> > The DMOZ data is encoded in UTF8, but ISALNUM() does not support
> > unicode characters...
> > In fact, it does not support any database encoding.
> >
> > Are there any plans to change this?
>
> It's very difficult to see your problem without any examples!
> Many people are happy users of the ltree module, so you probably
> have some other problem. Any test demonstrating your problem
> would be very useful.
> Also, posting to the mailing lists would help other people help you.
>
> >
> > Kind Regards,
> >
> > John Hansen
> >
>
> Regards,
> Oleg
> _____________________________________________________________
> Oleg Bartunov, sci.researcher, hostmaster of AstroNet,
> Sternberg Astronomical Institute, Moscow University (Russia)
> Internet: oleg(at)sai(dot)msu(dot)su, http://www.sai.msu.su/~megera/
> phone: +007(095)939-16-83, +007(095)939-23-83
>
>


From: Oleg Bartunov <oleg(at)sai(dot)msu(dot)su>
To: John Hansen <john(at)geeknet(dot)com(dot)au>
Cc: Teodor Sigaev <teodor(at)stack(dot)net>, Pgsql Hackers <pgsql-hackers(at)postgresql(dot)org>
Subject: Re: ltree PostgreSQL Module
Date: 2004-11-11 13:35:16
Message-ID: Pine.GSO.4.61.0411111631560.23066@ra.sai.msu.su
Lists: pgsql-hackers

Hmm,

I don't see that the error comes from the ltree module rather than from the 'replace' function.
Also, are you sure your PostgreSQL setup is OK (it could be a locale issue)?

On Thu, 11 Nov 2004, John Hansen wrote:

> Oleg,
>
> For example:
>
> The DMOZ topic: Top/World/Español/Países/México/Referencia/Bibliotecas/Nacionales
>
> select text2ltree(replace('Top/World/Español/Países/México/Referencia/Bibliotecas/Nacionales','/','.'));
> ERROR: syntax error at position 14 near "Ã"
>
> I've also found that topics contain , as in the DMOZ Topic: Top/Arts/Music/Composition/Composers/F/Fasch,_Johann_Friedrich
>
> Kind regards
>
> John

Regards,
Oleg
_____________________________________________________________
Oleg Bartunov, sci.researcher, hostmaster of AstroNet,
Sternberg Astronomical Institute, Moscow University (Russia)
Internet: oleg(at)sai(dot)msu(dot)su, http://www.sai.msu.su/~megera/
phone: +007(095)939-16-83, +007(095)939-23-83


From: Oleg Bartunov <oleg(at)sai(dot)msu(dot)su>
To: John Hansen <john(at)geeknet(dot)com(dot)au>
Cc: Teodor Sigaev <teodor(at)stack(dot)net>, Pgsql Hackers <pgsql-hackers(at)postgresql(dot)org>
Subject: Re: ltree PostgreSQL Module
Date: 2004-11-11 14:42:01
Message-ID: Pine.GSO.4.61.0411111740550.25413@ra.sai.msu.su
Lists: pgsql-hackers

John,

I missed the part about UTF-8 :) ltree doesn't support UTF-8 yet.

Oleg
On Thu, 11 Nov 2004, John Hansen wrote:

> Oleg,
>
> For example:
>
> The DMOZ topic: Top/World/Español/Países/México/Referencia/Bibliotecas/Nacionales
>
> select text2ltree(replace('Top/World/Español/Países/México/Referencia/Bibliotecas/Nacionales','/','.'));
> ERROR: syntax error at position 14 near "Ã"
>
> I've also found that topics contain , as in the DMOZ Topic: Top/Arts/Music/Composition/Composers/F/Fasch,_Johann_Friedrich
>
> Kind regards
>
> John

Regards,
Oleg
_____________________________________________________________
Oleg Bartunov, sci.researcher, hostmaster of AstroNet,
Sternberg Astronomical Institute, Moscow University (Russia)
Internet: oleg(at)sai(dot)msu(dot)su, http://www.sai.msu.su/~megera/
phone: +007(095)939-16-83, +007(095)939-23-83


From: John Hansen <john(at)geeknet(dot)com(dot)au>
To: Oleg Bartunov <oleg(at)sai(dot)msu(dot)su>
Cc: Teodor Sigaev <teodor(at)stack(dot)net>, Pgsql Hackers <pgsql-hackers(at)postgresql(dot)org>
Subject: Re: ltree PostgreSQL Module
Date: 2004-11-11 21:15:35
Message-ID: 1100207735.20649.7.camel@localhost.localdomain
Lists: pgsql-hackers

> I missed the part about UTF-8 :) ltree doesn't support UTF-8 yet.

OK, how about all the 'other' characters from US-ASCII:
,.?!@#$%^&*()_+-=[]{}\|'"?><`~

These 'should' all be valid for the ltxtquery, ltree, and ltree[] types,
except maybe for . which is used as the separator (and maybe . should be
valid too, if preceded by a '\', just as you would in a regex to
make the next character a literal).

... John
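
The backslash-escape idea above can be illustrated with a small label splitter. This is a minimal sketch under assumed rules: '.' separates labels unless preceded by '\', and a backslash makes the next character literal. The function name next_label and its return codes are hypothetical. Nothing here reflects how ltree actually parses paths.

```c
#include <stddef.h>

/* Copies the next label from *cursor into out, turning "\x" into a
 * literal "x", and advances *cursor past the separator.
 * Returns 1 if a label was produced, 0 if the input is exhausted, and
 * -1 if the label does not fit in out or ends in a lone backslash.
 * Labels may come back empty; validating them is left to the caller. */
int next_label(const char **cursor, char *out, size_t out_size)
{
    const char *p = *cursor;
    size_t n = 0;

    if (p == NULL)
        return 0;
    if (out_size == 0)
        return -1;

    while (*p != '\0' && *p != '.') {
        if (*p == '\\') {
            p++;                    /* skip the escape character */
            if (*p == '\0')
                return -1;          /* nothing left to escape */
        }
        if (n + 1 >= out_size)
            return -1;              /* keep room for the terminator */
        out[n++] = *p++;
    }
    out[n] = '\0';

    /* Step over the separator, or mark the input as finished. */
    *cursor = (*p == '.') ? p + 1 : NULL;
    return 1;
}
```

With the input Composers.F.Fasch\,_J\.F\. repeated calls yield the labels "Composers", "F" and "Fasch,_J.F.", after which the function returns 0.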


From: Oleg Bartunov <oleg(at)sai(dot)msu(dot)su>
To: John Hansen <john(at)geeknet(dot)com(dot)au>
Cc: Teodor Sigaev <teodor(at)stack(dot)net>, Pgsql Hackers <pgsql-hackers(at)postgresql(dot)org>
Subject: Re: ltree PostgreSQL Module
Date: 2004-11-12 05:34:49
Message-ID: Pine.GSO.4.61.0411120833260.25413@ra.sai.msu.su
Lists: pgsql-hackers

On Fri, 12 Nov 2004, John Hansen wrote:

>> I missed the part about UTF-8 :) ltree doesn't support UTF-8 yet.
>
> OK, how about all the 'other' characters from US-ASCII:
> ,.?!@#$%^&*()_+-=[]{}\|'"?><`~
>
> These 'should' all be valid for the ltxtquery, ltree, and ltree[] types,
> except maybe for . which is used as the separator (and maybe . should be
> valid too, if preceded by a '\', just as you would in a regex to
> make the next character a literal).

I agree with you, but unfortunately it's not easy, and we have no time
to work on ltree now.

>
> ... John
>
>

Regards,
Oleg
_____________________________________________________________
Oleg Bartunov, sci.researcher, hostmaster of AstroNet,
Sternberg Astronomical Institute, Moscow University (Russia)
Internet: oleg(at)sai(dot)msu(dot)su, http://www.sai.msu.su/~megera/
phone: +007(095)939-16-83, +007(095)939-23-83