Lists: | pgsql-hackers |
---|
From: | Andreas Joseph Krogh <andreak(at)officenet(dot)no> |
---|---|
To: | pgsql-hackers(at)postgresql(dot)org |
Subject: | accentuated letters in text-search |
Date: | 2010-07-21 21:23:43 |
Message-ID: | 4C47655F.6050809@officenet.no |
Hi.
I was googling for how to create a text-search config with the following
properties:
- Map Unicode accented letters to their unaccented equivalents
- No stop-words
- Lowercase all words
And came across this from -general:
http://www.techienuggets.com/Comments?tx=106813
Then after some more googling I found this:
http://www.sai.msu.su/~megera/wiki/unaccent
Is there any reason the unaccent dictionary and function did not make it into 9.0?
--
Andreas Joseph Krogh <andreak(at)officenet(dot)no>
Senior Software Developer / CTO
------------------------+---------------------------------------------+
OfficeNet AS | The most difficult thing in the world is to |
Rosenholmveien 25 | know how to do a thing and to watch |
1414 Trollåsen | somebody else doing it wrong, without |
NORWAY | comment. |
| |
Tlf: +47 24 15 38 90 | |
Fax: +47 24 15 38 91 | |
Mobile: +47 909 56 963 | |
------------------------+---------------------------------------------+
From: | Guillaume Lelarge <guillaume(at)lelarge(dot)info> |
---|---|
To: | Andreas Joseph Krogh <andreak(at)officenet(dot)no> |
Cc: | pgsql-hackers(at)postgresql(dot)org |
Subject: | Re: accentuated letters in text-search |
Date: | 2010-07-22 05:42:03 |
Message-ID: | 4C47DA2B.8070802@lelarge.info |
On 21/07/2010 23:23, Andreas Joseph Krogh wrote:
> [...]
> I was googling for how to create a text-search config with the following
> properties:
> - Map unicode accentuated letters to an un-accentuated equivalent
> - No stop-words
> - Lowercase all words
>
> And came over this from -general:
> http://www.techienuggets.com/Comments?tx=106813
>
> Then after some more googling I found this:
> http://www.sai.msu.su/~megera/wiki/unaccent
>
> Any reason the unaccent dict. and function did not make it in 9.0?
>
Well, AFAICT, it's available in 9.0:
http://www.postgresql.org/docs/9.0/static/unaccent.html
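
For readers who want to see what the three requested properties amount to, here is a minimal Python sketch. It is only an illustration of the normalization (strip accents, keep every word, lowercase), not how the unaccent module is implemented; the function names `normalize_token` and `normalize_text` are made up for this example. It relies on Unicode decomposition, so letters that have no decomposition (for example "ø") pass through unchanged.

```python
import unicodedata
from typing import List


def normalize_token(token: str) -> str:
    """Strip combining accents from a token and lowercase it."""
    # NFKD splits an accented letter into a base letter plus combining marks.
    decomposed = unicodedata.normalize("NFKD", token)
    # Drop the combining marks, keeping only the base characters.
    stripped = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    return stripped.lower()


def normalize_text(text: str) -> List[str]:
    """Split on whitespace and normalize each word; no stop-words are removed."""
    return [normalize_token(tok) for tok in text.split()]


if __name__ == "__main__":
    print(normalize_text("Hôtel Trollåsen is THE place"))
    # prints ['hotel', 'trollasen', 'is', 'the', 'place']
```

Note that common words such as "is" and "the" survive, which is the "no stop-words" requirement.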
--
Guillaume
http://www.postgresql.fr
http://dalibo.com
From: | Andreas Joseph Krogh <andreak(at)officenet(dot)no> |
---|---|
To: | pgsql-hackers(at)postgresql(dot)org |
Subject: | Re: accentuated letters in text-search |
Date: | 2010-07-22 07:57:30 |
Message-ID: | 4C47F9EA.8060000@officenet.no |
On 07/22/2010 07:42 AM, Guillaume Lelarge wrote:
> On 21/07/2010 23:23, Andreas Joseph Krogh wrote:
>
>> [...]
>> I was googling for how to create a text-search config with the following
>> properties:
>> - Map unicode accentuated letters to an un-accentuated equivalent
>> - No stop-words
>> - Lowercase all words
>>
>> And came over this from -general:
>> http://www.techienuggets.com/Comments?tx=106813
>>
>> Then after some more googling I found this:
>> http://www.sai.msu.su/~megera/wiki/unaccent
>>
>> Any reason the unaccent dict. and function did not make it in 9.0?
>>
>>
> Well, AFAICT, it's available in 9.0:
>
> http://www.postgresql.org/docs/9.0/static/unaccent.html
>
My contrib-fu was pretty low last night, it seems. Sorry for the noise...
--
Andreas Joseph Krogh <andreak(at)officenet(dot)no>
Senior Software Developer / CTO