accentuated letters in text-search

Lists: pgsql-hackers
From: Andreas Joseph Krogh <andreak(at)officenet(dot)no>
To: pgsql-hackers(at)postgresql(dot)org
Subject: accentuated letters in text-search
Date: 2010-07-21 21:23:43
Message-ID: 4C47655F.6050809@officenet.no
Views: Raw Message | Whole Thread | Download mbox | Resend email
Lists: pgsql-hackers

Hi.

I was googling for how to create a text-seach-config with the following
properties:
- Map unicode accentuated letters to an un-accentuated equivalent
- No stop-words
- Lowercase all words

And came over this from -general:
http://www.techienuggets.com/Comments?tx=106813

Then after some more googling I found this:
http://www.sai.msu.su/~megera/wiki/unaccent

Any reason the unaccent dict. and function did not make it in 9.0?

--
Andreas Joseph Krogh<andreak(at)officenet(dot)no>
Senior Software Developer / CTO
------------------------+---------------------------------------------+
OfficeNet AS | The most difficult thing in the world is to |
Rosenholmveien 25 | know how to do a thing and to watch |
1414 Trollåsen | somebody else doing it wrong, without |
NORWAY | comment. |
| |
Tlf: +47 24 15 38 90 | |
Fax: +47 24 15 38 91 | |
Mobile: +47 909 56 963 | |
------------------------+---------------------------------------------+


From: Guillaume Lelarge <guillaume(at)lelarge(dot)info>
To: Andreas Joseph Krogh <andreak(at)officenet(dot)no>
Cc: pgsql-hackers(at)postgresql(dot)org
Subject: Re: accentuated letters in text-search
Date: 2010-07-22 05:42:03
Message-ID: 4C47DA2B.8070802@lelarge.info
Views: Raw Message | Whole Thread | Download mbox | Resend email
Lists: pgsql-hackers

Le 21/07/2010 23:23, Andreas Joseph Krogh a écrit :
> [...]
> I was googling for how to create a text-seach-config with the following
> properties:
> - Map unicode accentuated letters to an un-accentuated equivalent
> - No stop-words
> - Lowercase all words
>
> And came over this from -general:
> http://www.techienuggets.com/Comments?tx=106813
>
> Then after some more googling I found this:
> http://www.sai.msu.su/~megera/wiki/unaccent
>
> Any reason the unaccent dict. and function did not make it in 9.0?
>

Well, AFAICT, it's available in 9.0:

http://www.postgresql.org/docs/9.0/static/unaccent.html

--
Guillaume
http://www.postgresql.fr
http://dalibo.com


From: Andreas Joseph Krogh <andreak(at)officenet(dot)no>
To: pgsql-hackers(at)postgresql(dot)org
Subject: Re: accentuated letters in text-search
Date: 2010-07-22 07:57:30
Message-ID: 4C47F9EA.8060000@officenet.no
Views: Raw Message | Whole Thread | Download mbox | Resend email
Lists: pgsql-hackers

On 07/22/2010 07:42 AM, Guillaume Lelarge wrote:
> Le 21/07/2010 23:23, Andreas Joseph Krogh a écrit :
>
>> [...]
>> I was googling for how to create a text-seach-config with the following
>> properties:
>> - Map unicode accentuated letters to an un-accentuated equivalent
>> - No stop-words
>> - Lowercase all words
>>
>> And came over this from -general:
>> http://www.techienuggets.com/Comments?tx=106813
>>
>> Then after some more googling I found this:
>> http://www.sai.msu.su/~megera/wiki/unaccent
>>
>> Any reason the unaccent dict. and function did not make it in 9.0?
>>
>>
> Well, AFAICT, it's available in 9.0:
>
> http://www.postgresql.org/docs/9.0/static/unaccent.html
>

My contrib-foo was pretty low last night it seems, sorry for the noise...

--
Andreas Joseph Krogh<andreak(at)officenet(dot)no>
Senior Software Developer / CTO
------------------------+---------------------------------------------+
OfficeNet AS | The most difficult thing in the world is to |
Rosenholmveien 25 | know how to do a thing and to watch |
1414 Trollåsen | somebody else doing it wrong, without |
NORWAY | comment. |
| |
Tlf: +47 24 15 38 90 | |
Fax: +47 24 15 38 91 | |
Mobile: +47 909 56 963 | |
------------------------+---------------------------------------------+