Re: ispell file format

From: Teodor Sigaev <teodor(at)sigaev(dot)ru>
To: Heikki Linnakangas <heikki(at)enterprisedb(dot)com>
Cc: Oleg Bartunov <oleg(at)sai(dot)msu(dot)su>, PostgreSQL-development <pgsql-hackers(at)postgresql(dot)org>
Subject: Re: ispell file format
Date: 2007-08-23 13:08:54
Message-ID: 46CD86E6.9070809@sigaev.ru
Lists: pgsql-hackers

> Here's a snippet of the affix file around that line:
>
>> prefixes
>>
>> flag *A:
>> . > ALI # alivaltiosihteeri, alihankkija # line 83
>> I > ALI\-
Just remove the rules with \-; tsearch allows only alphabetic characters here.
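
For a large affix file, that removal could be scripted. The following is only a minimal sketch, not part of tsearch or ispell: the function name strip_hyphen_rules is hypothetical, and it assumes that '#' starts a comment, that a rule line contains '>', and that dropping the whole rule is acceptable.

```python
def strip_hyphen_rules(lines):
    """Return the affix-file lines without rules that contain an escaped hyphen.

    Assumptions (not taken from tsearch itself): '#' starts a comment,
    a rule line contains '>', and the escaped hyphen is written as \\-.
    """
    kept = []
    for line in lines:
        # Look only at the rule text, not at a trailing comment.
        rule = line.split("#", 1)[0]
        if ">" in rule and "\\-" in rule:
            continue  # drop rules such as: I > ALI\-
        kept.append(line)
    return kept


sample = [
    "prefixes",
    "",
    "flag *A:",
    ". > ALI # alivaltiosihteeri, alihankkija",
    "I > ALI\\-",
]
cleaned = strip_hyphen_rules(sample)
```

When applying this to a real file, read and write it in the file's own encoding so that non-ASCII letters in the rules are preserved.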

>
> ispell works just fine with it.
>
> I found a man page describing the ispell file format with Google:
> http://www.delorie.com/gnu/docs/ispell/ispell.4.html. Is this the same
> file format tsearch accepts? It looks like the grammar we accept is only
> a small subset of the ispell grammar; there are statements like
> "boundarychars", "stringchar" that we apparently don't support.

Yes, those options are useless for a dictionary:
- string chars are already checked by postgres itself (by the recode or verify functions)
- the parser already splits words, and the default parser treats '-' as a word separator

Hmm, I found another problem here. After removing those rules everything works fine
with the fi_FI.ISO8859-1 locale but not with fi_FI.UTF-8. I'll dig into it tomorrow.

--
Teodor Sigaev E-mail: teodor(at)sigaev(dot)ru
WWW: http://www.sigaev.ru/
