ToDo: preload for fulltext dictionary

Lists: pgsql-hackers
From: Pavel Stehule <pavel(dot)stehule(at)gmail(dot)com>
To: PostgreSQL Hackers <pgsql-hackers(at)postgresql(dot)org>
Subject: ToDo: preload for fulltext dictionary
Date: 2010-02-15 14:37:20
Message-ID: 162867791002150637y6cc73981ycfd54b4ec3e75d10@mail.gmail.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Lists: pgsql-hackers

Hello,

Czech users reports a slow first fulltext queries. It is based on
using ispell dictionary.

The dictionary data could be shared or minimally dictionary could be
preloaded like some PL language.

What do you think about this?

Regards
Pavel Stehule


From: Takahiro Itagaki <itagaki(dot)takahiro(at)oss(dot)ntt(dot)co(dot)jp>
To: Pavel Stehule <pavel(dot)stehule(at)gmail(dot)com>
Cc: PostgreSQL Hackers <pgsql-hackers(at)postgresql(dot)org>
Subject: Re: ToDo: preload for fulltext dictionary
Date: 2010-02-16 00:53:13
Message-ID: 20100216095310.9D7D.52131E4D@oss.ntt.co.jp
Views: Raw Message | Whole Thread | Download mbox | Resend email
Lists: pgsql-hackers


Pavel Stehule <pavel(dot)stehule(at)gmail(dot)com> wrote:

> The dictionary data could be shared or minimally dictionary could be
> preloaded like some PL language.
>
> What do you think about this?

Surely preloading is the most realistic approach, but I hope we would support
dynamic allocation of shared memory, and load dictionaries in the area and
share it with backends. We should avoid additonal calls of shmget() or mmap()
in the additional shared memory allocation, but we can shrink shared buffers
and reuse the area for general purposes. We often have serveral GB of shared
buffers nowadays, so dividing some MB of buffers will not be problem.

Regards,
---
Takahiro Itagaki
NTT Open Source Software Center


From: Pavel Stehule <pavel(dot)stehule(at)gmail(dot)com>
To: Takahiro Itagaki <itagaki(dot)takahiro(at)oss(dot)ntt(dot)co(dot)jp>
Cc: PostgreSQL Hackers <pgsql-hackers(at)postgresql(dot)org>
Subject: Re: ToDo: preload for fulltext dictionary
Date: 2010-02-16 08:43:05
Message-ID: 162867791002160043u46213690v7fba2c78df40b563@mail.gmail.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Lists: pgsql-hackers

2010/2/16 Takahiro Itagaki <itagaki(dot)takahiro(at)oss(dot)ntt(dot)co(dot)jp>:
>
> Pavel Stehule <pavel(dot)stehule(at)gmail(dot)com> wrote:
>
>> The dictionary data could be shared or minimally dictionary could be
>> preloaded like some PL language.
>>
>> What do you think about this?
>
> Surely preloading is the most realistic approach, but I hope we would support
> dynamic allocation of shared memory, and load dictionaries in the area and
> share it with backends. We should avoid additonal calls of shmget() or mmap()
> in the additional shared memory allocation, but we can shrink shared buffers
> and reuse the area for general purposes. We often have serveral GB of shared
> buffers nowadays, so dividing some MB of buffers will not be problem.
>

I am for all solution, but using a shared memory isn't realistic now,
and can be a problem on MS Windows. More - it can be difficult with
some special custom non ro dictionaries.

Regards
Pavel

> Regards,
> ---
> Takahiro Itagaki
> NTT Open Source Software Center
>
>
>