Lists: | pgsql-hackers |
---|
From: | "Gurjeet Singh" <singh(dot)gurjeet(at)gmail(dot)com> |
---|---|
To: | "PGSQL Hackers" <pgsql-hackers(at)postgresql(dot)org> |
Subject: | Problem with site doc search |
Date: | 2008-02-02 13:18:39 |
Message-ID: | 65937bea0802020518x36c2e27bs727ba55c5bd227ed@mail.gmail.com |
Hi All,
I just noticed a minor bug in our search results. Searching for
is_insteadbool in 8.3 docs returns the following page:
http://www.postgresql.org/docs/8.3/static/catalog-pg-rewrite.html
is_instead is a column, and bool is the datatype, both mentioned in
different columns. I know the search is based on Postgres' own full text search, but
I am not sure how the docs are read by the search engine. It
seems that the problem lies in the way the doc was read and fed to the
index builder, because the initial search results show these two words
combined.
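A minimal sketch of how two table cells could end up fused into one word, assuming (this is not confirmed anywhere in the thread) that the crawler strips HTML tags and joins the remaining text without a separator. The `ROW` markup and the `extract_text` helper are hypothetical and only illustrate the effect:

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collects the text nodes of an HTML fragment, dropping all tags."""

    def __init__(self, separator):
        super().__init__()
        self.separator = separator  # string placed between text nodes
        self.chunks = []

    def handle_data(self, data):
        # Called once for every run of text between tags.
        self.chunks.append(data)

    def text(self):
        return self.separator.join(self.chunks)


def extract_text(html, separator):
    """Hypothetical helper: strip tags and join the text with `separator`."""
    parser = TextExtractor(separator)
    parser.feed(html)
    parser.close()
    return parser.text()


# Hypothetical markup for one row of the pg_rewrite catalog table.
ROW = "<tr><td><tt>is_instead</tt></td><td><tt>bool</tt></td></tr>"

fused = extract_text(ROW, "")    # no separator: adjacent cells run together
spaced = extract_text(ROW, " ")  # a space between text nodes keeps them apart

print(repr(fused))
print(repr(spaced))
```

If the real extraction step behaves like the first call, emitting a space (or newline) at cell boundaries before the text reaches the index builder would keep the column name and the datatype as separate words.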
Also, is it possible to teach our search engine *not* to treat _
(underscore) as a word separator? This would be a great help and would give
much better results.
Thanks and best regards,
--
gurjeet[(dot)singh](at)EnterpriseDB(dot)com
singh(dot)gurjeet(at){ gmail | hotmail | indiatimes | yahoo }.com
EnterpriseDB http://www.enterprisedb.com
17° 29' 34.37"N, 78° 30' 59.76"E - Hyderabad
18° 32' 57.25"N, 73° 56' 25.42"E - Pune
37° 47' 19.72"N, 122° 24' 1.69" W - San Francisco *
Mail sent from my BlackLaptop device
From: | Oleg Bartunov <oleg(at)sai(dot)msu(dot)su> |
---|---|
To: | Gurjeet Singh <singh(dot)gurjeet(at)gmail(dot)com> |
Cc: | PGSQL Hackers <pgsql-hackers(at)postgresql(dot)org> |
Subject: | Re: Problem with site doc search |
Date: | 2008-02-02 22:08:50 |
Message-ID: | Pine.LNX.4.64.0802030103020.13757@sn.sai.msu.ru |
On Sat, 2 Feb 2008, Gurjeet Singh wrote:
> Hi All,
>
> I just noticed a minor bug in our search results. Searching for
> is_insteadbool in 8.3 docs returns the following page:
>
> http://www.postgresql.org/docs/8.3/static/catalog-pg-rewrite.html
>
> is_instead is a column, and bool is the datatype, both mentioned in
> different columns. I know it is based on postgres' own full text search, but
> am not sure about the method how docs are read by the search engine. It
> seems that the problem lies in the way the doc was read and fed to the
> index builder, because the initial search results show these two words
> combined.
>
> Also, is it possible to teach our search engine to *not* treat _
> (underscore) as a word separator? This would be a great help and would give
> much better results.
Sure, there are many ways to do this. Magnus?
>
> Thanks and best regards,
>
>
Regards,
Oleg
_____________________________________________________________
Oleg Bartunov, Research Scientist, Head of AstroNet (www.astronet.ru),
Sternberg Astronomical Institute, Moscow University, Russia
Internet: oleg(at)sai(dot)msu(dot)su, http://www.sai.msu.su/~megera/
phone: +007(495)939-16-83, +007(495)939-23-83
From: | Magnus Hagander <magnus(at)hagander(dot)net> |
---|---|
To: | Oleg Bartunov <oleg(at)sai(dot)msu(dot)su> |
Cc: | Gurjeet Singh <singh(dot)gurjeet(at)gmail(dot)com>, PGSQL Hackers <pgsql-hackers(at)postgresql(dot)org> |
Subject: | Re: Problem with site doc search |
Date: | 2008-02-03 16:40:12 |
Message-ID: | 47A5EE6C.4040704@hagander.net |
Oleg Bartunov wrote:
> Sure, there are many ways to do this. Magnus ?
Which way would you recommend? You're the tsearch master ;-)
//Magnus
From: | "Gurjeet Singh" <singh(dot)gurjeet(at)gmail(dot)com> |
---|---|
To: | "Magnus Hagander" <magnus(at)hagander(dot)net> |
Cc: | "Oleg Bartunov" <oleg(at)sai(dot)msu(dot)su>, "PGSQL Hackers" <pgsql-hackers(at)postgresql(dot)org> |
Subject: | Re: Problem with site doc search |
Date: | 2008-02-05 02:43:09 |
Message-ID: | 65937bea0802041843l36e5907dh99e0f01992cd03ce@mail.gmail.com |
Hi guys, any updates on this? I am pinging you just so that we do not forget
it in the heap of mails in our inboxes.
Best regards,
--
gurjeet[(dot)singh](at)EnterpriseDB(dot)com
singh(dot)gurjeet(at){ gmail | hotmail | indiatimes | yahoo }.com
EnterpriseDB http://www.enterprisedb.com
17° 29' 34.37"N, 78° 30' 59.76"E - Hyderabad
18° 32' 57.25"N, 73° 56' 25.42"E - Pune
37° 47' 19.72"N, 122° 24' 1.69" W - San Francisco *
Mail sent from my BlackLaptop device
From: | Magnus Hagander <magnus(at)hagander(dot)net> |
---|---|
To: | Gurjeet Singh <singh(dot)gurjeet(at)gmail(dot)com> |
Cc: | Oleg Bartunov <oleg(at)sai(dot)msu(dot)su>, PGSQL Hackers <pgsql-hackers(at)postgresql(dot)org> |
Subject: | Re: Problem with site doc search |
Date: | 2008-02-05 16:09:20 |
Message-ID: | 20080205160920.GH24114@svr2.hagander.net |
No. It's on the list, but other things around the release have priority.
//Magnus
From: | Oleg Bartunov <oleg(at)sai(dot)msu(dot)su> |
---|---|
To: | Magnus Hagander <magnus(at)hagander(dot)net> |
Cc: | Gurjeet Singh <singh(dot)gurjeet(at)gmail(dot)com>, PGSQL Hackers <pgsql-hackers(at)postgresql(dot)org> |
Subject: | Re: Problem with site doc search |
Date: | 2008-02-07 18:07:06 |
Message-ID: | Pine.LNX.4.64.0802072106210.13757@sn.sai.msu.ru |
On Tue, 5 Feb 2008, Magnus Hagander wrote:
> No. It's on the list, but other things around the release have priority.
I just returned from my Europe trip and have many things to do :)
Regards,
Oleg
_____________________________________________________________
Oleg Bartunov, Research Scientist, Head of AstroNet (www.astronet.ru),
Sternberg Astronomical Institute, Moscow University, Russia
Internet: oleg(at)sai(dot)msu(dot)su, http://www.sai.msu.su/~megera/
phone: +007(495)939-16-83, +007(495)939-23-83
From: | Bruce Momjian <bruce(at)momjian(dot)us> |
---|---|
To: | Oleg Bartunov <oleg(at)sai(dot)msu(dot)su> |
Cc: | Magnus Hagander <magnus(at)hagander(dot)net>, Gurjeet Singh <singh(dot)gurjeet(at)gmail(dot)com>, PGSQL Hackers <pgsql-hackers(at)postgresql(dot)org> |
Subject: | Re: Problem with site doc search |
Date: | 2008-03-06 03:23:28 |
Message-ID: | 200803060323.m263NSO21038@momjian.us |
Has this been addressed?
--
Bruce Momjian <bruce(at)momjian(dot)us> http://momjian.us
EnterpriseDB http://postgres.enterprisedb.com
+ If your life is a hard drive, Christ can be your backup. +
From: | Magnus Hagander <magnus(at)hagander(dot)net> |
---|---|
To: | Bruce Momjian <bruce(at)momjian(dot)us> |
Cc: | Oleg Bartunov <oleg(at)sai(dot)msu(dot)su>, Gurjeet Singh <singh(dot)gurjeet(at)gmail(dot)com>, PGSQL Hackers <pgsql-hackers(at)postgresql(dot)org> |
Subject: | Re: Problem with site doc search |
Date: | 2008-03-06 08:56:03 |
Message-ID: | 20080306085603.GC2033@svr2.hagander.net |
No, it's still on the TODO. Gevik has also been looking a bit at it (I
think - at least he's indicated that he is), and he recently got some new
parser code to look at to see if we can use it to fix this.
//Magnus
From: | "Gevik Babakhani" <pgdev(at)xs4all(dot)nl> |
---|---|
To: | "'Magnus Hagander'" <magnus(at)hagander(dot)net>, "'Bruce Momjian'" <bruce(at)momjian(dot)us> |
Cc: | "'Oleg Bartunov'" <oleg(at)sai(dot)msu(dot)su>, "'Gurjeet Singh'" <singh(dot)gurjeet(at)gmail(dot)com>, "'PGSQL Hackers'" <pgsql-hackers(at)postgresql(dot)org> |
Subject: | Re: Problem with site doc search |
Date: | 2008-03-06 11:31:50 |
Message-ID: | 000301c87f7d$ae4dc130$0a01a8c0@gevmus |
> No, it's still on the TODO. Gevik has also been looking a bit
> at it (I think - at least he's indicated that he is), and he
> recently got some new parser code to look at to see if we can
> use to fix it.
>
I have the new parser code. Next week I have some time reserved to look at
it.
From: | "Gurjeet Singh" <singh(dot)gurjeet(at)gmail(dot)com> |
---|---|
To: | "Gevik Babakhani" <pgdev(at)xs4all(dot)nl> |
Cc: | "Magnus Hagander" <magnus(at)hagander(dot)net>, "Bruce Momjian" <bruce(at)momjian(dot)us>, "Oleg Bartunov" <oleg(at)sai(dot)msu(dot)su>, "PGSQL Hackers" <pgsql-hackers(at)postgresql(dot)org> |
Subject: | Re: Problem with site doc search |
Date: | 2008-03-24 14:01:32 |
Message-ID: | 65937bea0803240701p225c7b51x61d016248de4da74@mail.gmail.com |
On Thu, Mar 6, 2008 at 5:01 PM, Gevik Babakhani <pgdev(at)xs4all(dot)nl> wrote:
> > No, it's still on the TODO. Gevik has also been looking a bit
> > at it (I think - at least he's indicated that he is), and he
> > recently got some new parser code to look at to see if we can
> > use to fix it.
> >
>
> I have the new parser code. Next week I have some time reserved to look at
> it.
>
>
For the last few days I have been noticing that the _ (underscore) is no
longer being treated as a word boundary. Can you confirm this has been
fixed?
Best regards,
--
gurjeet[(dot)singh](at)EnterpriseDB(dot)com
singh(dot)gurjeet(at){ gmail | hotmail | indiatimes | yahoo }.com
EnterpriseDB http://www.enterprisedb.com
Mail sent from my BlackLaptop device
From: | Magnus Hagander <magnus(at)hagander(dot)net> |
---|---|
To: | Gurjeet Singh <singh(dot)gurjeet(at)gmail(dot)com> |
Cc: | Gevik Babakhani <pgdev(at)xs4all(dot)nl>, Bruce Momjian <bruce(at)momjian(dot)us>, Oleg Bartunov <oleg(at)sai(dot)msu(dot)su>, PGSQL Hackers <pgsql-hackers(at)postgresql(dot)org> |
Subject: | Re: Problem with site doc search |
Date: | 2008-03-24 21:31:42 |
Message-ID: | 47E81DBE.30701@hagander.net |
Gurjeet Singh wrote:
> For the last few days I have been noticing that the _ (underscore) is no
> longer being treated as a word boundary. Can you confirm this has been
> fixed?
AFAIK, nothing has been done yet :-)
//Magnus
From: | "Gurjeet Singh" <singh(dot)gurjeet(at)gmail(dot)com> |
---|---|
To: | "Magnus Hagander" <magnus(at)hagander(dot)net> |
Cc: | "Gevik Babakhani" <pgdev(at)xs4all(dot)nl>, "Bruce Momjian" <bruce(at)momjian(dot)us>, "Oleg Bartunov" <oleg(at)sai(dot)msu(dot)su>, "PGSQL Hackers" <pgsql-hackers(at)postgresql(dot)org> |
Subject: | Re: Problem with site doc search |
Date: | 2008-03-26 11:43:50 |
Message-ID: | 65937bea0803260443q10e1d8efo4a825bac40c0a833@mail.gmail.com |
On Tue, Mar 25, 2008 at 3:01 AM, Magnus Hagander <magnus(at)hagander(dot)net>
wrote:
> Gurjeet Singh wrote:
> > For the last few days I have been noticing that the _ (underscore) is no
> > longer being treated as a word boundary. Can you confirm this has been
> > fixed?
>
> AFAIK, nothing has been done yet :-)
>
You are right. I was searching for pg_standby and another pg_<something>,
and since those results came up on top, I thought some action had been
taken.
The problem persists.
Best regards,
--
gurjeet[(dot)singh](at)EnterpriseDB(dot)com
singh(dot)gurjeet(at){ gmail | hotmail | indiatimes | yahoo }.com
EnterpriseDB http://www.enterprisedb.com
Mail sent from my BlackLaptop device
From: | Bruce Momjian <bruce(at)momjian(dot)us> |
---|---|
To: | Gurjeet Singh <singh(dot)gurjeet(at)gmail(dot)com> |
Cc: | Magnus Hagander <magnus(at)hagander(dot)net>, Gevik Babakhani <pgdev(at)xs4all(dot)nl>, Oleg Bartunov <oleg(at)sai(dot)msu(dot)su>, PGSQL Hackers <pgsql-hackers(at)postgresql(dot)org> |
Subject: | Re: Problem with site doc search |
Date: | 2008-04-15 14:13:34 |
Message-ID: | 200804151413.m3FEDY128486@momjian.us |
I just searched for pg_standby and it looks like it is fixed now.
--
Bruce Momjian <bruce(at)momjian(dot)us> http://momjian.us
EnterpriseDB http://enterprisedb.com
+ If your life is a hard drive, Christ can be your backup. +
From: | Magnus Hagander <magnus(at)hagander(dot)net> |
---|---|
To: | Bruce Momjian <bruce(at)momjian(dot)us> |
Cc: | Gurjeet Singh <singh(dot)gurjeet(at)gmail(dot)com>, Gevik Babakhani <pgdev(at)xs4all(dot)nl>, Oleg Bartunov <oleg(at)sai(dot)msu(dot)su>, PGSQL Hackers <pgsql-hackers(at)postgresql(dot)org> |
Subject: | Re: Problem with site doc search |
Date: | 2008-04-15 14:17:56 |
Message-ID: | 20080415161756.57d3ccf6@mha-laptop |
I didn't do anything, but possibly it got fixed by a different upgrade
at some point, and the recrawling of the sites.
//Magnus
From: | Bruce Momjian <bruce(at)momjian(dot)us> |
---|---|
To: | Magnus Hagander <magnus(at)hagander(dot)net> |
Cc: | Gurjeet Singh <singh(dot)gurjeet(at)gmail(dot)com>, Gevik Babakhani <pgdev(at)xs4all(dot)nl>, Oleg Bartunov <oleg(at)sai(dot)msu(dot)su>, PGSQL Hackers <pgsql-hackers(at)postgresql(dot)org> |
Subject: | Re: Problem with site doc search |
Date: | 2008-04-15 14:22:02 |
Message-ID: | 200804151422.m3FEM2g05614@momjian.us |
Magnus Hagander wrote:
> I didn't do anything, but possibly it got fixed by a different upgrade
> at some point, and the recrawling of the sites.
Oops, maybe it isn't fixed. I tried "pg_standby" and it seemed to work,
but searching for "pg" and "standby" returns the same results.
Is that correct? How do I test this?
--
Bruce Momjian <bruce(at)momjian(dot)us> http://momjian.us
EnterpriseDB http://enterprisedb.com
+ If your life is a hard drive, Christ can be your backup. +
From: | Richard Huxton <dev(at)archonet(dot)com> |
---|---|
To: | Bruce Momjian <bruce(at)momjian(dot)us> |
Cc: | Magnus Hagander <magnus(at)hagander(dot)net>, Gurjeet Singh <singh(dot)gurjeet(at)gmail(dot)com>, Gevik Babakhani <pgdev(at)xs4all(dot)nl>, Oleg Bartunov <oleg(at)sai(dot)msu(dot)su>, PGSQL Hackers <pgsql-hackers(at)postgresql(dot)org> |
Subject: | Re: Problem with site doc search |
Date: | 2008-04-15 14:44:31 |
Message-ID: | 4804BF4F.5040201@archonet.com |
Lists: | pgsql-hackers |
Bruce Momjian wrote:
> Magnus Hagander wrote:
>> I didn't do anything, but possibly it got fixed by a different upgrade
>> at some point, and the recrawling of the sites.
>
> Oops, maybe it isn't fixed. I tried "pg_standby" and it seemed to work
> but "pg" and "standby" returns the same results.
>
> Is that correct? How do I test this?
The default is to split words on underscore, so it's probably doing what
it always did. Try "to_tsquery" and you should see it matching "tsquery"
(probably "to" is a stopword).
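The splitting behaviour described above can be illustrated with a minimal Python sketch. This is not the tsearch parser: the two regular expressions and the small stopword set are assumptions made only to show why "pg_standby" and "pg standby" end up as the same query, and why "to_tsquery" matches "tsquery".

```python
import re

# Hypothetical stopword set; the real list depends on the text search
# configuration in use on the site.
STOPWORDS = {"to", "the", "a", "of"}

def split_on_underscore(text):
    """Mimic a parser that treats '_' as a word separator."""
    words = re.findall(r"[a-z0-9]+", text.lower())
    return [w for w in words if w not in STOPWORDS]

def keep_underscore(text):
    """Mimic a parser that allows '_' inside a word."""
    words = re.findall(r"[a-z0-9_]+", text.lower())
    return [w for w in words if w not in STOPWORDS]

print(split_on_underscore("pg_standby"))  # ['pg', 'standby']
print(split_on_underscore("pg standby"))  # ['pg', 'standby']
print(split_on_underscore("to_tsquery"))  # ['tsquery']
print(keep_underscore("pg_standby"))      # ['pg_standby']
print(keep_underscore("to_tsquery"))      # ['to_tsquery']
```

With the first function a search for "pg_standby" cannot be told apart from a search for "pg" and "standby", which matches what Bruce observed.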
I did put together a custom parser that allowed underscore in words, but
given my extensive "C" experience in the last decade (one tsearch
parser) you don't want to just plug that into the live site. Someone
(Gevik?) was going to have a look at it when they had the time, but I'd
guess that's the one thing none of us have much of.
--
Richard Huxton
Archonet Ltd
From: | Magnus Hagander <magnus(at)hagander(dot)net> |
---|---|
To: | Richard Huxton <dev(at)archonet(dot)com> |
Cc: | Bruce Momjian <bruce(at)momjian(dot)us>, Gurjeet Singh <singh(dot)gurjeet(at)gmail(dot)com>, Gevik Babakhani <pgdev(at)xs4all(dot)nl>, Oleg Bartunov <oleg(at)sai(dot)msu(dot)su>, PGSQL Hackers <pgsql-hackers(at)postgresql(dot)org> |
Subject: | Re: Problem with site doc search |
Date: | 2008-04-15 14:48:13 |
Message-ID: | 20080415164813.4c88e680@mha-laptop |
Lists: | pgsql-hackers |
Richard Huxton wrote:
> Bruce Momjian wrote:
> > Magnus Hagander wrote:
> >> I didn't do anything, but possibly it got fixed by a different
> >> upgrade at some point, and the recrawling of the sites.
> >
> > Oops, maybe it isn't fixed. I tried "pg_standby" and it seemed to
> > work but "pg" and "standby" returns the same results.
> >
> > Is that correct? How do I test this?
>
> The default is to split words on underscore, so it's probably doing
> what it always did. Try "to_tsquery" and you should see it matching
> "tsquery" (probably "to" is a stopword).
>
> I did put together a custom parser that allowed underscore in words,
> but given my extensive "C" experience in the last decade (one tsearch
> parser) you don't want to just plug that into the live site. Someone
> (Gevik?) was going to have a look at it when they had the time, but
> I'd guess that's the one thing none of us have much of.
Yeah.
Did you ever post the code to anybody other than Gevik? If not, please
send it to pgsql-www and someone can give it a quick look-over (perhaps
Oleg can help us there?)
//Magnus
From: | Richard Huxton <dev(at)archonet(dot)com> |
---|---|
To: | Magnus Hagander <magnus(at)hagander(dot)net> |
Cc: | Bruce Momjian <bruce(at)momjian(dot)us>, Gurjeet Singh <singh(dot)gurjeet(at)gmail(dot)com>, Gevik Babakhani <pgdev(at)xs4all(dot)nl>, Oleg Bartunov <oleg(at)sai(dot)msu(dot)su>, PGSQL Hackers <pgsql-hackers(at)postgresql(dot)org> |
Subject: | Re: Problem with site doc search |
Date: | 2008-04-15 14:51:03 |
Message-ID: | 4804C0D7.6030407@archonet.com |
Lists: | pgsql-hackers |
Magnus Hagander wrote:
> Did you ever post the code to anybody other than Gevik? If not, please
> send it to pgsql-www and someone can give it a quick look-over (perhaps
> Oleg can help us there?)
Will do.
--
Richard Huxton
Archonet Ltd
From: | Oleg Bartunov <oleg(at)sai(dot)msu(dot)su> |
---|---|
To: | Magnus Hagander <magnus(at)hagander(dot)net> |
Cc: | Bruce Momjian <bruce(at)momjian(dot)us>, Gurjeet Singh <singh(dot)gurjeet(at)gmail(dot)com>, Gevik Babakhani <pgdev(at)xs4all(dot)nl>, PGSQL Hackers <pgsql-hackers(at)postgresql(dot)org> |
Subject: | Re: Problem with site doc search |
Date: | 2008-04-15 16:05:38 |
Message-ID: | Pine.LNX.4.64.0804152004400.21547@sn.sai.msu.ru |
Lists: | pgsql-hackers |
On Tue, 15 Apr 2008, Magnus Hagander wrote:
> I didn't do anything, but possibly it got fixed by a different upgrade
> at some point, and the recrawling of the sites.
Magnus, we have a parser for indexing pgdocs. Do you need it?
Regards,
Oleg
_____________________________________________________________
Oleg Bartunov, Research Scientist, Head of AstroNet (www.astronet.ru),
Sternberg Astronomical Institute, Moscow University, Russia
Internet: oleg(at)sai(dot)msu(dot)su, http://www.sai.msu.su/~megera/
phone: +007(495)939-16-83, +007(495)939-23-83
From: | Oleg Bartunov <oleg(at)sai(dot)msu(dot)su> |
---|---|
To: | Richard Huxton <dev(at)archonet(dot)com> |
Cc: | Bruce Momjian <bruce(at)momjian(dot)us>, Magnus Hagander <magnus(at)hagander(dot)net>, Gurjeet Singh <singh(dot)gurjeet(at)gmail(dot)com>, Gevik Babakhani <pgdev(at)xs4all(dot)nl>, PGSQL Hackers <pgsql-hackers(at)postgresql(dot)org> |
Subject: | Re: Problem with site doc search |
Date: | 2008-04-15 16:08:25 |
Message-ID: | Pine.LNX.4.64.0804152006080.21547@sn.sai.msu.ru |
Lists: | pgsql-hackers |
Sergey Karpov prepared contrib/extend_parser, which we intend to use for
indexing pg-related documents. It handles '_' properly, so if anybody is
interested, we could post it. It can also be useful for experimenting,
since it's a standalone contrib module.
Oleg
On Tue, 15 Apr 2008, Richard Huxton wrote:
> Bruce Momjian wrote:
>> Magnus Hagander wrote:
>>> I didn't do anything, but possibly it got fixed by a different upgrade
>>> at some point, and the recrawling of the sites.
>>
>> Oops, maybe it isn't fixed. I tried "pg_standby" and it seemed to work
>> but "pg" and "standby" returns the same results.
>>
>> Is that correct? How do I test this?
>
> The default is to split words on underscore, so it's probably doing what it
> always did. Try "to_tsquery" and you should see it matching "tsquery"
> (probably "to" is a stopword).
>
> I did put together a custom parser that allowed underscore in words, but
> given my extensive "C" experience in the last decade (one tsearch parser) you
> don't want to just plug that into the live site. Someone (Gevik?) was going
> to have a look at it when they had the time, but I'd guess that's the one
> thing none of us have much of.
>
>
Regards,
Oleg
_____________________________________________________________
Oleg Bartunov, Research Scientist, Head of AstroNet (www.astronet.ru),
Sternberg Astronomical Institute, Moscow University, Russia
Internet: oleg(at)sai(dot)msu(dot)su, http://www.sai.msu.su/~megera/
phone: +007(495)939-16-83, +007(495)939-23-83
From: | Richard Huxton <dev(at)archonet(dot)com> |
---|---|
To: | Oleg Bartunov <oleg(at)sai(dot)msu(dot)su> |
Cc: | Bruce Momjian <bruce(at)momjian(dot)us>, Magnus Hagander <magnus(at)hagander(dot)net>, Gurjeet Singh <singh(dot)gurjeet(at)gmail(dot)com>, Gevik Babakhani <pgdev(at)xs4all(dot)nl>, PGSQL Hackers <pgsql-hackers(at)postgresql(dot)org> |
Subject: | Re: Problem with site doc search |
Date: | 2008-04-15 16:24:59 |
Message-ID: | 4804D6DB.4040002@archonet.com |
Lists: | pgsql-hackers |
Oleg Bartunov wrote:
> Sergey Karpov prepared contrib/extend_parser, which we intend to use for
> indexing pg-related documents. It handles '_' properly, so if anybody is
> interested, we could post it. It can also be useful for experimenting,
> since it's a standalone contrib module.
Does it make sense to back-patch the default parser for 8.4? At present,
it can't handle underscores in file-paths.
--
Richard Huxton
Archonet Ltd
From: | Cédric Villemain <cedric(dot)villemain(at)dalibo(dot)com> |
---|---|
To: | pgsql-hackers(at)postgresql(dot)org |
Cc: | "Gurjeet Singh" <singh(dot)gurjeet(at)gmail(dot)com>, "Gevik Babakhani" <pgdev(at)xs4all(dot)nl>, "Magnus Hagander" <magnus(at)hagander(dot)net>, "Bruce Momjian" <bruce(at)momjian(dot)us>, "Oleg Bartunov" <oleg(at)sai(dot)msu(dot)su> |
Subject: | Re: Problem with site doc search |
Date: | 2008-04-16 07:47:08 |
Message-ID: | 200804160947.15379.cedric.villemain@dalibo.com |
Lists: | pgsql-hackers |
Note that:
http://search.postgresql.org/search?q=tom+lane&m=1&l=&d=1&s=r
and
http://search.postgresql.org/search?q=tom+lane&m=1&l=&d=1&s=d
do not return the same results (3 results by date, 1 by rank) even though
only the sort order differs.
--
Cédric Villemain
Database Administrator
Cel: +33 (0)6 74 15 56 53
http://dalibo.com - http://dalibo.org
From: | Magnus Hagander <magnus(at)hagander(dot)net> |
---|---|
To: | Oleg Bartunov <oleg(at)sai(dot)msu(dot)su> |
Cc: | Bruce Momjian <bruce(at)momjian(dot)us>, Gurjeet Singh <singh(dot)gurjeet(at)gmail(dot)com>, Gevik Babakhani <pgdev(at)xs4all(dot)nl>, PGSQL Hackers <pgsql-hackers(at)postgresql(dot)org> |
Subject: | Re: Problem with site doc search |
Date: | 2008-04-16 08:40:53 |
Message-ID: | 20080416104053.15f48e3f@mha-laptop |
Lists: | pgsql-hackers |
Oleg Bartunov wrote:
> On Tue, 15 Apr 2008, Magnus Hagander wrote:
>
> > I didn't do anything, but possibly it got fixed by a different
> > upgrade at some point, and the recrawling of the sites.
>
> Magnus, we have a parser for indexing pgdocs. Do you need it?
Yes, please!
//Magnus
From: | Magnus Hagander <magnus(at)hagander(dot)net> |
---|---|
To: | Cédric Villemain <cedric(dot)villemain(at)dalibo(dot)com> |
Cc: | pgsql-hackers(at)postgresql(dot)org, "Gurjeet Singh" <singh(dot)gurjeet(at)gmail(dot)com>, "Gevik Babakhani" <pgdev(at)xs4all(dot)nl>, "Bruce Momjian" <bruce(at)momjian(dot)us>, "Oleg Bartunov" <oleg(at)sai(dot)msu(dot)su> |
Subject: | Re: Problem with site doc search |
Date: | 2008-04-16 08:55:12 |
Message-ID: | 20080416105512.310facc0@mha-laptop |
Lists: | pgsql-hackers |
Cédric Villemain wrote:
> Note that:
>
> http://search.postgresql.org/search?q=tom+lane&m=1&l=&d=1&s=r
> and
> http://search.postgresql.org/search?q=tom+lane&m=1&l=&d=1&s=d
>
> do not return the same results (3 results by date, 1 by rank) even
> though only the sort order differs.
Actually, I get 5 and 7, in the other order.
The reason for this is that Tom Lane is way too active. It's
gin_fuzzy_search_limit that's doing the restriction first, and the date
restriction comes in later.
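The ordering effect described above can be sketched in Python. This is a toy model, not the site's search code: `gin_fuzzy_search_limit` is a real PostgreSQL setting that acts as a soft cap and returns a random subset of GIN index matches, but the data, the function names and the use of `random.sample` as a stand-in for that cap are all assumptions made for illustration.

```python
import random

def limit_then_filter(posts, term, since_day, fuzzy_limit, seed=0):
    """Cap the text matches first, then apply the date restriction."""
    matches = [p for p in posts if term in p["body"]]
    if len(matches) > fuzzy_limit:
        # Assumed stand-in for the soft cap: keep a random subset.
        matches = random.Random(seed).sample(matches, fuzzy_limit)
    return [p for p in matches if p["day"] >= since_day]

def filter_then_limit(posts, term, since_day, fuzzy_limit, seed=0):
    """Apply the date restriction first, then cap whatever is left."""
    matches = [p for p in posts if p["day"] >= since_day and term in p["body"]]
    if len(matches) > fuzzy_limit:
        matches = random.Random(seed).sample(matches, fuzzy_limit)
    return matches

# Hypothetical archive: 1000 posts that all match, one per day.
posts = [{"id": i, "day": i, "body": "tom lane wrote"} for i in range(1000)]

capped_first = limit_then_filter(posts, "tom lane", since_day=990, fuzzy_limit=100)
dated_first = filter_then_limit(posts, "tom lane", since_day=990, fuzzy_limit=100)

print(len(dated_first))   # 10: every recent match survives
print(len(capped_first))  # at most 10, usually far fewer
```

Because the cap discards most matches before the date is checked, a very frequent term can return only a handful of recent posts, and the exact handful depends on which random subset was kept.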
//Magnus
From: | Cédric Villemain <cedric(dot)villemain(at)dalibo(dot)com> |
---|---|
To: | pgsql-hackers(at)postgresql(dot)org |
Cc: | Magnus Hagander <magnus(at)hagander(dot)net>, "Gurjeet Singh" <singh(dot)gurjeet(at)gmail(dot)com>, "Gevik Babakhani" <pgdev(at)xs4all(dot)nl>, "Bruce Momjian" <bruce(at)momjian(dot)us>, "Oleg Bartunov" <oleg(at)sai(dot)msu(dot)su> |
Subject: | Re: Problem with site doc search |
Date: | 2008-04-16 09:15:18 |
Message-ID: | 200804161115.23980.cedric.villemain@dalibo.com |
Lists: | pgsql-hackers |
On Wednesday 16 April 2008, Magnus Hagander wrote:
> Cédric Villemain wrote:
> > Note that:
> >
> > http://search.postgresql.org/search?q=tom+lane&m=1&l=&d=1&s=r
> > and
> > http://search.postgresql.org/search?q=tom+lane&m=1&l=&d=1&s=d
> >
> > do not return the same results (3 results by date, 1 by rank) even
> > though only the sort order differs.
>
> Actually, I get 5 and 7, in the other order.
>
> The reason for this is that Tom Lane is way too active. It's
> gin_fuzzy_search_limit that's doing the restriction first, and the date
> restriction comes in later.
Yes, you are perfectly right. May I suggest disabling
gin_fuzzy_search_limit (or raising its value) when another condition, such
as the 'post date' here, can considerably reduce the number of results, and
applying that condition earlier?
>
> //Magnus
--
Cédric Villemain
Database Administrator
Cel: +33 (0)6 74 15 56 53
http://dalibo.com - http://dalibo.org
From: | Magnus Hagander <magnus(at)hagander(dot)net> |
---|---|
To: | Cédric Villemain <cedric(dot)villemain(at)dalibo(dot)com> |
Cc: | pgsql-hackers(at)postgresql(dot)org, "Gurjeet Singh" <singh(dot)gurjeet(at)gmail(dot)com>, "Gevik Babakhani" <pgdev(at)xs4all(dot)nl>, "Bruce Momjian" <bruce(at)momjian(dot)us>, "Oleg Bartunov" <oleg(at)sai(dot)msu(dot)su> |
Subject: | Re: Problem with site doc search |
Date: | 2008-04-16 09:25:12 |
Message-ID: | 20080416112512.43b32997@mha-laptop |
Lists: | pgsql-hackers |
Cédric Villemain wrote:
> On Wednesday 16 April 2008, Magnus Hagander wrote:
> > Cédric Villemain wrote:
> > > Note that:
> > >
> > > http://search.postgresql.org/search?q=tom+lane&m=1&l=&d=1&s=r
> > > and
> > > http://search.postgresql.org/search?q=tom+lane&m=1&l=&d=1&s=d
> > >
> > > do not return the same results (3 results by date, 1 by rank) even
> > > though only the sort order differs.
> >
> > Actually, I get 5 and 7, in the other order.
> >
> > The reason for this is that Tom Lane is way too active. It's
> > gin_fuzzy_search_limit that's doing the restriction first, and the
> > date restriction comes in later.
>
> Yes, you are perfectly right. May I suggest disabling
> gin_fuzzy_search_limit (or raising its value) when another condition,
> such as the 'post date' here, can considerably reduce the number of
> results, and applying that condition earlier?
You'd have to convince the planner not to use an index scan at all on the
tsvector. Normally it'll choose an index scan on each condition and then a
bitmap join, and we don't want to bring back so many rows...
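A rough Python sketch of that plan shape, under stated assumptions: each condition is answered by its own index scan and the row ids are then intersected (PostgreSQL calls this step BitmapAnd). The dictionary and list used as "indexes" here are hypothetical stand-ins, not how the real indexes are stored.

```python
def bitmap_and_search(term_index, date_index, term, since_day):
    """Intersect the row ids returned by two independent index scans."""
    # The text side collects every row containing the term, no matter
    # how selective the date condition is.
    text_rows = set(term_index.get(term, ()))
    date_rows = {row_id for day, row_id in date_index if day >= since_day}
    return text_rows & date_rows, len(text_rows)

# Hypothetical data: 1000 posts, all containing the term, one per day.
term_index = {"lane": range(1000)}
date_index = [(day, day) for day in range(1000)]

rows, text_side_size = bitmap_and_search(term_index, date_index, "lane", 990)
print(sorted(rows))    # the ten row ids 990 through 999
print(text_side_size)  # 1000
```

Only ten rows survive the intersection, yet the text side still had to produce all 1000 row ids first, which is the cost the fuzzy limit is there to contain.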
//Magnus