Re: ts_rank

Lists: pgsql-hackers
From: "Kevin Grittner" <Kevin(dot)Grittner(at)wicourts(dot)gov>
To: <robertmhaas(at)gmail(dot)com>,<Marek(dot)Balgar(at)seznam(dot)cz>
Cc: <pgsql-hackers(at)postgresql(dot)org>
Subject: Re: ts_rank
Date: 2011-05-20 02:42:32
Message-ID: 4DD58EC8020000250003D997@gw.wicourts.gov
Views: Raw Message | Whole Thread | Download mbox | Resend email
Lists: pgsql-hackers

Robert Haas wrote:
> Mark wrote:

>> Could somebody explain me on which methods is based ts_rank and
>> how it works? I would appreciate some articles, if exist.

> As far as I can tell, our documentation contains no useful
> information on this topic whatsoever. :-(

There's some potentially useful information here:

http://www.postgresql.org/docs/9.0/interactive/textsearch-controls.html#TEXTSEARCH-RANKING

Although I don't know if it addresses Mark's question very well.
Personally, I wonder how relevant ts_rank will be after knn-giswt
is out....

-Kevin


From: Robert Haas <robertmhaas(at)gmail(dot)com>
To: Kevin Grittner <Kevin(dot)Grittner(at)wicourts(dot)gov>
Cc: Marek(dot)Balgar(at)seznam(dot)cz, pgsql-hackers(at)postgresql(dot)org
Subject: Re: ts_rank
Date: 2011-05-20 15:27:08
Message-ID: BANLkTiksH9BG0Y4X17jdxMuLOCAtaLJsZg@mail.gmail.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Lists: pgsql-hackers

On Thu, May 19, 2011 at 10:42 PM, Kevin Grittner
<Kevin(dot)Grittner(at)wicourts(dot)gov> wrote:
> Robert Haas  wrote:
>> Mark  wrote:
>
>>> Could somebody explain me on which methods is based ts_rank and
>>> how it works?  I would appreciate some articles, if exist.
>
>> As far as I can tell, our documentation contains no useful
>> information on this topic whatsoever. :-(
>
> There's some potentially useful information here:
>
> http://www.postgresql.org/docs/9.0/interactive/textsearch-controls.html#TEXTSEARCH-RANKING

Ah, yes. I didn't read that carefully enough. That is useful, but it
doesn't really explain how it works.

> Although I don't know if it addresses Mark's question very well.
> Personally, I wonder how relevant ts_rank will be after knn-giswt
> is out....

I don't see why it would be any less useful... though if someone could
find a way to KNN-ify such searches, I'm sure there would be a lot of
very happy users. Seems pretty difficult, though.

--
Robert Haas
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company


From: Mark <Marek(dot)Balgar(at)seznam(dot)cz>
To: pgsql-hackers(at)postgresql(dot)org
Subject: Re: ts_rank
Date: 2011-05-21 11:41:34
Message-ID: 1305978094293-4414631.post@n5.nabble.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Lists: pgsql-hackers

>There's some potentially useful information here:
>http://www.postgresql.org/docs/9.0/interactive/textsearch-controls.html#TEXTSEARCH-RANKING

Thanks for reply. I was reading the documentation of PostgreSQL, but there
it is not written the name of the used methods. Everywhere there is written,
that ts_rank use standard ranking function. But it is difficult to say which
is the standard function.
Somewhere I found that it is maybe based on Vector space model and it seems
to be truth, because in the code of tsrank.c is counted the frequency of
words, but I am not sure of that :-(

--
View this message in context: http://postgresql.1045698.n5.nabble.com/ts-rank-tp4384614p4414631.html
Sent from the PostgreSQL - hackers mailing list archive at Nabble.com.


From: Bruce Momjian <bruce(at)momjian(dot)us>
To: Mark <Marek(dot)Balgar(at)seznam(dot)cz>
Cc: pgsql-hackers(at)postgresql(dot)org, Oleg Bartunov <oleg(at)sai(dot)msu(dot)su>, Teodor Sigaev <teodor(at)sigaev(dot)ru>
Subject: Re: ts_rank
Date: 2011-09-11 02:18:44
Message-ID: 201109110218.p8B2Iik21211@momjian.us
Views: Raw Message | Whole Thread | Download mbox | Resend email
Lists: pgsql-hackers

Mark wrote:
> >There's some potentially useful information here:
> >http://www.postgresql.org/docs/9.0/interactive/textsearch-controls.html#TEXTSEARCH-RANKING
>
> Thanks for reply. I was reading the documentation of PostgreSQL, but there
> it is not written the name of the used methods. Everywhere there is written,
> that ts_rank use standard ranking function. But it is difficult to say which
> is the standard function.
> Somewhere I found that it is maybe based on Vector space model and it seems
> to be truth, because in the code of tsrank.c is counted the frequency of
> words, but I am not sure of that :-(

Oleg, Teodor, can you give me a description of how ts_rank decided how
to rank items? Thanks.

--
Bruce Momjian <bruce(at)momjian(dot)us> http://momjian.us
EnterpriseDB http://enterprisedb.com

+ It's impossible for everything to be true. +


From: Bruce Momjian <bruce(at)momjian(dot)us>
To: Bruce Momjian <bruce(at)momjian(dot)us>
Cc: Mark <Marek(dot)Balgar(at)seznam(dot)cz>, pgsql-hackers(at)postgresql(dot)org, Oleg Bartunov <oleg(at)sai(dot)msu(dot)su>, Teodor Sigaev <teodor(at)sigaev(dot)ru>
Subject: Re: ts_rank
Date: 2011-10-12 21:31:35
Message-ID: 201110122131.p9CLVZo12669@momjian.us
Views: Raw Message | Whole Thread | Download mbox | Resend email
Lists: pgsql-hackers

Bruce Momjian wrote:
> Mark wrote:
> > >There's some potentially useful information here:
> > >http://www.postgresql.org/docs/9.0/interactive/textsearch-controls.html#TEXTSEARCH-RANKING
> >
> > Thanks for reply. I was reading the documentation of PostgreSQL, but there
> > it is not written the name of the used methods. Everywhere there is written,
> > that ts_rank use standard ranking function. But it is difficult to say which
> > is the standard function.
> > Somewhere I found that it is maybe based on Vector space model and it seems
> > to be truth, because in the code of tsrank.c is counted the frequency of
> > words, but I am not sure of that :-(
>
> Oleg, Teodor, can you give me a description of how ts_rank decided how
> to rank items? Thanks.

Any news on this question?

--
Bruce Momjian <bruce(at)momjian(dot)us> http://momjian.us
EnterpriseDB http://enterprisedb.com

+ It's impossible for everything to be true. +


From: Oleg Bartunov <oleg(at)sai(dot)msu(dot)su>
To: Bruce Momjian <bruce(at)momjian(dot)us>
Cc: Mark <Marek(dot)Balgar(at)seznam(dot)cz>, pgsql-hackers(at)postgresql(dot)org, Teodor Sigaev <teodor(at)sigaev(dot)ru>
Subject: Re: ts_rank
Date: 2011-10-13 11:23:14
Message-ID: Pine.LNX.4.64.1110131514570.26195@sn.sai.msu.ru
Views: Raw Message | Whole Thread | Download mbox | Resend email
Lists: pgsql-hackers

I'm sorry, my plane to Nepal is waiting me :) I'll be back in the
midst of November. In short, ts_rank is based only on frequencies of lexems
and doesn't count distance between query lexems. Also, it supports only
primitive queries.

Oleg
On Wed, 12 Oct 2011, Bruce Momjian wrote:

> Bruce Momjian wrote:
>> Mark wrote:
>>>> There's some potentially useful information here:
>>>> http://www.postgresql.org/docs/9.0/interactive/textsearch-controls.html#TEXTSEARCH-RANKING
>>>
>>> Thanks for reply. I was reading the documentation of PostgreSQL, but there
>>> it is not written the name of the used methods. Everywhere there is written,
>>> that ts_rank use standard ranking function. But it is difficult to say which
>>> is the standard function.
>>> Somewhere I found that it is maybe based on Vector space model and it seems
>>> to be truth, because in the code of tsrank.c is counted the frequency of
>>> words, but I am not sure of that :-(
>>
>> Oleg, Teodor, can you give me a description of how ts_rank decided how
>> to rank items? Thanks.
>
> Any news on this question?
>
>

Regards,
Oleg
_____________________________________________________________
Oleg Bartunov, Research Scientist, Head of AstroNet (www.astronet.ru),
Sternberg Astronomical Institute, Moscow University, Russia
Internet: oleg(at)sai(dot)msu(dot)su, http://www.sai.msu.su/~megera/
phone: +007(495)939-16-83, +007(495)939-23-83


From: Bruce Momjian <bruce(at)momjian(dot)us>
To: Oleg Bartunov <oleg(at)sai(dot)msu(dot)su>
Cc: Mark <Marek(dot)Balgar(at)seznam(dot)cz>, pgsql-hackers(at)postgresql(dot)org, Teodor Sigaev <teodor(at)sigaev(dot)ru>
Subject: Re: ts_rank
Date: 2011-10-13 18:17:48
Message-ID: 201110131817.p9DIHmi11048@momjian.us
Views: Raw Message | Whole Thread | Download mbox | Resend email
Lists: pgsql-hackers

Oleg Bartunov wrote:
> I'm sorry, my plane to Nepal is waiting me :) I'll be back in the
> midst of November. In short, ts_rank is based only on frequencies of lexems
> and doesn't count distance between query lexems. Also, it supports only
> primitive queries.

Thanks. Attached doc patch applied to head and 9.1.X.

---------------------------------------------------------------------------

>
> Oleg
> On Wed, 12 Oct 2011, Bruce Momjian wrote:
>
> > Bruce Momjian wrote:
> >> Mark wrote:
> >>>> There's some potentially useful information here:
> >>>> http://www.postgresql.org/docs/9.0/interactive/textsearch-controls.html#TEXTSEARCH-RANKING
> >>>
> >>> Thanks for reply. I was reading the documentation of PostgreSQL, but there
> >>> it is not written the name of the used methods. Everywhere there is written,
> >>> that ts_rank use standard ranking function. But it is difficult to say which
> >>> is the standard function.
> >>> Somewhere I found that it is maybe based on Vector space model and it seems
> >>> to be truth, because in the code of tsrank.c is counted the frequency of
> >>> words, but I am not sure of that :-(
> >>
> >> Oleg, Teodor, can you give me a description of how ts_rank decided how
> >> to rank items? Thanks.
> >
> > Any news on this question?
> >
> >
>
> Regards,
> Oleg
> _____________________________________________________________
> Oleg Bartunov, Research Scientist, Head of AstroNet (www.astronet.ru),
> Sternberg Astronomical Institute, Moscow University, Russia
> Internet: oleg(at)sai(dot)msu(dot)su, http://www.sai.msu.su/~megera/
> phone: +007(495)939-16-83, +007(495)939-23-83

--
Bruce Momjian <bruce(at)momjian(dot)us> http://momjian.us
EnterpriseDB http://enterprisedb.com

+ It's impossible for everything to be true. +

Attachment Content-Type Size
/rtmp/ts_rank text/x-diff 618 bytes