Lists: | pgsql-hackers |
---|
From: | "Kevin Grittner" <Kevin(dot)Grittner(at)wicourts(dot)gov> |
---|---|
To: | <robertmhaas(at)gmail(dot)com>,<Marek(dot)Balgar(at)seznam(dot)cz> |
Cc: | <pgsql-hackers(at)postgresql(dot)org> |
Subject: | Re: ts_rank |
Date: | 2011-05-20 02:42:32 |
Message-ID: | 4DD58EC8020000250003D997@gw.wicourts.gov |
Views: | Raw Message | Whole Thread | Download mbox | Resend email |
Lists: | pgsql-hackers |
Robert Haas wrote:
> Mark wrote:
>> Could somebody explain me on which methods is based ts_rank and
>> how it works? I would appreciate some articles, if exist.
> As far as I can tell, our documentation contains no useful
> information on this topic whatsoever. :-(
There's some potentially useful information here:
http://www.postgresql.org/docs/9.0/interactive/textsearch-controls.html#TEXTSEARCH-RANKING
Although I don't know if it addresses Mark's question very well.
Personally, I wonder how relevant ts_rank will be after knn-giswt
is out....
-Kevin
From: | Robert Haas <robertmhaas(at)gmail(dot)com> |
---|---|
To: | Kevin Grittner <Kevin(dot)Grittner(at)wicourts(dot)gov> |
Cc: | Marek(dot)Balgar(at)seznam(dot)cz, pgsql-hackers(at)postgresql(dot)org |
Subject: | Re: ts_rank |
Date: | 2011-05-20 15:27:08 |
Message-ID: | BANLkTiksH9BG0Y4X17jdxMuLOCAtaLJsZg@mail.gmail.com |
Views: | Raw Message | Whole Thread | Download mbox | Resend email |
Lists: | pgsql-hackers |
On Thu, May 19, 2011 at 10:42 PM, Kevin Grittner
<Kevin(dot)Grittner(at)wicourts(dot)gov> wrote:
> Robert Haas wrote:
>> Mark wrote:
>
>>> Could somebody explain me on which methods is based ts_rank and
>>> how it works? I would appreciate some articles, if exist.
>
>> As far as I can tell, our documentation contains no useful
>> information on this topic whatsoever. :-(
>
> There's some potentially useful information here:
>
> http://www.postgresql.org/docs/9.0/interactive/textsearch-controls.html#TEXTSEARCH-RANKING
Ah, yes. I didn't read that carefully enough. That is useful, but it
doesn't really explain how it works.
> Although I don't know if it addresses Mark's question very well.
> Personally, I wonder how relevant ts_rank will be after knn-giswt
> is out....
I don't see why it would be any less useful... though if someone could
find a way to KNN-ify such searches, I'm sure there would be a lot of
very happy users. Seems pretty difficult, though.
--
Robert Haas
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company
From: | Mark <Marek(dot)Balgar(at)seznam(dot)cz> |
---|---|
To: | pgsql-hackers(at)postgresql(dot)org |
Subject: | Re: ts_rank |
Date: | 2011-05-21 11:41:34 |
Message-ID: | 1305978094293-4414631.post@n5.nabble.com |
Views: | Raw Message | Whole Thread | Download mbox | Resend email |
Lists: | pgsql-hackers |
>There's some potentially useful information here:
>http://www.postgresql.org/docs/9.0/interactive/textsearch-controls.html#TEXTSEARCH-RANKING
Thanks for reply. I was reading the documentation of PostgreSQL, but there
it is not written the name of the used methods. Everywhere there is written,
that ts_rank use standard ranking function. But it is difficult to say which
is the standard function.
Somewhere I found that it is maybe based on Vector space model and it seems
to be truth, because in the code of tsrank.c is counted the frequency of
words, but I am not sure of that :-(
--
View this message in context: http://postgresql.1045698.n5.nabble.com/ts-rank-tp4384614p4414631.html
Sent from the PostgreSQL - hackers mailing list archive at Nabble.com.
From: | Bruce Momjian <bruce(at)momjian(dot)us> |
---|---|
To: | Mark <Marek(dot)Balgar(at)seznam(dot)cz> |
Cc: | pgsql-hackers(at)postgresql(dot)org, Oleg Bartunov <oleg(at)sai(dot)msu(dot)su>, Teodor Sigaev <teodor(at)sigaev(dot)ru> |
Subject: | Re: ts_rank |
Date: | 2011-09-11 02:18:44 |
Message-ID: | 201109110218.p8B2Iik21211@momjian.us |
Views: | Raw Message | Whole Thread | Download mbox | Resend email |
Lists: | pgsql-hackers |
Mark wrote:
> >There's some potentially useful information here:
> >http://www.postgresql.org/docs/9.0/interactive/textsearch-controls.html#TEXTSEARCH-RANKING
>
> Thanks for reply. I was reading the documentation of PostgreSQL, but there
> it is not written the name of the used methods. Everywhere there is written,
> that ts_rank use standard ranking function. But it is difficult to say which
> is the standard function.
> Somewhere I found that it is maybe based on Vector space model and it seems
> to be truth, because in the code of tsrank.c is counted the frequency of
> words, but I am not sure of that :-(
Oleg, Teodor, can you give me a description of how ts_rank decided how
to rank items? Thanks.
--
Bruce Momjian <bruce(at)momjian(dot)us> http://momjian.us
EnterpriseDB http://enterprisedb.com
+ It's impossible for everything to be true. +
From: | Bruce Momjian <bruce(at)momjian(dot)us> |
---|---|
To: | Bruce Momjian <bruce(at)momjian(dot)us> |
Cc: | Mark <Marek(dot)Balgar(at)seznam(dot)cz>, pgsql-hackers(at)postgresql(dot)org, Oleg Bartunov <oleg(at)sai(dot)msu(dot)su>, Teodor Sigaev <teodor(at)sigaev(dot)ru> |
Subject: | Re: ts_rank |
Date: | 2011-10-12 21:31:35 |
Message-ID: | 201110122131.p9CLVZo12669@momjian.us |
Views: | Raw Message | Whole Thread | Download mbox | Resend email |
Lists: | pgsql-hackers |
Bruce Momjian wrote:
> Mark wrote:
> > >There's some potentially useful information here:
> > >http://www.postgresql.org/docs/9.0/interactive/textsearch-controls.html#TEXTSEARCH-RANKING
> >
> > Thanks for reply. I was reading the documentation of PostgreSQL, but there
> > it is not written the name of the used methods. Everywhere there is written,
> > that ts_rank use standard ranking function. But it is difficult to say which
> > is the standard function.
> > Somewhere I found that it is maybe based on Vector space model and it seems
> > to be truth, because in the code of tsrank.c is counted the frequency of
> > words, but I am not sure of that :-(
>
> Oleg, Teodor, can you give me a description of how ts_rank decided how
> to rank items? Thanks.
Any news on this question?
--
Bruce Momjian <bruce(at)momjian(dot)us> http://momjian.us
EnterpriseDB http://enterprisedb.com
+ It's impossible for everything to be true. +
From: | Oleg Bartunov <oleg(at)sai(dot)msu(dot)su> |
---|---|
To: | Bruce Momjian <bruce(at)momjian(dot)us> |
Cc: | Mark <Marek(dot)Balgar(at)seznam(dot)cz>, pgsql-hackers(at)postgresql(dot)org, Teodor Sigaev <teodor(at)sigaev(dot)ru> |
Subject: | Re: ts_rank |
Date: | 2011-10-13 11:23:14 |
Message-ID: | Pine.LNX.4.64.1110131514570.26195@sn.sai.msu.ru |
Views: | Raw Message | Whole Thread | Download mbox | Resend email |
Lists: | pgsql-hackers |
I'm sorry, my plane to Nepal is waiting me :) I'll be back in the
midst of November. In short, ts_rank is based only on frequencies of lexems
and doesn't count distance between query lexems. Also, it supports only
primitive queries.
Oleg
On Wed, 12 Oct 2011, Bruce Momjian wrote:
> Bruce Momjian wrote:
>> Mark wrote:
>>>> There's some potentially useful information here:
>>>> http://www.postgresql.org/docs/9.0/interactive/textsearch-controls.html#TEXTSEARCH-RANKING
>>>
>>> Thanks for reply. I was reading the documentation of PostgreSQL, but there
>>> it is not written the name of the used methods. Everywhere there is written,
>>> that ts_rank use standard ranking function. But it is difficult to say which
>>> is the standard function.
>>> Somewhere I found that it is maybe based on Vector space model and it seems
>>> to be truth, because in the code of tsrank.c is counted the frequency of
>>> words, but I am not sure of that :-(
>>
>> Oleg, Teodor, can you give me a description of how ts_rank decided how
>> to rank items? Thanks.
>
> Any news on this question?
>
>
Regards,
Oleg
_____________________________________________________________
Oleg Bartunov, Research Scientist, Head of AstroNet (www.astronet.ru),
Sternberg Astronomical Institute, Moscow University, Russia
Internet: oleg(at)sai(dot)msu(dot)su, http://www.sai.msu.su/~megera/
phone: +007(495)939-16-83, +007(495)939-23-83
From: | Bruce Momjian <bruce(at)momjian(dot)us> |
---|---|
To: | Oleg Bartunov <oleg(at)sai(dot)msu(dot)su> |
Cc: | Mark <Marek(dot)Balgar(at)seznam(dot)cz>, pgsql-hackers(at)postgresql(dot)org, Teodor Sigaev <teodor(at)sigaev(dot)ru> |
Subject: | Re: ts_rank |
Date: | 2011-10-13 18:17:48 |
Message-ID: | 201110131817.p9DIHmi11048@momjian.us |
Views: | Raw Message | Whole Thread | Download mbox | Resend email |
Lists: | pgsql-hackers |
Oleg Bartunov wrote:
> I'm sorry, my plane to Nepal is waiting me :) I'll be back in the
> midst of November. In short, ts_rank is based only on frequencies of lexems
> and doesn't count distance between query lexems. Also, it supports only
> primitive queries.
Thanks. Attached doc patch applied to head and 9.1.X.
---------------------------------------------------------------------------
>
> Oleg
> On Wed, 12 Oct 2011, Bruce Momjian wrote:
>
> > Bruce Momjian wrote:
> >> Mark wrote:
> >>>> There's some potentially useful information here:
> >>>> http://www.postgresql.org/docs/9.0/interactive/textsearch-controls.html#TEXTSEARCH-RANKING
> >>>
> >>> Thanks for reply. I was reading the documentation of PostgreSQL, but there
> >>> it is not written the name of the used methods. Everywhere there is written,
> >>> that ts_rank use standard ranking function. But it is difficult to say which
> >>> is the standard function.
> >>> Somewhere I found that it is maybe based on Vector space model and it seems
> >>> to be truth, because in the code of tsrank.c is counted the frequency of
> >>> words, but I am not sure of that :-(
> >>
> >> Oleg, Teodor, can you give me a description of how ts_rank decided how
> >> to rank items? Thanks.
> >
> > Any news on this question?
> >
> >
>
> Regards,
> Oleg
> _____________________________________________________________
> Oleg Bartunov, Research Scientist, Head of AstroNet (www.astronet.ru),
> Sternberg Astronomical Institute, Moscow University, Russia
> Internet: oleg(at)sai(dot)msu(dot)su, http://www.sai.msu.su/~megera/
> phone: +007(495)939-16-83, +007(495)939-23-83
--
Bruce Momjian <bruce(at)momjian(dot)us> http://momjian.us
EnterpriseDB http://enterprisedb.com
+ It's impossible for everything to be true. +
Attachment | Content-Type | Size |
---|---|---|
/rtmp/ts_rank | text/x-diff | 618 bytes |