Skip site navigation (1) Skip section navigation (2)

Peripheral Links

Header And Logo

PostgreSQL
| The world's most advanced open source database.

Site Navigation

Search archives
  Advanced Search

lexeme ordering in tsvector


  • From: Sushant Sinha <sushant354(at)gmail(dot)com>
  • To: pgsql-hackers(at)postgresql(dot)org
  • Subject: lexeme ordering in tsvector
  • Date: Mon, 30 Nov 2009 23:35:22 +0530
  • Message-id: <1259604322.3191.7.camel@dragflick> <text/plain>

It seems like the ordering of lexemes in tsvector has changed from 8.3
to 8.4.

For example in 8.3.1,

postgres=# select to_tsvector('english', 'quit everytime');
      to_tsvector      
-----------------------
 'quit':1 'everytim':2

The lexemes are arranged by length and then by string comparison.

In postgres 8.4.1,

select to_tsvector('english', 'quit everytime');
      to_tsvector      
-----------------------
 'everytim':2 'quit':1

they are arranged by strncmp and then by length.

I looked in tsvector_op.c, in the function tsCompareString, first memcmp
and then length comparison is done.

Was this change in ordering deliberate?

Wouldn't length comparison be cheaper than memcmp?

-Sushant.




Home | Main Index | Thread Index

Privacy Policy | About PostgreSQL
Copyright © 1996 – 2012 PostgreSQL Global Development Group