Re: to_tsvector in 8.2.3


  • From: Magnus Hagander <magnus(at)hagander(dot)net>
  • To: Teodor Sigaev <teodor(at)sigaev(dot)ru>
  • Cc: richardcraig <richard(at)v3fm(dot)com>, pgsql-general <pgsql-general(at)postgresql(dot)org>
  • Subject: Re: to_tsvector in 8.2.3
  • Date: Thu, 22 Mar 2007 14:47:32 +0100
  • Message-id: <20070322134732(dot)GB5635(at)svr2(dot)hagander(dot)net>

On Wed, Mar 21, 2007 at 09:13:55PM +0300, Teodor Sigaev wrote:
> >postgres=# select to_tsvector('test text');
> >  to_tsvector
> >---------------
> > 'test text':1
> >(1 row)
> OK, that's related to the
> http://developer.postgresql.org/cvsweb.cgi/pgsql/contrib/tsearch2/wordparser/parser.c.diff?r1=1.11;r2=1.12;f=h
> commit. Thomas pointed out that it could be a non-breaking space (0xa0), and
> that commit assumes that, with the C locale and a multibyte encoding, any
> character greater than 0x7f is alpha.
> To check the theory, please apply the attached patch.
> 
> If so, I'm confused: we cannot assume that 0xa0 is a space symbol in any
> multibyte encoding, even on Windows.

Nope, same result with this patch.
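
For anyone following along, the theory being tested can be sketched roughly as below. This is only an illustration and not the tsearch2 parser code: is_word_byte and count_words are made-up names, and the "every byte above 0x7f is alpha" rule is taken from the description quoted above. Since the patch did not change the result here, treat it as a picture of the hypothesis rather than a confirmed cause.

```c
#include <assert.h>
#include <ctype.h>
#include <stddef.h>

/* Hypothetical stand-in for the parser's character test.
 * Assumption: with the C locale and a multibyte encoding, every byte
 * above 0x7f is classified as part of a word. */
static int is_word_byte(unsigned char c)
{
    if (c > 0x7f)
        return 1;               /* includes 0xa0, the non-breaking space */
    return isalnum(c) != 0;     /* plain ASCII letters and digits */
}

/* Count maximal runs of word bytes in a NUL-terminated string. */
static size_t count_words(const char *s)
{
    size_t words = 0;
    int in_word = 0;

    assert(s != NULL);
    for (; *s != '\0'; s++)
    {
        int w = is_word_byte((unsigned char) *s);

        if (w && !in_word)
            words++;            /* a new run of word bytes starts here */
        in_word = w;
    }
    return words;
}
```

Under that assumption, "test text" with an ordinary space splits into two words, while the same text with a 0xa0 byte in place of the space is counted as a single word, which would match the 'test text':1 output.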

//Magnus



