Re: A counter productive conversation about search.


  • From: "Joshua D. Drake" <jd(at)commandprompt(dot)com>
  • To: Dave Page <dpage(at)vale-housing(dot)co(dot)uk>
  • Cc: PostgreSQL WWW <pgsql-www(at)postgresql(dot)org>
  • Subject: Re: A counter productive conversation about search.
  • Date: Tue, 29 Aug 2006 07:28:05 -0700
  • Message-id: <44F44EF5.3090102@commandprompt.com>


Other options include Lucene, and rolling our own.

Is Lucene capable of handling the size of our index? This has always
been the problem we've had with other projects like MnogoSearch. They
work well until you load them up with the archives, after which they
simply can't cope without ridiculous amounts of hardware.

I am going to say "yes" without any actual knowledge of Lucene, but that is because I am putting more trust in the fact that it is an Apache project than in anything else. I will check.

Rolling our own really wouldn't be that hard "if" we can create a reasonably smart web page grabber. We have all the tools (tsearch2 and pg_trgm) to easily do the searches.
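To make the trigram side of that concrete, here is a rough Python sketch of the kind of similarity score a trigram matcher computes. It is only an approximation written for illustration, not the module's actual code: the function names, the word-splitting regex, and the padding rule (two leading spaces, one trailing space per word) are assumptions chosen to mirror the commonly documented behaviour.

```python
import re


def trigrams(text):
    """Return the set of 3-character substrings of each padded, lowercased word."""
    grams = set()
    for word in re.findall(r"\w+", text.lower()):
        # Assumed padding: two spaces before and one after each word.
        padded = "  " + word + " "
        grams.update(padded[i:i + 3] for i in range(len(padded) - 2))
    return grams


def similarity(a, b):
    """Shared trigrams divided by total distinct trigrams (0.0 to 1.0)."""
    ta, tb = trigrams(a), trigrams(b)
    if not ta or not tb:
        return 0.0
    return len(ta & tb) / len(ta | tb)
```

A misspelled query such as "postgers" still shares several trigrams with "postgres", so it scores above zero, which is what makes this approach useful for fuzzy matching alongside full-text search.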

So is anyone up for helping develop a page grabber?

We have one - it builds the static version of the main site by spidering
it hourly.
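For anyone wondering what the core of a page grabber involves, here is a minimal Python sketch. It is not the existing spider mentioned above; the names PageExtractor, extract, and crawl are hypothetical, and the fetch argument is an assumed callable that returns the HTML for a URL, so the download mechanism is left open.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urldefrag, urljoin, urlparse


class PageExtractor(HTMLParser):
    """Collect visible text and outgoing links from one HTML page."""

    SKIP = {"script", "style"}

    def __init__(self, base_url):
        super().__init__(convert_charrefs=True)
        self.base_url = base_url
        self.links = []
        self.chunks = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP:
            self._skip_depth += 1
        elif tag == "a":
            href = dict(attrs).get("href")
            if href:
                # Resolve relative links and drop any #fragment.
                absolute, _fragment = urldefrag(urljoin(self.base_url, href))
                self.links.append(absolute)

    def handle_endtag(self, tag):
        if tag in self.SKIP and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Ignore script/style bodies and whitespace-only runs.
        if not self._skip_depth and data.strip():
            self.chunks.append(data.strip())


def extract(html, base_url):
    """Return (page text, same-host links) for one page."""
    parser = PageExtractor(base_url)
    parser.feed(html)
    parser.close()
    host = urlparse(base_url).netloc
    same_site = [u for u in parser.links if urlparse(u).netloc == host]
    return " ".join(parser.chunks), same_site


def crawl(start_url, fetch, limit=100):
    """Breadth-first walk from start_url; fetch(url) must return HTML text."""
    seen, queue, pages = {start_url}, deque([start_url]), {}
    while queue and len(pages) < limit:
        url = queue.popleft()
        text, links = extract(fetch(url), url)
        pages[url] = text
        for link in links:
            if link not in seen:
                seen.add(link)
                queue.append(link)
    return pages
```

The text collected for each URL is what would then be loaded into a table and indexed for searching. Restricting the walk to the starting host and capping the page count are design choices made here to keep the sketch from wandering off site.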

Should we look at that then?


Regards, Dave.



--

   === The PostgreSQL Company: Command Prompt, Inc. ===
Sales/Support: +1.503.667.4564 || 24x7/Emergency: +1.800.492.2240
   Providing the most comprehensive  PostgreSQL solutions since 1997
             http://www.commandprompt.com/




