Re: A counter productive conversation about search.
- From: "Joshua D. Drake" <jd(at)commandprompt(dot)com>
- To: Dave Page <dpage(at)vale-housing(dot)co(dot)uk>
- Cc: PostgreSQL WWW <pgsql-www(at)postgresql(dot)org>
- Subject: Re: A counter productive conversation about search.
- Date: Tue, 29 Aug 2006 07:28:05 -0700
- Message-id: <44F44EF5.3090102@commandprompt.com> <text/plain>
Other options include Lucene, and rolling our own.
Is Lucene capable of handling the size of our index? This has always
been the problem we've had with other projects like MnogoSearch. They
work well until you load them up with the archives, after which they
simply can't cope without ridiculous amounts of hardware.
I am going to say "yes" without any actual knowledge of Lucene, but
that is because I am putting more trust in the fact that it is an
Apache project than in anything else. I will check.
Rolling our own really wouldn't be that hard "if" we can create a
reasonably smart web page grabber. We have all the tools (tsearch2 and
pg_trgm) to easily do the searches.
So is anyone up for helping develop a page grabber?
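For illustration only, here is a minimal sketch of what such a page grabber might look like. It is not the existing site spider and not anything from this thread: the names PageParser, grab, fetch and max_pages are made up for the example, and it assumes a breadth-first crawl that stays on one host and keeps only the visible text of each page.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urldefrag, urljoin, urlparse


class PageParser(HTMLParser):
    """Collects href targets and visible text from one HTML page."""

    def __init__(self):
        super().__init__()
        self.links = []
        self.text_parts = []
        self._skip_depth = 0  # greater than 0 while inside <script> or <style>

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1
        elif tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.links.append(href)

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep only non-empty text that is not script or style content.
        if not self._skip_depth and data.strip():
            self.text_parts.append(data.strip())


def grab(start_url, fetch, max_pages=1000):
    """Breadth-first crawl of a single host.

    `fetch` is a hypothetical callable: it takes a URL and returns the
    page's HTML as a string, or None if the page cannot be retrieved.
    Returns a dict mapping each crawled URL to its visible text.
    """
    host = urlparse(start_url).netloc
    queue = deque([start_url])
    seen = {start_url}
    pages = {}
    while queue and len(pages) < max_pages:
        url = queue.popleft()
        html = fetch(url)
        if html is None:
            continue
        parser = PageParser()
        parser.feed(html)
        pages[url] = " ".join(parser.text_parts)
        for href in parser.links:
            # Resolve relative links and drop "#fragment" suffixes.
            target = urldefrag(urljoin(url, href)).url
            if urlparse(target).netloc == host and target not in seen:
                seen.add(target)
                queue.append(target)
    return pages
```

Passing fetch in as a parameter keeps the crawl logic testable without network access; a real run would presumably wrap an HTTP client there, and the returned text would then be loaded into a table for tsearch2 or pg_trgm to index.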
We have one - it builds the static version of the main site by spidering
it hourly.
Should we look at that then?
Regards, Dave.
--
=== The PostgreSQL Company: Command Prompt, Inc. ===
Sales/Support: +1.503.667.4564 || 24x7/Emergency: +1.800.492.2240
Providing the most comprehensive PostgreSQL solutions since 1997
http://www.commandprompt.com/