Re: ANALYZE sampling is too good

From: Greg Stark <stark(at)mit(dot)edu>
To: Josh Berkus <josh(at)agliodbs(dot)com>
Cc: PostgreSQL-development <pgsql-hackers(at)postgresql(dot)org>
Subject: Re: ANALYZE sampling is too good
Date: 2013-12-10 19:59:29
Message-ID: CAM-w4HMLgydObQ-XKBDzFk3zm-rH_X8mXzxA0nX7WKwt03gNZA@mail.gmail.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-hackers

On Tue, Dec 10, 2013 at 7:54 PM, Josh Berkus <josh(at)agliodbs(dot)com> wrote:
> As discussed, we need math though. Does anyone have an ACM subscription
> and time to do a search? Someone must. We can buy one with community
> funds, but no reason to do so if we don't have to.

Anyone in a university likely has access through their library.

But I don't really think this is the right way to go about this.
Research papers are going to turn up pretty specialized solutions that
are probably patented. We don't even have the basic understanding we
need. I suspect a basic textbook chapter on multistage sampling will
discuss at least the standard techniques.

Once we have a handle on the standard multistage sampling techniques
that would be safe from patents then we might want to go look at
research papers to find how they've been applied to databases in the
past but we would have to do that fairly carefully.

--
greg

In response to

Responses

Browse pgsql-hackers by date

  From Date Subject
Next Message Simon Riggs 2013-12-10 20:00:49 Re: ANALYZE sampling is too good
Previous Message Josh Berkus 2013-12-10 19:54:57 Re: ANALYZE sampling is too good