Re: ANALYZE sampling is too good

From: Claudio Freire <klaussfreire(at)gmail(dot)com>
To: Josh Berkus <josh(at)agliodbs(dot)com>
Cc: PostgreSQL-Dev <pgsql-hackers(at)postgresql(dot)org>
Subject: Re: ANALYZE sampling is too good
Date: 2013-12-12 19:13:23
Message-ID: CAGTBQpY+znTQOujv3yV38NzF06OnK_wakudNb+ZRRL8JsG6QBQ@mail.gmail.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-hackers

On Thu, Dec 12, 2013 at 3:56 PM, Josh Berkus <josh(at)agliodbs(dot)com> wrote:
>
> Estimated grouping should, however, affect MCVs. In cases where we
> estimate that grouping levels are high, the expected % of observed
> values should be "discounted" somehow. That is, with total random
> distribution you have a 1:1 ratio between observed frequency of a value
> and assumed frequency. However, with highly grouped values, you might
> have a 2:1 ratio.

Cross validation can help there. But it's costly.

In response to

Browse pgsql-hackers by date

  From Date Subject
Next Message Jeff Janes 2013-12-12 19:13:55 Re: ANALYZE sampling is too good
Previous Message Josh Berkus 2013-12-12 18:56:42 Re: ANALYZE sampling is too good