Re: ANALYZE sampling is too good

From: Florian Pflug <fgp(at)phlo(dot)org>
To: Claudio Freire <klaussfreire(at)gmail(dot)com>
Cc: Greg Stark <stark(at)mit(dot)edu>, Albe Laurenz <laurenz(dot)albe(at)wien(dot)gv(dot)at>, Mark Kirkwood <mark(dot)kirkwood(at)catalyst(dot)net(dot)nz>, PostgreSQL-development <pgsql-hackers(at)postgresql(dot)org>, Robert Haas <robertmhaas(at)gmail(dot)com>, Josh Berkus <josh(at)agliodbs(dot)com>
Subject: Re: ANALYZE sampling is too good
Date: 2013-12-11 14:28:51
Message-ID: 848AB714-0DF8-493C-BA8F-7F49102B4E10@phlo.org
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-hackers

On Dec10, 2013, at 15:32 , Claudio Freire <klaussfreire(at)gmail(dot)com> wrote:
> On Tue, Dec 10, 2013 at 11:02 AM, Greg Stark <stark(at)mit(dot)edu> wrote:
>>
>> On 10 Dec 2013 08:28, "Albe Laurenz" <laurenz(dot)albe(at)wien(dot)gv(dot)at> wrote:
>>>
>>>
>>> Doesn't all that assume a normally distributed random variable?
>>
>> I don't think so because of the law of large numbers. If you have a large
>> population and sample it the sample behaves like a normal distribution when
>> if the distribution of the population isn't.
>
> No, the large population says that if you have an AVERAGE of many
> samples of a random variable, the random variable that is the AVERAGE
> behaves like a normal.

Actually, that's the central limit theorem, and it doesn't hold for all
random variables, only for those with finite expected value and variance.

The law of large numbers, in contrast, only tells you that the AVERAGE of
n samples of a random variable will converge to the random variables'
expected value as n goes to infinity (there are different versions of the
law which guarantee different kinds of convergence, weak or strong).

best regards,
Florian Pflug

In response to

Browse pgsql-hackers by date

  From Date Subject
Next Message email 2013-12-11 14:30:04 BUG #8676: Bug Money JSON
Previous Message Kevin Grittner 2013-12-11 14:26:58 Re: Why the buildfarm is all pink