Re: Incorrect assumptions with low LIMITs

From: Jeff Davis <pgsql(at)j-davis(dot)com>
To: Simon Riggs <simon(at)2ndQuadrant(dot)com>
Cc: Greg Stark <stark(at)mit(dot)edu>, Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>, PostgreSQL-development <pgsql-hackers(at)postgresql(dot)org>
Subject: Re: Incorrect assumptions with low LIMITs
Date: 2012-03-20 01:19:23
Message-ID: 1332206363.3803.27.camel@sussancws0025
Lists: pgsql-hackers

On Sat, 2012-03-17 at 12:48 +0000, Simon Riggs wrote:
> The problems are as I described them
>
> (1) no account made for sparsity, and other factors leading to an
> overestimate of rows (N)
>
> (2) inappropriate assumption of the effect of LIMIT m, which causes a
> costly SeqScan to appear better than an IndexScan for low m/N, when in
> fact that is seldom the case.
>
> Overestimating N in (1) inverts the problem, so that an overestimate
> isn't the safe thing at all.

I think the actual problem has more to do with risk. The planner doesn't
know how uniformly the matching rows are distributed through the table,
which introduces risk for the table scan.
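
To make that concrete, my rough understanding of the costing (a sketch,
not the exact code) is that if the planner expects N matching rows and
the query wants only m of them, it charges the underlying path just the
fraction of its run cost needed to produce m rows, assuming matches are
spread uniformly:

    limit_cost ~= startup_cost + (total_cost - startup_cost) * (m / N)

So a SeqScan with a huge total cost can look cheap for small m, but
that only holds if the first matches really do appear early in the
scan.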

I would tend to agree that with a low selectivity fraction, a very low
LIMIT (e.g. 1-3 in your example), and a large table, a table scan
doesn't seem like a good risk. I don't know how that should be modeled
or implemented, though.
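
As a purely hypothetical illustration (table and column names
invented), the risky case looks something like:

    -- Suppose big_table has ~100M rows, only a few match col = 42,
    -- and those few happen to sit near the end of the heap.
    EXPLAIN ANALYZE
    SELECT * FROM big_table WHERE col = 42 LIMIT 1;

If N is overestimated, m/N looks tiny, the scaled-down SeqScan wins on
cost, and the executor may read most of the table before producing the
first row, where an IndexScan would have returned it almost
immediately.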

Regards,
Jeff Davis
