Re: ANALYZE sampling is too good

From: Andres Freund <andres(at)2ndquadrant(dot)com>
To: Peter Geoghegan <pg(at)heroku(dot)com>
Cc: Josh Berkus <josh(at)agliodbs(dot)com>, Greg Stark <stark(at)mit(dot)edu>, PostgreSQL-development <pgsql-hackers(at)postgresql(dot)org>
Subject: Re: ANALYZE sampling is too good
Date: 2013-12-06 09:21:14
Message-ID: 20131206092114.GH7814@awork2.anarazel.de
Lists: pgsql-hackers

On 2013-12-05 17:52:34 -0800, Peter Geoghegan wrote:
> Has anyone ever thought about opportunistic ANALYZE piggy-backing on
> other full-table scans? That doesn't really help Greg, because his
> complaint is mostly that a fresh ANALYZE is too expensive, but it
> could be an interesting, albeit risky approach.

What I've been thinking of is

a) making ANALYZE piggyback on the scans VACUUM is doing anyway, instead
of issuing separate ones all the time (where possible; ANALYZE needs to
run more frequently). Currently, by the time a separate ANALYZE pass
revisits the table, the relevant pages have quite likely already been
evicted from cache again. A rough sketch of the sampling side of that
follows below.
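
To make the idea concrete, here is a rough sketch in Python rather than
anything backend-ready (all names are made up): a standard reservoir
sample collected from whatever row stream the vacuum-style scan is
already producing, so the statistics pass needs no extra I/O of its own.

# Hypothetical sketch, not PostgreSQL code: build an ANALYZE-style row
# sample opportunistically from a scan some other operation is doing.
import random

def reservoir_sample(scan, target_size):
    """Fixed-size random sample from rows seen during 'scan' (Algorithm R)."""
    sample = []
    for i, row in enumerate(scan):
        if i < target_size:
            sample.append(row)
        else:
            # Replace an existing entry with probability target_size/(i+1),
            # keeping every row seen so far equally likely to survive.
            j = random.randint(0, i)
            if j < target_size:
                sample[j] = row
    return sample

# Usage (hypothetical): feed the sampler the same row iterator the
# vacuum-style scan consumes, e.g.
#   stats_sample = reservoir_sample(vacuum_scan_rows(table), 30000)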

b) making ANALYZE incremental. In lots of bigger tables most of the table
is static, and we actually *do* know that, thanks to the visibility
map. So keep a rawer form of what ends up in the catalogs around
somewhere, chunked by the region of the table each statistic was
gathered from. Every time a part of the table changes, re-sample only
that part, then recompute the aggregate. A sketch of that is below as
well.
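
Again only a hypothetical sketch in Python, with invented names, to show
the shape of it: per-region summaries kept around, only the regions
flagged as changed get re-sampled, and the table-wide numbers are
recomputed by merging the per-region summaries.

# Hypothetical sketch, not PostgreSQL code: incremental per-region stats.
class ChunkStats:
    def __init__(self, row_count, null_count, distinct_estimate):
        self.row_count = row_count
        self.null_count = null_count
        self.distinct_estimate = distinct_estimate

def refresh_stats(chunks, chunk_stats, changed_chunk_ids, sample_chunk):
    # Re-sample only the regions that were modified since the last run;
    # everything else keeps its previously gathered summary.
    for cid in changed_chunk_ids:
        chunk_stats[cid] = sample_chunk(chunks[cid])

    # Recompute the table-wide aggregate from the per-region summaries.
    # Row and null counts add up exactly; the distinct estimate below is
    # only a crude upper bound, a real implementation would want a
    # mergeable sketch (e.g. HyperLogLog) per region instead.
    total_rows = sum(s.row_count for s in chunk_stats.values())
    total_nulls = sum(s.null_count for s in chunk_stats.values())
    distinct_upper = sum(s.distinct_estimate for s in chunk_stats.values())
    return total_rows, total_nulls, distinct_upper

The interesting design question is which per-region representation is
rich enough to merge back into catalog-quality statistics; simple counts
merge trivially, ndistinct and MCVs need something cleverer.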

Greetings,

Andres Freund

--
Andres Freund http://www.2ndQuadrant.com/
PostgreSQL Development, 24x7 Support, Training & Services
