Quick Links

Re: CPU-intensive autovacuuming

Lists:	pgsql-generalpgsql-hackers

From:	Phil Endecott <spam_from_postgresql_general(at)chezphil(dot)org>
To:	pgsql-general(at)postgresql(dot)org
Subject:	Re: CPU-intensive autovacuuming
Date:	2005-06-06 15:16:22
Message-ID:	42A468C6.6040603@chezphil.org
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

Following up on my own post from last night:

> Could it be that there is some code in autovacuum that is O(n^2) in
> the number of tables?

Browsing the code using webcvs, I have found this:

for (j = 0; j < PQntuples(res); j++)
{
tbl_elem = DLGetHead(dbs->table_list);
while (tbl_elem != NULL)
{

I haven't really tried to understand what is going on in here, but it
does look like it is getting the result of the "pg_class join stats"
query and then matching it up against its internal list of tables using
nested loops, which is undoubtedly O(n^2) in the number of tables.

Have I correctly understood what is going on here?

--Phil.

From:	"Matthew T(dot) O'Connor" <matthew(at)zeut(dot)net>
To:	Phil Endecott <spam_from_postgresql_general(at)chezphil(dot)org>
Cc:	pgsql-general(at)postgresql(dot)org
Subject:	Re: CPU-intensive autovacuuming
Date:	2005-06-07 00:33:26
Message-ID:	42A4EB56.5080905@zeut.net
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

Phil Endecott wrote:

> Following up on my own post from last night:
>
> > Could it be that there is some code in autovacuum that is O(n^2) in
> > the number of tables?
>
> Browsing the code using webcvs, I have found this:
>
> for (j = 0; j < PQntuples(res); j++)
> {
> tbl_elem = DLGetHead(dbs->table_list);
> while (tbl_elem != NULL)
> {
>
> I haven't really tried to understand what is going on in here, but it
> does look like it is getting the result of the "pg_class join stats"
> query and then matching it up against its internal list of tables
> using nested loops, which is undoubtedly O(n^2) in the number of tables.
>
> Have I correctly understood what is going on here?

Indeed you have. I have head a few similar reports but perhaps none as
bad as yours. One person put a small sleep value so that it doesn't
spin so tight. You could also just up the sleep delay so that it
doesn't do this work quite so often. No other quick suggestions.

From:	Phil Endecott <spam_from_postgresql_general(at)chezphil(dot)org>
To:	"Matthew T(dot) O'Connor" <matthew(at)zeut(dot)net>
Cc:	pgsql-general(at)postgresql(dot)org
Subject:	Re: CPU-intensive autovacuuming
Date:	2005-06-07 11:16:19
Message-ID:	42A58203.8070801@chezphil.org
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

Matthew T. O'Connor wrote:
> Phil Endecott wrote:
>> > Could it be that there is some code in autovacuum that is O(n^2) in
>> > the number of tables?
>>
>> Browsing the code using webcvs, I have found this:
>>
>> for (j = 0; j < PQntuples(res); j++)
>> {
>> tbl_elem = DLGetHead(dbs->table_list);
>> while (tbl_elem != NULL)
>> {
>> Have I correctly understood what is going on here?

> Indeed you have. I have head a few similar reports but perhaps none as
> bad as yours. One person put a small sleep value so that it doesn't
> spin so tight. You could also just up the sleep delay so that it
> doesn't do this work quite so often. No other quick suggestions.

I do wonder why autovacuum is keeping its table list in memory rather
than in the database.

But given that it is keeping it in memory, I think the real fix is to
sort that list (or keep it ordered when building or updating it). It is
trivial to also get the query results ordered, and they can then be
compared in O(n) time.

I notice various other places where there seem to be nested loops, e.g.
in the update_table_list function. I'm not sure if they can be fixed by
similar means.

--Phil.

From:	"Matthew T(dot) O'Connor" <matthew(at)zeut(dot)net>
To:	Phil Endecott <spam_from_postgresql_general(at)chezphil(dot)org>
Cc:	pgsql-general(at)postgresql(dot)org
Subject:	Re: CPU-intensive autovacuuming
Date:	2005-06-07 15:16:06
Message-ID:	42A5BA36.6060103@zeut.net
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

Phil Endecott wrote:

> Matthew T. O'Connor wrote:
>
>> Indeed you have. I have head a few similar reports but perhaps none
>> as bad as yours. One person put a small sleep value so that it
>> doesn't spin so tight. You could also just up the sleep delay so
>> that it doesn't do this work quite so often. No other quick
>> suggestions.
>
>
> I do wonder why autovacuum is keeping its table list in memory rather
> than in the database.

For better or worse, this was a conscious design decision that the
contrib version of autovacuum be non-invasive to your database.

> But given that it is keeping it in memory, I think the real fix is to
> sort that list (or keep it ordered when building or updating it). It
> is trivial to also get the query results ordered, and they can then be
> compared in O(n) time.

I'm quite sure there is a better way, please submit a patch if you can.
This was never a real concern for most people since the number of tables
is typically small enough not to be a problem. The integrated version
of autovacuum that didn't make the cut before 8.0 avoids this problem
since the autovacuum data is stored in the database.

> I notice various other places where there seem to be nested loops,
> e.g. in the update_table_list function. I'm not sure if they can be
> fixed by similar means.

I would think so, they all basically do the same type of loop.

From:	Phil Endecott <spam_from_postgresql_general(at)chezphil(dot)org>
To:	"Matthew T(dot) O'Connor" <matthew(at)zeut(dot)net>
Cc:	pgsql-general(at)postgresql(dot)org
Subject:	Re: CPU-intensive autovacuuming
Date:	2005-06-07 15:35:31
Message-ID:	42A5BEC3.4080409@chezphil.org
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

Matthew T. O'Connor wrote:
> The integrated version
> of autovacuum that didn't make the cut before 8.0 avoids this problem
> since the autovacuum data is stored in the database.

What is the status of this? Is it something that will be included in
8.1 or 8.0.n? I might be able to patch the current code but that
doesn't seem like a useful thing to do if a better solution will arrive
eventually. I am currently running vacuums from a cron job and I think
I will be happy with that for the time being.

(Incidentally, I have also found that the indexes on my pg_attributes
table were taking up over half a gigabyte, which came down to less than
40 megs after reindexing them. Is there a case for having autovacuum
also call reindex?)

--Phil.

From:	"Matthew T(dot) O'Connor" <matthew(at)zeut(dot)net>
To:	Phil Endecott <spam_from_postgresql_general(at)chezphil(dot)org>
Cc:	pgsql-general(at)postgresql(dot)org
Subject:	Re: CPU-intensive autovacuuming
Date:	2005-06-07 16:00:54
Message-ID:	42A5C4B6.8070908@zeut.net
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

Phil Endecott wrote:

> Matthew T. O'Connor wrote:
>
>> The integrated version of autovacuum that didn't make the cut before
>> 8.0 avoids this problem since the autovacuum data is stored in the
>> database.
>
>
> What is the status of this? Is it something that will be included in
> 8.1 or 8.0.n? I might be able to patch the current code but that
> doesn't seem like a useful thing to do if a better solution will
> arrive eventually. I am currently running vacuums from a cron job and
> I think I will be happy with that for the time being.

This is a good question :-) I have been so busy with work lately that I
have not been able to work on it. I am currently trying to resurrect
the patch I sent in for 8.0 and update it so that it applies against
HEAD. Once that is done, I will need help from someone with the
portions of the work that I'm not comfortable / capable of. The main
issue with the version I created during the 8.0 devel cycle it used
libpq to connect, query and issue commands against the databases. This
was deemed bad, and I need help setting up the infrastructure to make
this happen without libpq. I hope to have my patch applying against
HEAD sometime this week but it probably won't happen till next week.

So the summary of the autovacuum integration status is that we are fast
running out of time (feature freeze July 1), and I have very little time
to devote to this task. So you might want to submit your O(n) patch
cause unfortunately it looks like integrated autovacuum might slip
another release unless someone else steps up to work on it.

> (Incidentally, I have also found that the indexes on my pg_attributes
> table were taking up over half a gigabyte, which came down to less
> than 40 megs after reindexing them. Is there a case for having
> autovacuum also call reindex?)

Yes there is certainly some merit to having autovacuum or something
similar perform other system maintenance tasks such as reindexing. I
just haven't taken it there yet. It does seem strange that your
pg_attributes table go that big, anyone have any insight here? You did
say you are using 7.4.2, I forget it that has the index reclaiming code
in vacuum, also there are some autovacuum bugs in the early 7.4.x
releases. You might try to upgrade to either 8.0.x or a later 7.4.x
release.

Matthew O'Connor

From:	Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>
To:	Phil Endecott <spam_from_postgresql_general(at)chezphil(dot)org>
Cc:	"Matthew T(dot) O'Connor" <matthew(at)zeut(dot)net>, pgsql-general(at)postgresql(dot)org
Subject:	Re: CPU-intensive autovacuuming
Date:	2005-06-07 16:04:28
Message-ID:	200506071604.j57G4Sr18799@candle.pha.pa.us
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

Phil Endecott wrote:
> Matthew T. O'Connor wrote:
> > The integrated version
> > of autovacuum that didn't make the cut before 8.0 avoids this problem
> > since the autovacuum data is stored in the database.
>
> What is the status of this? Is it something that will be included in
> 8.1 or 8.0.n? I might be able to patch the current code but that
> doesn't seem like a useful thing to do if a better solution will arrive
> eventually. I am currently running vacuums from a cron job and I think
> I will be happy with that for the time being.

I will post about integrating pg_autovacuum into the backend for 8.1 in
a few minutes.

--
Bruce Momjian | http://candle.pha.pa.us
pgman(at)candle(dot)pha(dot)pa(dot)us | (610) 359-1001
+ If your life is a hard drive, | 13 Roberts Road
+ Christ can be your backup. | Newtown Square, Pennsylvania 19073

From:	Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To:	Phil Endecott <spam_from_postgresql_general(at)chezphil(dot)org>
Cc:	"Matthew T(dot) O'Connor" <matthew(at)zeut(dot)net>, pgsql-general(at)postgresql(dot)org
Subject:	Re: CPU-intensive autovacuuming
Date:	2005-06-07 16:42:03
Message-ID:	9559.1118162523@sss.pgh.pa.us
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

Phil Endecott <spam_from_postgresql_general(at)chezphil(dot)org> writes:
> (Incidentally, I have also found that the indexes on my pg_attributes
> table were taking up over half a gigabyte, which came down to less than
> 40 megs after reindexing them. Is there a case for having autovacuum
> also call reindex?)

Lots of temp tables I suppose? If so that's not autovacuum's fault;
it wasn't getting told about the activity in pg_attribute until this
patch:

2005-03-31 18:20 tgl

* src/backend/postmaster/: pgstat.c (REL7_4_STABLE), pgstat.c
(REL8_0_STABLE), pgstat.c: Flush any remaining statistics counts
out to the collector at process exit. Without this, operations
triggered during backend exit (such as temp table deletions) won't
be counted ... which given heavy usage of temp tables can lead to
pg_autovacuum falling way behind on the need to vacuum pg_class and
pg_attribute. Per reports from Steve Crawford and others.

Unless the bloat occurred after you updated to 8.0.2, there's no issue.

regards, tom lane

From:	"Thomas F(dot) O'Connell" <tfo(at)sitening(dot)com>
To:	Phil Endecott <spam_from_postgresql_general(at)chezphil(dot)org>
Cc:	"Matthew T(dot) O'Connor" <matthew(at)zeut(dot)net>, pgsql-general(at)postgresql(dot)org
Subject:	Re: CPU-intensive autovacuuming
Date:	2005-06-10 03:51:55
Message-ID:	3234B35E-7F14-4734-8E1A-F754DA16D81D@sitening.com
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

Phil,

If you complete this patch, I'm very interested to see it.

I think I'm the person Matthew is talking about who inserted a sleep
value. Because of the sheer number of tables involved, even small
values of sleep caused pg_autovacuum to iterate too slowly over its
table lists to be of use in a production environment (where I still
find its behavior to be preferable to a complicated list of manual
vacuums performed in cron).

--
Thomas F. O'Connell
Co-Founder, Information Architect
Sitening, LLC

Strategic Open Source: Open Your i™

http://www.sitening.com/
110 30th Avenue North, Suite 6
Nashville, TN 37203-6320
615-260-0005

On Jun 7, 2005, at 6:16 AM, Phil Endecott wrote:

> Matthew T. O'Connor wrote:
>
>> Phil Endecott wrote:
>>
>>> > Could it be that there is some code in autovacuum that is O
>>> (n^2) in
>>> > the number of tables?
>>>
>>> Browsing the code using webcvs, I have found this:
>>>
>>> for (j = 0; j < PQntuples(res); j++)
>>> {
>>> tbl_elem = DLGetHead(dbs->table_list);
>>> while (tbl_elem != NULL)
>>> { Have I correctly understood what is going on here?
>>>
>
>
>> Indeed you have. I have head a few similar reports but perhaps
>> none as bad as yours. One person put a small sleep value so that
>> it doesn't spin so tight. You could also just up the sleep delay
>> so that it doesn't do this work quite so often. No other quick
>> suggestions.
>>
>
> I do wonder why autovacuum is keeping its table list in memory
> rather than in the database.
>
> But given that it is keeping it in memory, I think the real fix is
> to sort that list (or keep it ordered when building or updating
> it). It is trivial to also get the query results ordered, and they
> can then be compared in O(n) time.
>
> I notice various other places where there seem to be nested loops,
> e.g. in the update_table_list function. I'm not sure if they can
> be fixed by similar means.
>
> --Phil.

From:	Shelby Cain <alyandon(at)yahoo(dot)com>
To:	"Thomas F(dot) O'Connell" <tfo(at)sitening(dot)com>, Phil Endecott <spam_from_postgresql_general(at)chezphil(dot)org>
Cc:	"Matthew T(dot) O'Connor" <matthew(at)zeut(dot)net>, pgsql-general(at)postgresql(dot)org
Subject:	Re: CPU-intensive autovacuuming
Date:	2005-06-10 14:12:28
Message-ID:	20050610141228.46598.qmail@web50107.mail.yahoo.com
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

--- "Thomas F. O'Connell" <tfo(at)sitening(dot)com> wrote:

> Phil,
>
> If you complete this patch, I'm very interested to see it.
>
> I think I'm the person Matthew is talking about who inserted a sleep
>
> value. Because of the sheer number of tables involved, even small
> values of sleep caused pg_autovacuum to iterate too slowly over its
> table lists to be of use in a production environment (where I still
> find its behavior to be preferable to a complicated list of manual
> vacuums performed in cron).
>
> --
> Thomas F. O'Connell
> Co-Founder, Information Architect
> Sitening, LLC
>

Were you sleeping every time through the loop? How about something
like:

if (j%500 == 1) usleep(100000)

Regards,

Shelby Cain

__________________________________
Discover Yahoo!
Stay in touch with email, IM, photo sharing and more. Check it out!
http://discover.yahoo.com/stayintouch.html

From:	"Thomas F(dot) O'Connell" <tfo(at)sitening(dot)com>
To:	Shelby Cain <alyandon(at)yahoo(dot)com>
Cc:	Phil Endecott <spam_from_postgresql_general(at)chezphil(dot)org>, "Matthew T(dot) O'Connor" <matthew(at)zeut(dot)net>, pgsql-general(at)postgresql(dot)org
Subject:	Re: CPU-intensive autovacuuming
Date:	2005-06-11 01:23:06
Message-ID:	D7880D9B-0571-4416-B9C0-F07C1927E1CE@sitening.com
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

I was usleeping in tiny increments in each iteration of the loop. I
didn't try break it into iterative groups like this.

Honestly, I'd prefer to see pg_autovacuum improved to do O(n) rather
than O(n^2) table activity. At this point, though, I'm probably not
too likely to have much time to hack pg_autovacuum before 8.1 is
released, although if it doesn't become integrated by beta feature
freeze, I might give it a shot.

But I hope if anyone completes the linear improvement, they'll post
to the lists.

--
Thomas F. O'Connell
Co-Founder, Information Architect
Sitening, LLC

Strategic Open Source: Open Your i™

http://www.sitening.com/
110 30th Avenue North, Suite 6
Nashville, TN 37203-6320
615-260-0005

On Jun 10, 2005, at 9:12 AM, Shelby Cain wrote:

> --- "Thomas F. O'Connell" <tfo(at)sitening(dot)com> wrote:
>
>
> Were you sleeping every time through the loop? How about something
> like:
>
> if (j%500 == 1) usleep(100000)
>
> Regards,
>
> Shelby Cain

From:	Hugo <htakada(at)gmail(dot)com>
To:	pgsql-general(at)postgresql(dot)org
Subject:	how to return a result set from a stored procedure
Date:	2005-06-11 12:44:32
Message-ID:	f3d9ba99050611054472e0a000@mail.gmail.com
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

Hi everybody

I am trying to write a stored procedure that returns a result set but it is
not working
this is the function:
///
CREATE OR REPLACE FUNCTION
remisiones.fn_get_total_remitidoxprovision1("numeric")

RETURNS SETOF record AS
$BODY$
begin
select rm.provision as provision,
drm.producto as producto,
sum(drm.cantidad) as cantidad
FROM remisiones.remisiones rm, remisiones.detalles_remision drm
WHERE rm.remision = drm.remision and rm.provision = $1
GROUP BY rm.provision, drm.producto
ORDER BY rm.provision, drm.producto;
end;$BODY$

///
If I call this function from the interactive sql of pgadminIII I get this
result:
select * from fn_gert_total_remitidosxprovision(1)
---------------------------------------------------------------------------
row refcursor
1 <unnamed porta1>

is there a way to display the value of the rows returned, i need it becouse
I need to use it in a Datawindow definition in an Powerbuilder app.

thanks in advance

Hugo

From:	Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To:	"Thomas F(dot) O'Connell" <tfo(at)sitening(dot)com>
Cc:	Shelby Cain <alyandon(at)yahoo(dot)com>, Phil Endecott <spam_from_postgresql_general(at)chezphil(dot)org>, "Matthew T(dot) O'Connor" <matthew(at)zeut(dot)net>, pgsql-general(at)postgresql(dot)org
Subject:	Re: CPU-intensive autovacuuming
Date:	2005-06-11 16:54:12
Message-ID:	11858.1118508852@sss.pgh.pa.us
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

"Thomas F. O'Connell" <tfo(at)sitening(dot)com> writes:
> Honestly, I'd prefer to see pg_autovacuum improved to do O(n) rather
> than O(n^2) table activity. At this point, though, I'm probably not
> too likely to have much time to hack pg_autovacuum before 8.1 is
> released, although if it doesn't become integrated by beta feature
> freeze, I might give it a shot.

This would be vastly easier to fix if the code were integrated into the
backend first. In the backend environment you could just keep the info
in a dynahash.c hashtable instead of in a linear list. On the client
side, you have to roll your own hashing (or adapt dynahash to life
outside the backend environment).

regards, tom lane

From:	Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>
To:	PostgreSQL-development <pgsql-hackers(at)postgreSQL(dot)org>
Cc:	"Thomas F(dot) O'Connell" <tfo(at)sitening(dot)com>, Shelby Cain <alyandon(at)yahoo(dot)com>, Phil Endecott <spam_from_postgresql_general(at)chezphil(dot)org>, "Matthew T(dot) O'Connor" <matthew(at)zeut(dot)net>
Subject:	Autovacuum in the backend
Date:	2005-06-15 01:23:51
Message-ID:	200506150123.j5F1Npa10608@candle.pha.pa.us
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

One goal for 8.1 is to move /contrib/pg_autovacuum in to the backend. I
think it has to be done in four stages:

o move it into the backend and have it start/stop automatically
o move the autovacuum configuration parameters into postgresql.conf
o modify the code to use the backend API for error recovery
o modify the code to use the backend API utilities, like hashes

Who would like to get started on this? It seems pretty straight-forward.

---------------------------------------------------------------------------

Tom Lane wrote:
> "Thomas F. O'Connell" <tfo(at)sitening(dot)com> writes:
> > Honestly, I'd prefer to see pg_autovacuum improved to do O(n) rather
> > than O(n^2) table activity. At this point, though, I'm probably not
> > too likely to have much time to hack pg_autovacuum before 8.1 is
> > released, although if it doesn't become integrated by beta feature
> > freeze, I might give it a shot.
>
> This would be vastly easier to fix if the code were integrated into the
> backend first. In the backend environment you could just keep the info
> in a dynahash.c hashtable instead of in a linear list. On the client
> side, you have to roll your own hashing (or adapt dynahash to life
> outside the backend environment).
>
> regards, tom lane
>
> ---------------------------(end of broadcast)---------------------------
> TIP 8: explain analyze is your friend
>

From:	Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To:	Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>
Cc:	PostgreSQL-development <pgsql-hackers(at)postgresql(dot)org>, "Thomas F(dot) O'Connell" <tfo(at)sitening(dot)com>, Shelby Cain <alyandon(at)yahoo(dot)com>, Phil Endecott <spam_from_postgresql_general(at)chezphil(dot)org>, "Matthew T(dot) O'Connor" <matthew(at)zeut(dot)net>
Subject:	Re: Autovacuum in the backend
Date:	2005-06-15 03:00:49
Message-ID:	27390.1118804449@sss.pgh.pa.us
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us> writes:
> One goal for 8.1 is to move /contrib/pg_autovacuum in to the backend. I
> think it has to be done in four stages:

> o move it into the backend and have it start/stop automatically
> o move the autovacuum configuration parameters into postgresql.conf
> o modify the code to use the backend API for error recovery
> o modify the code to use the backend API utilities, like hashes

> Who would like to get started on this? It seems pretty straight-forward.

A small problem here is that until you get at least to step 3
(backend-standard error handling), none of it is going to be acceptable
to commit. So I don't entirely buy Bruce's notion of bite-size pieces
of work. You can certainly work on it in that fashion, but it's not
going into 8.1 unless most of the above is done by the end of the month.

regards, tom lane

From:	"Qingqing Zhou" <zhouqq(at)cs(dot)toronto(dot)edu>
To:	pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-15 03:23:20
Message-ID:	d8o73a$2397$1@news.hub.org
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

"Bruce Momjian" <pgman(at)candle(dot)pha(dot)pa(dot)us> writes
>
> One goal for 8.1 is to move /contrib/pg_autovacuum in to the backend. I
> think it has to be done in four stages:
>
> o move it into the backend and have it start/stop automatically

The start/stop routine is quite like Bgwriter. It requires PgStats to be
turned on. If it aborts unexpectedly, hopefully we could restart it. Shall
we have a RequestVacuum() to pass the control to this process so to avoid
possible redundant vacuums from user side?

> o move the autovacuum configuration parameters into postgresql.conf

There are some correlations of GUC parameters in order to incorporate it:
* stats_start_collector = true
* stats_row_level = true

add a parameter to let user pass in the configuration parameters:
* autovacuum_command = "-s 100 -S 1 ..."

So if autovacuum_command is given, we will automatically set the upper two
parameters true?

> o modify the code to use the backend API for error recovery
> o modify the code to use the backend API utilities, like hashes

Change "connect/disconnect server" to "start/stop autovacuum process";
Change "execute query" to "backend APIs";
Change "list" to "hash";
Need think more to handle various error conditions ...

>
> Who would like to get started on this? It seems pretty straight-forward.
>

I'd like to give it a try.

Regards,
Qingqing

From:	"Qingqing Zhou" <zhouqq(at)cs(dot)toronto(dot)edu>
To:	pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-15 03:26:51
Message-ID:	d8o79t$23t0$1@news.hub.org
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

"Tom Lane" <tgl(at)sss(dot)pgh(dot)pa(dot)us> writes
>
> A small problem here is that until you get at least to step 3
> (backend-standard error handling), none of it is going to be acceptable
> to commit. So I don't entirely buy Bruce's notion of bite-size pieces
> of work. You can certainly work on it in that fashion, but it's not
> going into 8.1 unless most of the above is done by the end of the month.
>

Scared ...

Regards,
Qingqing

From:	Neil Conway <neilc(at)samurai(dot)com>
To:	Qingqing Zhou <zhouqq(at)cs(dot)toronto(dot)edu>
Cc:	pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-15 05:26:56
Message-ID:	42AFBC20.1060304@samurai.com
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

Qingqing Zhou wrote:
> The start/stop routine is quite like Bgwriter. It requires PgStats to be
> turned on.

Wasn't the plan to rewrite pg_autovacuum to use the FSM rather than the
stats collector?

-Neil

From:	"Qingqing Zhou" <zhouqq(at)cs(dot)toronto(dot)edu>
To:	pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-15 05:41:52
Message-ID:	d8of71$2u5p$1@news.hub.org
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

"Neil Conway" <neilc(at)samurai(dot)com> writes
>
> Wasn't the plan to rewrite pg_autovacuum to use the FSM rather than the
> stats collector?
>

I don't understand. Currently the basic logic of pg_autovacuum is to use the
pg_stat_all_tables numbers like n_tup_upd, n_tup_del to determine if a
relation need to be vacuumed. How to use FSM to get these information?

Regards,
Qingqing

From:	Hannu Krosing <hannu(at)skype(dot)net>
To:	Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>
Cc:	PostgreSQL-development <pgsql-hackers(at)postgreSQL(dot)org>, "Thomas F(dot) O'Connell" <tfo(at)sitening(dot)com>, Shelby Cain <alyandon(at)yahoo(dot)com>, Phil Endecott <spam_from_postgresql_general(at)chezphil(dot)org>, "Matthew T(dot) O'Connor" <matthew(at)zeut(dot)net>
Subject:	Re: Autovacuum in the backend
Date:	2005-06-15 08:50:42
Message-ID:	1118825442.4844.20.camel@fuji.krosing.net
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

On T, 2005-06-14 at 21:23 -0400, Bruce Momjian wrote:
> One goal for 8.1 is to move /contrib/pg_autovacuum in to the backend. I
> think it has to be done in four stages:
>
> o move it into the backend and have it start/stop automatically
> o move the autovacuum configuration parameters into postgresql.conf
> o modify the code to use the backend API for error recovery
> o modify the code to use the backend API utilities, like hashes
>
> Who would like to get started on this? It seems pretty straight-forward.

Can autovacuum yet be configured _not_ to run vacuum during some hours
or above some load ?

Even better - to stop or pause a long-running vacuum if load goes above
some limit.

If it goes into backend before the above is done, it should at least be
possible to switch it off completely.

--
Hannu Krosing <hannu(at)skype(dot)net>

From:	Hannu Krosing <hannu(at)skype(dot)net>
To:	Qingqing Zhou <zhouqq(at)cs(dot)toronto(dot)edu>
Cc:	pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-15 08:56:09
Message-ID:	1118825769.4844.25.camel@fuji.krosing.net
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

On K, 2005-06-15 at 13:41 +0800, Qingqing Zhou wrote:
> "Neil Conway" <neilc(at)samurai(dot)com> writes
> >
> > Wasn't the plan to rewrite pg_autovacuum to use the FSM rather than the
> > stats collector?
> >
>
> I don't understand. Currently the basic logic of pg_autovacuum is to use the
> pg_stat_all_tables numbers like n_tup_upd, n_tup_del to determine if a
> relation need to be vacuumed. How to use FSM to get these information?

One can't probably use FSM as it is, as FSM is filled in by vacuum and
this creates a circular dependency.

But it would be very nice to have something _similar_ to FSM, say DSM
(dead space map), which is filled in when a tuple is marked as "dead for
all running backends", which could be used to implement a vacuum which
vacuums only those pages, which do actually contain removable tuples.

--
Hannu Krosing <hannu(at)skype(dot)net>

From:	"Qingqing Zhou" <zhouqq(at)cs(dot)toronto(dot)edu>
To:	pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-15 09:04:46
Message-ID:	d8or3d$1bgq$1@news.hub.org
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

"Qingqing Zhou" <zhouqq(at)cs(dot)toronto(dot)edu> writes
>
> The start/stop routine is quite like Bgwriter.
>

I just realized that the non-standard backend can't execute any SQL command.
If so, how would the background pg_autovacuum do "vacuum"? It could be
forked more like a standard backend, but this is obviously not a good idea,
since we don't know which database this process will sit on.

A possible solution is that backgroud pg_autovacuum could fork another
process to connect to postmaster as an ordinary backend each time it feels
that a "vacuum" is needed.

Any ideas?

Regards,
Qingqing

From:	Roman Neuhauser <neuhauser(at)sigpipe(dot)cz>
To:	Hugo <htakada(at)gmail(dot)com>
Cc:	pgsql-general(at)postgresql(dot)org
Subject:	Re: how to return a result set from a stored procedure
Date:	2005-06-15 09:43:58
Message-ID:	20050615094358.GD76447@isis.sigpipe.cz
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

# htakada(at)gmail(dot)com / 2005-06-11 08:44:32 -0400:
> Hi everybody
>
> I am trying to write a stored procedure that returns a result set but it is
> not working
> this is the function:
> ///
> CREATE OR REPLACE FUNCTION
> remisiones.fn_get_total_remitidoxprovision1("numeric")
>
> RETURNS SETOF record AS
> $BODY$
> begin
> select rm.provision as provision,
> drm.producto as producto,
> sum(drm.cantidad) as cantidad
> FROM remisiones.remisiones rm, remisiones.detalles_remision drm
> WHERE rm.remision = drm.remision and rm.provision = $1
> GROUP BY rm.provision, drm.producto
> ORDER BY rm.provision, drm.producto;
> end;$BODY$
>
> ///

http://www.postgresql.org/docs/current/static/plpgsql-control-structures.html#AEN32875

--
How many Vietnam vets does it take to screw in a light bulb?
You don't know, man. You don't KNOW.
Cause you weren't THERE. http://bash.org/?255991

From:	Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>
To:	Hannu Krosing <hannu(at)skype(dot)net>
Cc:	Qingqing Zhou <zhouqq(at)cs(dot)toronto(dot)edu>, pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-15 12:49:22
Message-ID:	200506151249.j5FCnMq14570@candle.pha.pa.us
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

Hannu Krosing wrote:
> On K, 2005-06-15 at 13:41 +0800, Qingqing Zhou wrote:
> > "Neil Conway" <neilc(at)samurai(dot)com> writes
> > >
> > > Wasn't the plan to rewrite pg_autovacuum to use the FSM rather than the
> > > stats collector?
> > >
> >
> > I don't understand. Currently the basic logic of pg_autovacuum is to use the
> > pg_stat_all_tables numbers like n_tup_upd, n_tup_del to determine if a
> > relation need to be vacuumed. How to use FSM to get these information?
>
> One can't probably use FSM as it is, as FSM is filled in by vacuum and
> this creates a circular dependency.
>
> But it would be very nice to have something _similar_ to FSM, say DSM
> (dead space map), which is filled in when a tuple is marked as "dead for
> all running backends", which could be used to implement a vacuum which
> vacuums only those pages, which do actually contain removable tuples.

Yes, those are step five. The TODO list has:

* Auto-vacuum
o Move into the backend code
o Scan the buffer cache to find free space or use background writer
o Use free-space map information to guide refilling
o Do VACUUM FULL if table is nearly empty?

From:	Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>
To:	Qingqing Zhou <zhouqq(at)cs(dot)toronto(dot)edu>
Cc:	pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-15 14:04:22
Message-ID:	200506151404.j5FE4Mb09030@candle.pha.pa.us
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

I am going to start working on it. I am concerned it is a big job.

I will post questions as I find them, and the one below is a good one.

---------------------------------------------------------------------------

Qingqing Zhou wrote:
>
> "Qingqing Zhou" <zhouqq(at)cs(dot)toronto(dot)edu> writes
> >
> > The start/stop routine is quite like Bgwriter.
> >
>
> I just realized that the non-standard backend can't execute any SQL command.
> If so, how would the background pg_autovacuum do "vacuum"? It could be
> forked more like a standard backend, but this is obviously not a good idea,
> since we don't know which database this process will sit on.
>
> A possible solution is that backgroud pg_autovacuum could fork another
> process to connect to postmaster as an ordinary backend each time it feels
> that a "vacuum" is needed.
>
> Any ideas?
>
> Regards,
> Qingqing
>
>
>
> ---------------------------(end of broadcast)---------------------------
> TIP 5: Have you checked our extensive FAQ?
>
> http://www.postgresql.org/docs/faq
>

From:	Alvaro Herrera <alvherre(at)surnet(dot)cl>
To:	Qingqing Zhou <zhouqq(at)cs(dot)toronto(dot)edu>
Cc:	pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-15 14:05:51
Message-ID:	20050615140550.GA9440@surnet.cl
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

On Wed, Jun 15, 2005 at 11:23:20AM +0800, Qingqing Zhou wrote:

> > Who would like to get started on this? It seems pretty straight-forward.
>
> I'd like to give it a try.

I'm on it. I have Matthew's patch, updated to current sources, and I'm
working on cleaning it up to address all known concerns. I expect to be
able to have something for patches early next week, which can be
discussed.

--
Alvaro Herrera (<alvherre[a]surnet.cl>)
"Postgres is bloatware by design: it was built to house
PhD theses." (Joey Hellerstein, SIGMOD annual conference 2002)

From:	Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>
To:	Alvaro Herrera <alvherre(at)surnet(dot)cl>
Cc:	Qingqing Zhou <zhouqq(at)cs(dot)toronto(dot)edu>, pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-15 14:10:40
Message-ID:	200506151410.j5FEAeE09800@candle.pha.pa.us
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

Alvaro Herrera wrote:
> On Wed, Jun 15, 2005 at 11:23:20AM +0800, Qingqing Zhou wrote:
>
> > > Who would like to get started on this? It seems pretty straight-forward.
> >
> > I'd like to give it a try.
>
> I'm on it. I have Matthew's patch, updated to current sources, and I'm
> working on cleaning it up to address all known concerns. I expect to be
> able to have something for patches early next week, which can be
> discussed.

Oh, excellent. Thanks. Please look at the patch I just applied to
pg_autovacuum today and merge that into what you have. Great!

From:	Josh Berkus <josh(at)agliodbs(dot)com>
To:	"Qingqing Zhou" <zhouqq(at)cs(dot)toronto(dot)edu>
Cc:	pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-15 17:01:31
Message-ID:	200506151001.31369.josh@agliodbs.com
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

Qingqing,

> add a parameter to let user pass in the configuration parameters:
> * autovacuum_command = "-s 100 -S 1 ..."

um, can we have these as separate GUCs and not lumped together as a string?
i.e.:
autovacuum_frequency = 60 #seconds, 0 = disable
autovacuum_vacuum_threshold = 200
autovacuum_vacuum_multiple = 0.5
autovacuum_analyze_threshold = 100
autovacuum_analyze_multiple = 0.4

AV should be disabled by default. It should also automatically use the global
vacuum_delay settings.

> But it would be very nice to have something _similar_ to FSM, say DSM
> (dead space map), which is filled in when a tuple is marked as "dead for
> all running backends", which could be used to implement a vacuum which
> vacuums only those pages, which do actually contain removable tuples.

Speaking of FSM improvements, it would be **really** useful to have a pg_stats
view that let you know how full the FSM was, overall. something like:
pg_stats_fsm_usage
fsm_relations fsm_relations_used fsm_pages fsm_pages_used
1000 312 200000 11579

This would allow for other schemes of vacuum automation.

--
Josh Berkus
Aglio Database Solutions
San Francisco

From:	"Matthew T(dot) O'Connor" <matthew(at)zeut(dot)net>
To:	Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>
Cc:	Hannu Krosing <hannu(at)skype(dot)net>, Qingqing Zhou <zhouqq(at)cs(dot)toronto(dot)edu>, pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-15 19:45:00
Message-ID:	42B0853C.5080904@zeut.net
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

Bruce Momjian wrote:

>Hannu Krosing wrote:
>
>
>>On K, 2005-06-15 at 13:41 +0800, Qingqing Zhou wrote:
>>
>>
>>>"Neil Conway" <neilc(at)samurai(dot)com> writes
>>>
>>>
>>>>Wasn't the plan to rewrite pg_autovacuum to use the FSM rather than the
>>>>stats collector?
>>>>
>>>>
>Yes, those are step five. The TODO list has:
>
> * Auto-vacuum
> o Move into the backend code
> o Scan the buffer cache to find free space or use background writer
> o Use free-space map information to guide refilling
> o Do VACUUM FULL if table is nearly empty?
>

I think the two can be used in conjunction with one another, and perhaps
one day it could be used with or without the stats system. Integrating
FSM information has to be helpful, but it also isn't going to tell us
when to do an analyze due to lots of inserts, also the FSM (if not big
enough) is lossy and might not be tracking all the tables. So I think
for 8.1 if it only used stats that would be OK.

From:	"Matthew T(dot) O'Connor" <matthew(at)zeut(dot)net>
To:	Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>
Cc:	Qingqing Zhou <zhouqq(at)cs(dot)toronto(dot)edu>, pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-15 19:48:24
Message-ID:	42B08608.60909@zeut.net
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

Bruce Momjian wrote:

>I am going to start working on it. I am concerned it is a big job.
>
>I will post questions as I find them, and the one below is a good one.
>
>

It is a big question, at least it's the main stumbling block I had, and
it's the one that kept my work from being integrated into 8.0 (which
side stepped the issue by using libpq to connect to the server to fire
off commands).

>---------------------------------------------------------------------------
>
>Qingqing Zhou wrote:
>
>
>>I just realized that the non-standard backend can't execute any SQL command.
>>If so, how would the background pg_autovacuum do "vacuum"? It could be
>>forked more like a standard backend, but this is obviously not a good idea,
>>since we don't know which database this process will sit on.
>>
>>A possible solution is that backgroud pg_autovacuum could fork another
>>process to connect to postmaster as an ordinary backend each time it feels
>>that a "vacuum" is needed.
>>

From:	"Matthew T(dot) O'Connor" <matthew(at)zeut(dot)net>
To:	Josh Berkus <josh(at)agliodbs(dot)com>
Cc:	Qingqing Zhou <zhouqq(at)cs(dot)toronto(dot)edu>, pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-15 19:55:55
Message-ID:	42B087CB.1010401@zeut.net
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

Josh Berkus wrote:

>Qingqing,
>
>
>>add a parameter to let user pass in the configuration parameters:
>>* autovacuum_command = "-s 100 -S 1 ..."
>>
>>
>
>um, can we have these as separate GUCs and not lumped together as a string?
>i.e.:
>autovacuum_frequency = 60 #seconds, 0 = disable
>autovacuum_vacuum_threshold = 200
>autovacuum_vacuum_multiple = 0.5
>autovacuum_analyze_threshold = 100
>autovacuum_analyze_multiple = 0.4
>
>AV should be disabled by default. It should also automatically use the global
>vacuum_delay settings.
>
>

Agreed, disabled by default (at least for 8.1, perhaps a topic of
conversation for 8.2+), yes it should obey the global vacuum_delay
settings, and yes it should have it's own GUC's as you suggested (all of
this was the case with the patch that I submitted for 8.0, which Alvarro
is now working on).

>>But it would be very nice to have something _similar_ to FSM, say DSM
>>(dead space map), which is filled in when a tuple is marked as "dead for
>>all running backends", which could be used to implement a vacuum which
>>vacuums only those pages, which do actually contain removable tuples.
>>
>>
>
>Speaking of FSM improvements, it would be **really** useful to have a pg_stats
>view that let you know how full the FSM was, overall. something like:
>pg_stats_fsm_usage
>fsm_relations fsm_relations_used fsm_pages fsm_pages_used
>1000 312 200000 11579
>
>This would allow for other schemes of vacuum automation.
>
>

Interesting, perhaps if FSM data is exported to the stats system
autovacuum could use that. What might be best is both a view that
showed overall FSM information, but then also export FSM information on
a per table basis, perhaps as additional columns added to existing stats
tables.

From:	"Matthew T(dot) O'Connor" <matthew(at)zeut(dot)net>
To:	Hannu Krosing <hannu(at)skype(dot)net>
Cc:	Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>, PostgreSQL-development <pgsql-hackers(at)postgreSQL(dot)org>, "Thomas F(dot) O'Connell" <tfo(at)sitening(dot)com>, Shelby Cain <alyandon(at)yahoo(dot)com>, Phil Endecott <spam_from_postgresql_general(at)chezphil(dot)org>
Subject:	Re: Autovacuum in the backend
Date:	2005-06-15 20:02:34
Message-ID:	42B0895A.6030607@zeut.net
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

Hannu Krosing wrote:

>Can autovacuum yet be configured _not_ to run vacuum during some hours
>or above some load ?
>
>

That is certainly a goal, hopefully it will get done for 8.1. The
actual design I had in mind (based on prior discussion on hackers) is to
allow a maintenance window that would have lower vacuum thresholds, this
way only the tables that really need it will get vacuumed during the day.

>Even better - to stop or pause a long-running vacuum if load goes above
>some limit.
>
>

I don't think the current implementation if VACUUM can support that. I
believe that all the work will get rolled back if gets canceled.

Perhaps a decent solution would be to have autovacuum increase the
vacuum delay settings dynamically based on system load average. That
way if a vacuum starts and the system starts to get busy, the autoavcuum
daemon can increase the vacuum delay settings and VACUUM would honor
this while running.

>If it goes into backend before the above is done, it should at least be
>possible to switch it off completely.
>
>

Absolutely, in fact it will not only have the option to turn it off, it
will be off by default.

From:	Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>
To:	"Matthew T(dot) O'Connor" <matthew(at)zeut(dot)net>
Cc:	Hannu Krosing <hannu(at)skype(dot)net>, PostgreSQL-development <pgsql-hackers(at)postgresql(dot)org>, "Thomas F(dot) O'Connell" <tfo(at)sitening(dot)com>, Shelby Cain <alyandon(at)yahoo(dot)com>, Phil Endecott <spam_from_postgresql_general(at)chezphil(dot)org>
Subject:	Re: Autovacuum in the backend
Date:	2005-06-15 20:21:30
Message-ID:	200506152021.j5FKLUE05388@candle.pha.pa.us
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

Matthew T. O'Connor wrote:
> I don't think the current implementation if VACUUM can support that. I
> believe that all the work will get rolled back if gets canceled.
>
> Perhaps a decent solution would be to have autovacuum increase the
> vacuum delay settings dynamically based on system load average. That
> way if a vacuum starts and the system starts to get busy, the autoavcuum
> daemon can increase the vacuum delay settings and VACUUM would honor
> this while running.

I would like to have the GUC variables be honored while the system is
running, and that would all administrators to make changes from scripts.

From:	Gavin Sherry <swm(at)linuxworld(dot)com(dot)au>
To:	Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>
Cc:	pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-16 01:07:31
Message-ID:	Pine.LNX.4.58.0506161039220.18538@linuxworld.com.au
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

On Wed, 15 Jun 2005, Bruce Momjian wrote:

>
> I am going to start working on it. I am concerned it is a big job.
>
> I will post questions as I find them, and the one below is a good one.
>

I'm wondering if effort is being misdirected here. I remember when Mark
Wong at OSDL was running pg_autovacuum during a dbt run, he was seeing
significant performance loss -- I think on the order of 30% to 40% (I will
try and dig up a link to the results).

I think these results can be dramatically improved if the focus is on a
more effective vacuum.

In January I was in Toronto with Jan, Tom and others and some ideas about
vacuum were being discussed. The basic idea is that when we dirty pages we
need we set a bit in a bitmap to say that the page has been dirty. A
convenient place to do this is when we are writing dirty buffers out to
disk. In many situations, this can happen inside the bgwriter meaning that
there should be little contention for this bitmap. Of course, individual
backends may be writing pages out and would have to account for the
dirty pages at that point.

Now this bitmap can occur on a per heap segment basis (ie, per 1 GB heap
file). You only need 2 pages for the bitmap to represent all the pages in
the segment, which is fairly nice. When vacuum is run, instead of visiting
every page, it would see which pages have been dirtied in the bitmap and
visit only pages. With large tables and small numbers of modified
tuples/pages, the effect this change would have would be pretty
impressive.

This also means that we could effectively implement some of the ideas
which are being floated around, such as having vacuum run only for a short
time period.

One problem is whether or not we have to guarantee that we account for
every dirtied page. I think that would be difficult in the presence of a
crash. One idea Neil mentioned is that on a crash, we could set all pages
in the bitmap to dirty and the first vacuum would effectively be a vacuum
full. The alternative is to say that we don't guarantee that this type of
vacuum is completely comprehensive and that it isn't a replacement for
vacuum full.

Thoughts? Comments?

Thanks,

Gavin

From:	Christopher Kings-Lynne <chriskl(at)familyhealth(dot)com(dot)au>
To:	"Matthew T(dot) O'Connor" <matthew(at)zeut(dot)net>
Cc:	Josh Berkus <josh(at)agliodbs(dot)com>, Qingqing Zhou <zhouqq(at)cs(dot)toronto(dot)edu>, pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-16 01:26:58
Message-ID:	42B0D562.5070709@familyhealth.com.au
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

>> um, can we have these as separate GUCs and not lumped together as a
>> string? i.e.:
>> autovacuum_frequency = 60 #seconds, 0 = disable
>> autovacuum_vacuum_threshold = 200
>> autovacuum_vacuum_multiple = 0.5
>> autovacuum_analyze_threshold = 100
>> autovacuum_analyze_multiple = 0.4
>>
>> AV should be disabled by default. It should also automatically use
>> the global vacuum_delay settings.
>
> Agreed, disabled by default (at least for 8.1, perhaps a topic of
> conversation for 8.2+), yes it should obey the global vacuum_delay
> settings, and yes it should have it's own GUC's as you suggested (all of
> this was the case with the patch that I submitted for 8.0, which Alvarro
> is now working on).

I think it should be on by default :)

Let's not ship software in a default configuration that we KNOW will go
to hell.

How about we aim to make that if someone doesn't modify their
postgresql.conf, they will have no problems. Wasn't that the aim of
defaulting shared_buffers to 1000 if we can?

Chris

From:	Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>
To:	Gavin Sherry <swm(at)linuxworld(dot)com(dot)au>
Cc:	pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-16 01:45:08
Message-ID:	200506160145.j5G1j9B07088@candle.pha.pa.us
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

I totally agree. I think we know pg_autovacuum is just a short-term
solution. What we need is for someone to take the lead in this.

The TODO list has the ideas documented. There is no reason the
background writer could not load the FSM directly with free pages.

---------------------------------------------------------------------------

Gavin Sherry wrote:
> On Wed, 15 Jun 2005, Bruce Momjian wrote:
>
> >
> > I am going to start working on it. I am concerned it is a big job.
> >
> > I will post questions as I find them, and the one below is a good one.
> >
>
> I'm wondering if effort is being misdirected here. I remember when Mark
> Wong at OSDL was running pg_autovacuum during a dbt run, he was seeing
> significant performance loss -- I think on the order of 30% to 40% (I will
> try and dig up a link to the results).
>
> I think these results can be dramatically improved if the focus is on a
> more effective vacuum.
>
> In January I was in Toronto with Jan, Tom and others and some ideas about
> vacuum were being discussed. The basic idea is that when we dirty pages we
> need we set a bit in a bitmap to say that the page has been dirty. A
> convenient place to do this is when we are writing dirty buffers out to
> disk. In many situations, this can happen inside the bgwriter meaning that
> there should be little contention for this bitmap. Of course, individual
> backends may be writing pages out and would have to account for the
> dirty pages at that point.
>
> Now this bitmap can occur on a per heap segment basis (ie, per 1 GB heap
> file). You only need 2 pages for the bitmap to represent all the pages in
> the segment, which is fairly nice. When vacuum is run, instead of visiting
> every page, it would see which pages have been dirtied in the bitmap and
> visit only pages. With large tables and small numbers of modified
> tuples/pages, the effect this change would have would be pretty
> impressive.
>
> This also means that we could effectively implement some of the ideas
> which are being floated around, such as having vacuum run only for a short
> time period.
>
> One problem is whether or not we have to guarantee that we account for
> every dirtied page. I think that would be difficult in the presence of a
> crash. One idea Neil mentioned is that on a crash, we could set all pages
> in the bitmap to dirty and the first vacuum would effectively be a vacuum
> full. The alternative is to say that we don't guarantee that this type of
> vacuum is completely comprehensive and that it isn't a replacement for
> vacuum full.
>
> Thoughts? Comments?
>
> Thanks,
>
> Gavin
>
> ---------------------------(end of broadcast)---------------------------
> TIP 8: explain analyze is your friend
>

From:	Alvaro Herrera <alvherre(at)surnet(dot)cl>
To:	"Matthew T(dot) O'Connor" <matthew(at)zeut(dot)net>
Cc:	Josh Berkus <josh(at)agliodbs(dot)com>, Qingqing Zhou <zhouqq(at)cs(dot)toronto(dot)edu>, pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-16 01:50:05
Message-ID:	20050616015005.GA14001@surnet.cl
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

A question for interested parties. I'm thinking in handling the
user/password issue by reading the flat files (the copies of pg_shadow,
pg_database, etc).

The only thing that I'd need to modify is add the datdba field to
pg_database, so we can figure out an appropiate user for vacuuming each
database.

What do people think?

--
Alvaro Herrera (<alvherre[a]surnet.cl>)
"Acepta los honores y aplausos y perderás tu libertad"

From:	Alvaro Herrera <alvherre(at)surnet(dot)cl>
To:	Gavin Sherry <swm(at)linuxworld(dot)com(dot)au>
Cc:	Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>, pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-16 02:54:58
Message-ID:	20050616025457.GA14236@surnet.cl
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

On Thu, Jun 16, 2005 at 11:07:31AM +1000, Gavin Sherry wrote:
> On Wed, 15 Jun 2005, Bruce Momjian wrote:
>
> > I am going to start working on it. I am concerned it is a big job.
> >
> > I will post questions as I find them, and the one below is a good one.
>
> I'm wondering if effort is being misdirected here. I remember when Mark
> Wong at OSDL was running pg_autovacuum during a dbt run, he was seeing
> significant performance loss -- I think on the order of 30% to 40% (I will
> try and dig up a link to the results).

I think those are orthogonal issues. One is fixing whatever performance
issues there are because of VACUUM. Note that the fact that Mark was
having such a drop in performance with autovacuum does only mean that
at the enormous load under which the OSDL tests are run, autovacuum is
not the best solution. Not everybody runs with that sort of load
anyway. (In fact lots of people don't.)

So, one issue is that at high loads, there are improvements to be made
to VACUUM. The other issue is to get VACUUM to run in the first place,
which is what autovacuum addresses.

I can easily predict that we will make adjustments and improvements to
VACUUM in the future, but I'm not so sure if it will happen before 8.1
feature-freezes. I have more confidence that we can integrate
autovacuum for 8.1, which will be a leap forward.

--
Alvaro Herrera (<alvherre[a]surnet.cl>)
"Pido que me den el Nobel por razones humanitarias" (Nicanor Parra)

From:	"Matthew T(dot) O'Connor" <matthew(at)zeut(dot)net>
To:	Alvaro Herrera <alvherre(at)surnet(dot)cl>
Cc:	Josh Berkus <josh(at)agliodbs(dot)com>, Qingqing Zhou <zhouqq(at)cs(dot)toronto(dot)edu>, pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-16 03:42:17
Message-ID:	42B0F519.30604@zeut.net
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

Alvaro Herrera wrote:

>A question for interested parties. I'm thinking in handling the
>user/password issue by reading the flat files (the copies of pg_shadow,
>pg_database, etc).
>
>The only thing that I'd need to modify is add the datdba field to
>pg_database, so we can figure out an appropiate user for vacuuming each
>database.
>
>What do people think?
>
>

I probably don't understand all the issue involved here but reading
pg_shadow by hand seems problematic. Do you constantly re-read it?
What happens when a new user is added etc....

Can't autovacuum run as a super-user that can vacuum anything?

From:	Russell Smith <mr-russ(at)pws(dot)com(dot)au>
To:	Alvaro Herrera <alvherre(at)surnet(dot)cl>
Cc:	Gavin Sherry <swm(at)linuxworld(dot)com(dot)au>, Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>, pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-16 03:45:04
Message-ID:	200506161345.05404.mr-russ@pws.com.au
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

On Thu, 16 Jun 2005 12:54 pm, Alvaro Herrera wrote:
> On Thu, Jun 16, 2005 at 11:07:31AM +1000, Gavin Sherry wrote:
> > On Wed, 15 Jun 2005, Bruce Momjian wrote:
> >
> > > I am going to start working on it. I am concerned it is a big job.
> > >
> > > I will post questions as I find them, and the one below is a good one.
> >
> > I'm wondering if effort is being misdirected here. I remember when Mark
> > Wong at OSDL was running pg_autovacuum during a dbt run, he was seeing
> > significant performance loss -- I think on the order of 30% to 40% (I will
> > try and dig up a link to the results).
>
> I think those are orthogonal issues. One is fixing whatever performance
> issues there are because of VACUUM. Note that the fact that Mark was
> having such a drop in performance with autovacuum does only mean that
> at the enormous load under which the OSDL tests are run, autovacuum is
> not the best solution. Not everybody runs with that sort of load
> anyway. (In fact lots of people don't.)

I agree here. There have been a couple of patches for improvements to VACUUM rejected in the past.
EG, partial vacuum. If we have autovacuum in the backend, it doesn't matter about people
vacuuming the wrong part of a file. The system will manage it.

I'd also say there are a much greater number of people who will be able to fiddle with
an implemented autovac to improve its performance and load. However I think there
are less people who can complete what Alvaro is doing.

> So, one issue is that at high loads, there are improvements to be made
> to VACUUM. The other issue is to get VACUUM to run in the first place,
> which is what autovacuum addresses.

There are plenty of ideas to shoot around here. Like
- only run one iteration of a vacuum so you only clean indexes once, then stop the vacuum till the next cycle.
- Create the dead space man stuff with the bgwriter
- Make sure you have individual table analyze and vacuum stats so vacuum can be flexible to different tables.

Some of the autovac issues we have seen recently like O(n^2) with tables will go away by being in the backend.
So not everything will perform the same after the integration.

>
> I can easily predict that we will make adjustments and improvements to
> VACUUM in the future, but I'm not so sure if it will happen before 8.1
> feature-freezes. I have more confidence that we can integrate
> autovacuum for 8.1, which will be a leap forward.
>
The big leap will be to get it in the backend, which will as Chris KL suggested stop people shooting themselves in the foot.
I'm not sure what strict rules are imposed by Feature freeze, but there may be time for others to make some improvements before 8.1.
We have also looked at this for at least 2 releases now. If it doesn't get in now, it will just get in for 8.2 and no improvements till 8.2.

Regards

Russell Smith

From:	Josh Berkus <josh(at)agliodbs(dot)com>
To:	Gavin Sherry <swm(at)linuxworld(dot)com(dot)au>
Cc:	Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>, pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-16 03:45:45
Message-ID:	200506152045.46130.josh@agliodbs.com
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

Gavin, People,

> I'm wondering if effort is being misdirected here. I remember when Mark
> Wong at OSDL was running pg_autovacuum during a dbt run, he was seeing
> significant performance loss -- I think on the order of 30% to 40% (I will
> try and dig up a link to the results).

It wasn't quite that bad, and the automated DBT2 is deceptive; the test
doesn't run for long enough for *not* vacuuming to be a problem. For a real
test, you'd need to do a 24-hour, or 48-hour DBT2 run.

Not that I don't agree that we need a less I/O intense alternative to VACUUM,
but it seems unlikely that we could actually do this, or even agree on a
spec, before feature freeze. Wheras integrated AV is something we *could*
do, and is widely desired.

If we do integrated AV, it should only be turned on by default at a relatively
low level. And wasn't there an issue on Windows with AV not working?

--
Josh Berkus
Aglio Database Solutions
San Francisco

From:	Alvaro Herrera <alvherre(at)surnet(dot)cl>
To:	"Matthew T(dot) O'Connor" <matthew(at)zeut(dot)net>
Cc:	Josh Berkus <josh(at)agliodbs(dot)com>, Qingqing Zhou <zhouqq(at)cs(dot)toronto(dot)edu>, pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-16 03:55:47
Message-ID:	20050616035547.GA14519@surnet.cl
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

On Wed, Jun 15, 2005 at 11:42:17PM -0400, Matthew T. O'Connor wrote:
> Alvaro Herrera wrote:
>
> >A question for interested parties. I'm thinking in handling the
> >user/password issue by reading the flat files (the copies of pg_shadow,
> >pg_database, etc).
> >
> >The only thing that I'd need to modify is add the datdba field to
> >pg_database, so we can figure out an appropiate user for vacuuming each
> >database.
>
> I probably don't understand all the issue involved here but reading
> pg_shadow by hand seems problematic. Do you constantly re-read it?
> What happens when a new user is added etc....

You don't read the pg_shadow table. Rather, you read the pg_user file,
which is a plain-text file representing the information in pg_shadow.
It's kept up to date by backends that modify user information. Likewise
for pg_database and pg_group.

> Can't autovacuum run as a super-user that can vacuum anything?

That'd be another way to do it, maybe simpler.

Currently I'm working on separating this in two parts though, one being
a shlib and other the standard postmaster-launched backend process. So
I don't have to address this issue right now. It just bothered me to
need a separate file with username and password, and the corresponding
code to read it.

One issue I do have to deal with right now is how many autovacuum
processes do we want to be running. The current approach is to have one
autovacuum process. Two possible options would be to have one per
database, and one per tablespace. What do people think?

I'm leaning for the simpler option myself but I'd like to hear more
opinions. Particularly since one-per-database makes the code a lot
simpler as far as I can see, because the shlib only needs to worry about
issuing VACUUM commands; with the other approaches, the shlib has to
manage everything (keep the pg_autovacuum table up to date, figuring out
when vacuums are needed, etc.)

The main problem with the one-per-database is that we wouldn't have a
(simple) way of coordinating vacuums so that they don't compete for I/O.
That's why I thought of the one-per-tablespace approach, though that one
is the most complex of all.

--
Alvaro Herrera (<alvherre[a]surnet.cl>)
"Un poeta es un mundo encerrado en un hombre" (Victor Hugo)

From:	Gavin Sherry <swm(at)linuxworld(dot)com(dot)au>
To:	Alvaro Herrera <alvherre(at)surnet(dot)cl>
Cc:	Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>, pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-16 03:56:41
Message-ID:	Pine.LNX.4.58.0506161332490.19372@linuxworld.com.au
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

On Wed, 15 Jun 2005, Alvaro Herrera wrote:

> On Thu, Jun 16, 2005 at 11:07:31AM +1000, Gavin Sherry wrote:
> > On Wed, 15 Jun 2005, Bruce Momjian wrote:
> >
> > > I am going to start working on it. I am concerned it is a big job.
> > >
> > > I will post questions as I find them, and the one below is a good one.
> >
> > I'm wondering if effort is being misdirected here. I remember when Mark
> > Wong at OSDL was running pg_autovacuum during a dbt run, he was seeing
> > significant performance loss -- I think on the order of 30% to 40% (I will
> > try and dig up a link to the results).
>
> I think those are orthogonal issues. One is fixing whatever performance
> issues there are because of VACUUM. Note that the fact that Mark was
> having such a drop in performance with autovacuum does only mean that
> at the enormous load under which the OSDL tests are run, autovacuum is
> not the best solution. Not everybody runs with that sort of load
> anyway. (In fact lots of people don't.)

I agree.

> So, one issue is that at high loads, there are improvements to be made
> to VACUUM. The other issue is to get VACUUM to run in the first place,
> which is what autovacuum addresses.
>
> I can easily predict that we will make adjustments and improvements to
> VACUUM in the future, but I'm not so sure if it will happen before 8.1
> feature-freezes. I have more confidence that we can integrate
> autovacuum for 8.1, which will be a leap forward.

I guess my main concern is that we'll have a solution to the problem of
dead tuples which is only half way there. It is only an incremental
improvement upon the contrib module and solves only one real problem:
users do not read up on VACUUM or autovacuum. This is at the expense of
making it appear to be suitable for the general user base when it isn't,
in my opinion. That isn't the fault of autovacuum but is a function of the
cost of ordinary vacuum.

Thanks,

Gavin

From:	"Joshua D(dot) Drake" <jd(at)commandprompt(dot)com>
To:	Russell Smith <mr-russ(at)pws(dot)com(dot)au>
Cc:	Alvaro Herrera <alvherre(at)surnet(dot)cl>, Gavin Sherry <swm(at)linuxworld(dot)com(dot)au>, Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>, pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-16 03:57:45
Message-ID:	42B0F8B9.30706@commandprompt.com
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

> The big leap will be to get it in the backend, which will as Chris KL suggested stop people shooting themselves in the foot.
> I'm not sure what strict rules are imposed by Feature freeze, but there may be time for others to make some improvements before 8.1.
> We have also looked at this for at least 2 releases now. If it doesn't get in now, it will just get in for 8.2 and no improvements till 8.2.

Just my own two cents. First I am not knocking the work that has been on
autovacuum. I am sure that it was a leap on its own to get it to work.
However I will say that I just don't see the reason for it.

Vacuum especially in the 8.x series isn't that bad. Heck if you actually
manage your catalog even on large databases it can be reasonable. Yes
it takes a little **gasp** administrative maintenance to run vacuum at
just that right time, on just those right tables but...

Anyway -- it seems it may be beneficial to focus the efforts somewhere
else. The only reason I wanted to know if it was going to be in the
backend last week was because I needed to know if I was going to have
to document in the new book.

Sincerely,

Joshua D. Drake

>
> Regards
>
> Russell Smith
>
> ---------------------------(end of broadcast)---------------------------
> TIP 8: explain analyze is your friend

From:	Josh Berkus <josh(at)agliodbs(dot)com>
To:	Alvaro Herrera <alvherre(at)surnet(dot)cl>
Cc:	pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-16 04:04:33
Message-ID:	200506152104.34210.josh@agliodbs.com
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

Alvaro,

> One issue I do have to deal with right now is how many autovacuum
> processes do we want to be running. The current approach is to have one
> autovacuum process. Two possible options would be to have one per
> database, and one per tablespace. What do people think?

I'd vote for one, period, for the cluster, if you can manage that. Let's
stick to simple for now. Most users have their database on a single disk or
array, so multiple concurrent vacuums will compete for I/O regardless of
different databases.

--
Josh Berkus
Aglio Database Solutions
San Francisco

From:	Josh Berkus <josh(at)agliodbs(dot)com>
To:	pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-16 04:07:30
Message-ID:	200506152107.31086.josh@agliodbs.com
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

Josh,

> Just my own two cents. First I am not knocking the work that has been on
> autovacuum. I am sure that it was a leap on its own to get it to work.
> However I will say that I just don't see the reason for it.

I've personally seen at least a dozen user requests for "autovacuum in the
backend", and had this conversation about 1,100 times:

NB: "After a week, my database got really slow."
Me: "How often are you running VACUUM ANALYZE?"
NB: "Running what?"

--
Josh Berkus
Aglio Database Solutions
San Francisco

From:	Neil Conway <neilc(at)samurai(dot)com>
To:	Josh Berkus <josh(at)agliodbs(dot)com>
Cc:	Gavin Sherry <swm(at)linuxworld(dot)com(dot)au>, Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>, pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-16 04:08:20
Message-ID:	42B0FB34.5050809@samurai.com
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

Josh Berkus wrote:
> Not that I don't agree that we need a less I/O intense alternative to VACUUM,
> but it seems unlikely that we could actually do this, or even agree on a
> spec, before feature freeze.

I don't see the need to rush anything in before the feature freeze.

> Wheras integrated AV is something we *could* do, and is widely desired.

I don't see why. IMHO the current autovacuum approach is far from
optimal. If "integrated autovacuum" just means taking the same approach
and building it into the backend, how does that significantly improve
matters? (I find it difficult to take seriously answers like "it lets us
use the backend's hash table implementation"). It _does_ mean there is
more of an implicit stamp of PGDG approval for pg_autovacuum, which is
something I personally wouldn't want to give to the current design.

-Neil

From:	Neil Conway <neilc(at)samurai(dot)com>
To:	Alvaro Herrera <alvherre(at)surnet(dot)cl>
Cc:	"Matthew T(dot) O'Connor" <matthew(at)zeut(dot)net>, Josh Berkus <josh(at)agliodbs(dot)com>, Qingqing Zhou <zhouqq(at)cs(dot)toronto(dot)edu>, pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-16 04:09:47
Message-ID:	42B0FB8B.1000600@samurai.com
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

Alvaro Herrera wrote:
> One issue I do have to deal with right now is how many autovacuum
> processes do we want to be running. The current approach is to have one
> autovacuum process. Two possible options would be to have one per
> database, and one per tablespace. What do people think?

Why do we need more than one pg_autovacuum process? (Note that this need
not necessarily imply only one concurrent VACUUM, as you can use
non-blocking connections in libpq.)

-Neil

From:	Christopher Kings-Lynne <chriskl(at)familyhealth(dot)com(dot)au>
To:	Josh Berkus <josh(at)agliodbs(dot)com>
Cc:	pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-16 04:14:10
Message-ID:	42B0FC92.4010100@familyhealth.com.au
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

> I've personally seen at least a dozen user requests for "autovacuum in the
> backend", and had this conversation about 1,100 times:
>
> NB: "After a week, my database got really slow."
> Me: "How often are you running VACUUM ANALYZE?"
> NB: "Running what?"

Me too. Just hang out in #postgresql for a while :)

From:	Alvaro Herrera <alvherre(at)surnet(dot)cl>
To:	Neil Conway <neilc(at)samurai(dot)com>
Cc:	"Matthew T(dot) O'Connor" <matthew(at)zeut(dot)net>, Josh Berkus <josh(at)agliodbs(dot)com>, Qingqing Zhou <zhouqq(at)cs(dot)toronto(dot)edu>, pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-16 04:27:43
Message-ID:	20050616042743.GA14678@surnet.cl
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

On Thu, Jun 16, 2005 at 02:09:47PM +1000, Neil Conway wrote:
> Alvaro Herrera wrote:
> >One issue I do have to deal with right now is how many autovacuum
> >processes do we want to be running. The current approach is to have one
> >autovacuum process. Two possible options would be to have one per
> >database, and one per tablespace. What do people think?
>
> Why do we need more than one pg_autovacuum process?

The only reason I considered it is because you can use the regular
catalog-management routines to handle the new pg_autovacuum system
catalog. With a single process, we need to issue SQL queries. This is
very ugly IMHO.

--
Alvaro Herrera (<alvherre[a]surnet.cl>)
"Tiene valor aquel que admite que es un cobarde" (Fernandel)

From:	"Matthew T(dot) O'Connor" <matthew(at)zeut(dot)net>
To:	"Joshua D(dot) Drake" <jd(at)commandprompt(dot)com>
Cc:	Russell Smith <mr-russ(at)pws(dot)com(dot)au>, Alvaro Herrera <alvherre(at)surnet(dot)cl>, Gavin Sherry <swm(at)linuxworld(dot)com(dot)au>, Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>, pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-16 04:44:20
Message-ID:	42B103A4.5000907@zeut.net
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

Joshua D. Drake wrote:

> Just my own two cents. First I am not knocking the work that has been
> on autovacuum. I am sure that it was a leap on its own to get it to
> work. However I will say that I just don't see the reason for it.

The major reasons for autovacuum as I see it are as follows:

* Reduces administrative overhead having to keep track of what tables
need to be vacuumed how often.
* Reduces the total amount of time the system spends vacuuming since it
only vacuums when needed.
* Keeps stats up-to-date automatically
* Eliminates newbie confusion
* Eliminates one of the criticisms that the public has against
PostgreSQL (justifed or not)

Also, as VACUUM improves, autovacuum will improve with it.

From:	"Matthew T(dot) O'Connor" <matthew(at)zeut(dot)net>
To:	Gavin Sherry <swm(at)linuxworld(dot)com(dot)au>
Cc:	Alvaro Herrera <alvherre(at)surnet(dot)cl>, Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>, pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-16 04:46:11
Message-ID:	42B10413.7030901@zeut.net
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

Gavin Sherry wrote:

>I guess my main concern is that we'll have a solution to the problem of
>dead tuples which is only half way there. It is only an incremental
>improvement upon the contrib module and solves only one real problem:
>users do not read up on VACUUM or autovacuum. This is at the expense of
>making it appear to be suitable for the general user base when it isn't,
>in my opinion. That isn't the fault of autovacuum but is a function of the
>cost of ordinary vacuum.
>
>

Would you mind expounding on why you think autovacuum isn't suitable for
the general public? I know it's not a silver bullet, but I think in
general, it will be helpful for most people.

From:	"Matthew T(dot) O'Connor" <matthew(at)zeut(dot)net>
To:	Neil Conway <neilc(at)samurai(dot)com>
Cc:	Josh Berkus <josh(at)agliodbs(dot)com>, Gavin Sherry <swm(at)linuxworld(dot)com(dot)au>, Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>, pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-16 04:52:23
Message-ID:	42B10587.80706@zeut.net
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

Neil Conway wrote:

> Josh Berkus wrote:
> > Wheras integrated AV is something we *could* do, and is widely
> desired.
>
> I don't see why. IMHO the current autovacuum approach is far from
> optimal. If "integrated autovacuum" just means taking the same
> approach and building it into the backend, how does that significantly
> improve matters? (I find it difficult to take seriously answers like
> "it lets us use the backend's hash table implementation"). It _does_
> mean there is more of an implicit stamp of PGDG approval for
> pg_autovacuum, which is something I personally wouldn't want to give
> to the current design.

The reason to integrate it has nothing to do with the hash
implementation, it has to do making autovacuum more accecable to the
masses, and more importantly, it proves a solution (not necerraily the
best solution) to the vacuum problem, which I belive is a problem for
PostgreSQL. Integrating it into the backen also allows autovacuum to be
better than it is now, using the backend logging functions, storing per
table thresholds, solving the O(n2) problem, start up and shutdown
issues and more. I agree that if autovacuum becomes a long term
solution then we should also integrate FSM information etc...

What else is lacking in the current design? Or more specifically what
else would have to be done before you would consider giving it the PGDG
stamp of approval?

Matthew

From:	"Matthew T(dot) O'Connor" <matthew(at)zeut(dot)net>
To:	Alvaro Herrera <alvherre(at)surnet(dot)cl>
Cc:	Neil Conway <neilc(at)samurai(dot)com>, Josh Berkus <josh(at)agliodbs(dot)com>, Qingqing Zhou <zhouqq(at)cs(dot)toronto(dot)edu>, pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-16 04:56:36
Message-ID:	42B10684.9080401@zeut.net
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

Alvaro Herrera wrote:

>On Thu, Jun 16, 2005 at 02:09:47PM +1000, Neil Conway wrote:
>
>
>>Alvaro Herrera wrote:
>>
>>
>>>One issue I do have to deal with right now is how many autovacuum
>>>processes do we want to be running. The current approach is to have one
>>>autovacuum process. Two possible options would be to have one per
>>>database, and one per tablespace. What do people think?
>>>
>>>
>>Why do we need more than one pg_autovacuum process?
>>
>>
>
>The only reason I considered it is because you can use the regular
>catalog-management routines to handle the new pg_autovacuum system
>catalog. With a single process, we need to issue SQL queries. This is
>very ugly IMHO.
>
>

It was always my intention to have VACUUM and ANALYZE update the new
autovacuum system table, I just never got around to making that happen.

Personally I would vote for simplicty for now, that is only one
autovacuum process and allow it to only issue one VACUUM command at any
given time. Something more complicated sounds to me like a 2nd
generation optimisation.

From:	"Matthew T(dot) O'Connor" <matthew(at)zeut(dot)net>
To:	Neil Conway <neilc(at)samurai(dot)com>
Cc:	Alvaro Herrera <alvherre(at)surnet(dot)cl>, Josh Berkus <josh(at)agliodbs(dot)com>, Qingqing Zhou <zhouqq(at)cs(dot)toronto(dot)edu>, pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-16 04:58:01
Message-ID:	42B106D9.1000405@zeut.net
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

Neil Conway wrote:

> Alvaro Herrera wrote:
>
>> One issue I do have to deal with right now is how many autovacuum
>> processes do we want to be running. The current approach is to have one
>> autovacuum process. Two possible options would be to have one per
>> database, and one per tablespace. What do people think?
>
>
> Why do we need more than one pg_autovacuum process? (Note that this
> need not necessarily imply only one concurrent VACUUM, as you can use
> non-blocking connections in libpq.)

Part of the backend integration work Alvaro is doing is teaching
autovacuum to do it's work without libpq.

From:	Rod Taylor <pg(at)rbt(dot)ca>
To:	"Matthew T(dot) O'Connor" <matthew(at)zeut(dot)net>
Cc:	pgsql-hackers(at)postgresql(dot)org, Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>, Gavin Sherry <swm(at)linuxworld(dot)com(dot)au>, Alvaro Herrera <alvherre(at)surnet(dot)cl>, Russell Smith <mr-russ(at)pws(dot)com(dot)au>, "Joshua D(dot) Drake" <jd(at)commandprompt(dot)com>
Subject:	Re: Autovacuum in the backend
Date:	2005-06-16 05:14:17
Message-ID:	1118898857.78366.106.camel@home
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

On Thu, 2005-06-16 at 00:44 -0400, Matthew T. O'Connor wrote:
> Joshua D. Drake wrote:
>
> > Just my own two cents. First I am not knocking the work that has been
> > on autovacuum. I am sure that it was a leap on its own to get it to
> > work. However I will say that I just don't see the reason for it.
>
> * Eliminates newbie confusion

Ignore everything else. This one is the clincher.

Someone doing serious database work is going to read the docs to find
out about backup / restore processes and basic tuning. They'll run
across the disable switch for autovacuum soon enough.

The jack of all trades IT guy who is running some minor work but doesn't
know much about databases in general won't have as many hurdles to
climb.

Besides, vacuum off by default possibly makes for huge files and takes
forever to reclaim space (cluster, vacuum full, etc.). Vacuum on by
default means worst case they turn it off and instantly their IO load
decreases.
--

From:	Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To:	Alvaro Herrera <alvherre(at)surnet(dot)cl>
Cc:	"Matthew T(dot) O'Connor" <matthew(at)zeut(dot)net>, Josh Berkus <josh(at)agliodbs(dot)com>, Qingqing Zhou <zhouqq(at)cs(dot)toronto(dot)edu>, pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-16 05:32:16
Message-ID:	10971.1118899936@sss.pgh.pa.us
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

Alvaro Herrera <alvherre(at)surnet(dot)cl> writes:
> A question for interested parties. I'm thinking in handling the
> user/password issue by reading the flat files (the copies of pg_shadow,
> pg_database, etc).

Er, what "user/password issue"? Context please.

> The only thing that I'd need to modify is add the datdba field to
> pg_database, so we can figure out an appropiate user for vacuuming each
> database.

The datdba is not necessarily a superuser, and therefore is absolutely
not the right answer for any question related to autovacuum. But in
any case, I would expect that an integrated-into-the-backend autovac
implementation would be operating at a level below any permission checks
--- so this question shouldn't be relevant anyway.

regards, tom lane

From:	Gavin Sherry <swm(at)linuxworld(dot)com(dot)au>
To:	"Matthew T(dot) O'Connor" <matthew(at)zeut(dot)net>
Cc:	Alvaro Herrera <alvherre(at)surnet(dot)cl>, Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>, pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-16 06:20:34
Message-ID:	Pine.LNX.4.58.0506161548040.20326@linuxworld.com.au
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

On Thu, 16 Jun 2005, Matthew T. O'Connor wrote:

> Gavin Sherry wrote:
>
> >I guess my main concern is that we'll have a solution to the problem of
> >dead tuples which is only half way there. It is only an incremental
> >improvement upon the contrib module and solves only one real problem:
> >users do not read up on VACUUM or autovacuum. This is at the expense of
> >making it appear to be suitable for the general user base when it isn't,
> >in my opinion. That isn't the fault of autovacuum but is a function of the
> >cost of ordinary vacuum.
> >
> >
>
> Would you mind expounding on why you think autovacuum isn't suitable for
> the general public? I know it's not a silver bullet, but I think in
> general, it will be helpful for most people.

As I said, this is largely the fault of VACUUM. The main thing I'd like to
see is a complete solution to the problem. I'm not picking on autovacuum.
However, I will elaborate a little on why I think autovacuum should not
be a feature of the backend:

1) The main argument so far is that autovacuum will ensure that users who
do not read the maintenance section of the manual will not notice a
deterioration of performance. This means that we anticipate autovacuum
being on by default. This suggests that the default autovacuum
configuration will not need tuning. I do not think that will be the case.

2) By no fault of its own, autovacuum's level of granularity is the table
level. For people dealing with non-trivial amounts of data (and we're not
talking gigabytes or terabytes here), this is a serious drawback. Vacuum
at peak times can cause very intense IO bursts -- even with the
enhancements in 8.0. I don't think the solution to the problem is to give
users the impression that it is solved and then vacuum their tables during
peak periods. I cannot stress this enough.

3) autovacuum on by default means row level stats are on by default. This
will have a non-trivial performance impact on users, IMHO. For right or
wrong, our users take the postgresql.conf defaults pretty seriously and
this level of stats collection could and will remain enabled in some
non-trivial percentage of users who turn autovacuum off (consider many
users' reluctance to change shared_buffers in previous releases). To quote
from the README:

"The overhead of the stats system has been shown to be significant under
certain workloads. For instance, a tight loop of queries performing
"select 1" was found to run nearly 30% slower when row-level stats were
enabled."

I'm not one for "select 1" benchmarks but this is a problem that hasn't
even been mentioned, as far as I recall.

4) Related to this, I guess, is that a user's FSM settings might be
completely inappropriate. The 'Just read the manual' or 'Just read the
logs' argument doesn't cut it, because the main argument for autovacuum in
the backend is that people do not and will not.

5) It doesn't actually shrink tables -- ie, there's no VACUUM FULL. If
we're telling users about VACUUM less often than we are now, there's bound
to be bloating issues (see 4).

I guess the main point is, if something major like this ships in the
backend it says to users that the problem has gone away. pg_autovacuum is
a good contrib style solution: it addresses a problem users have and
attempts to solve it the way other users might try and solve it. When you
consider it in the backend, it looks like a workaround. I think users are
better served by solving the real problem.

Gavin

From:	Hans-Jürgen Schönig <postgres(at)cybertec(dot)at>
To:	Gavin Sherry <swm(at)linuxworld(dot)com(dot)au>
Cc:	"Matthew T(dot) O'Connor" <matthew(at)zeut(dot)net>, Alvaro Herrera <alvherre(at)surnet(dot)cl>, Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>, pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-16 09:06:42
Message-ID:	42B14122.1050906@cybertec.at
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

> 2) By no fault of its own, autovacuum's level of granularity is the table
> level. For people dealing with non-trivial amounts of data (and we're not
> talking gigabytes or terabytes here), this is a serious drawback. Vacuum
> at peak times can cause very intense IO bursts -- even with the
> enhancements in 8.0. I don't think the solution to the problem is to give
> users the impression that it is solved and then vacuum their tables during
> peak periods. I cannot stress this enough.

I completly agree with Gavin - integrating this kind of thing into the
backend writer or integrate it with FSM would be the ideal solution.

I guess everybody who has already vacuumed a 2 TB relation will agree
here. VACUUM is not a problem for small "my cat Minka" databases.
However, it has been a real problem on large, heavy-load databases. I
have even seen people splitting large tables and join them with a view
to avoid long vacuums and long CREATE INDEX operations (i am not joking
- this is serious).

postgresql is more an more used to really large boxes. this is an
increasing problem. gavin's approach using a vacuum bitmap seems to be a
good approach. an alternative would be to have some sort of vacuum queue
containing a set of pages which are reported by the writing process (=
backend writer or backends).

best regards,

hans

--
Cybertec Geschwinde u Schoenig
Schoengrabern 134, A-2020 Hollabrunn, Austria
Tel: +43/664/393 39 74
www.cybertec.at, www.postgresql.at

From:	Gavin Sherry <swm(at)linuxworld(dot)com(dot)au>
To:	Hans-Jürgen Schönig <postgres(at)cybertec(dot)at>
Cc:	"Matthew T(dot) O'Connor" <matthew(at)zeut(dot)net>, Alvaro Herrera <alvherre(at)surnet(dot)cl>, Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>, pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-16 09:34:05
Message-ID:	Pine.LNX.4.58.0506161928370.21494@linuxworld.com.au
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

On Thu, 16 Jun 2005, [ISO-8859-1] Hans-Jrgen Schnig wrote:

> > 2) By no fault of its own, autovacuum's level of granularity is the table
> > level. For people dealing with non-trivial amounts of data (and we're not
> > talking gigabytes or terabytes here), this is a serious drawback. Vacuum
> > at peak times can cause very intense IO bursts -- even with the
> > enhancements in 8.0. I don't think the solution to the problem is to give
> > users the impression that it is solved and then vacuum their tables during
> > peak periods. I cannot stress this enough.
>
>
> I completly agree with Gavin - integrating this kind of thing into the
> backend writer or integrate it with FSM would be the ideal solution.
>
> I guess everybody who has already vacuumed a 2 TB relation will agree
> here. VACUUM is not a problem for small "my cat Minka" databases.
> However, it has been a real problem on large, heavy-load databases. I
> have even seen people splitting large tables and join them with a view
> to avoid long vacuums and long CREATE INDEX operations (i am not joking
> - this is serious).

I think this gets away from my point a little. People with 2 TB tables can
take care of themselves, as can people who've taken the time to partition
their tables to speed up vacuum. I'm more concerned about the majority of
people who fall in the middle -- between the hobbiest and the high end
data centre.

Thanks,

Gavin

From:	Hans-Jürgen Schönig <postgres(at)cybertec(dot)at>
To:	Gavin Sherry <swm(at)linuxworld(dot)com(dot)au>
Cc:	"Matthew T(dot) O'Connor" <matthew(at)zeut(dot)net>, Alvaro Herrera <alvherre(at)surnet(dot)cl>, Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>, pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-16 09:59:49
Message-ID:	42B14D95.8040106@cybertec.at
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

Gavin Sherry wrote:
> On Thu, 16 Jun 2005, [ISO-8859-1] Hans-J�rgen Sch�nig wrote:
>
>
>>>2) By no fault of its own, autovacuum's level of granularity is the table
>>>level. For people dealing with non-trivial amounts of data (and we're not
>>>talking gigabytes or terabytes here), this is a serious drawback. Vacuum
>>>at peak times can cause very intense IO bursts -- even with the
>>>enhancements in 8.0. I don't think the solution to the problem is to give
>>>users the impression that it is solved and then vacuum their tables during
>>>peak periods. I cannot stress this enough.
>>
>>
>>I completly agree with Gavin - integrating this kind of thing into the
>>backend writer or integrate it with FSM would be the ideal solution.
>>
>>I guess everybody who has already vacuumed a 2 TB relation will agree
>>here. VACUUM is not a problem for small "my cat Minka" databases.
>>However, it has been a real problem on large, heavy-load databases. I
>>have even seen people splitting large tables and join them with a view
>>to avoid long vacuums and long CREATE INDEX operations (i am not joking
>>- this is serious).
>
>
> I think this gets away from my point a little. People with 2 TB tables can
> take care of themselves, as can people who've taken the time to partition
> their tables to speed up vacuum. I'm more concerned about the majority of
> people who fall in the middle -- between the hobbiest and the high end
> data centre.
>
> Thanks,
>
> Gavin

I think your approach will help all of them.
If we had some sort of autovacuum (which is packages with most distros
anyway - having it in the core is nice as well) and a mechanism to
improve realloaction / vacuum speed we have solved all problems.

i do think that 2 tb can take care of themselves. the question is,
however, whether the database can do what they want ...

thanks a lot,

hans

--
Cybertec Geschwinde u Schoenig
Schoengrabern 134, A-2020 Hollabrunn, Austria
Tel: +43/664/393 39 74
www.cybertec.at, www.postgresql.at

From:	"Andrew Dunstan" <andrew(at)dunslane(dot)net>
To:	<swm(at)linuxworld(dot)com(dot)au>
Cc:	<postgres(at)cybertec(dot)at>, <matthew(at)zeut(dot)net>, <alvherre(at)surnet(dot)cl>, <pgman(at)candle(dot)pha(dot)pa(dot)us>, <pgsql-hackers(at)postgresql(dot)org>
Subject:	Re: Autovacuum in the backend
Date:	2005-06-16 10:55:40
Message-ID:	4813.24.211.165.134.1118919340.squirrel@www.dunslane.net
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

Gavin Sherry said:
> On Thu, 16 Jun 2005, [ISO-8859-1] Hans-Jürgen Schönig wrote:
>
>> > 2) By no fault of its own, autovacuum's level of granularity is the
>> > table level. For people dealing with non-trivial amounts of data
>> > (and we're not talking gigabytes or terabytes here), this is a
>> > serious drawback. Vacuum at peak times can cause very intense IO
>> > bursts -- even with the enhancements in 8.0. I don't think the
>> > solution to the problem is to give users the impression that it is
>> > solved and then vacuum their tables during peak periods. I cannot
>> > stress this enough.
>>
>>
>> I completly agree with Gavin - integrating this kind of thing into the
>> backend writer or integrate it with FSM would be the ideal solution.
>>
>> I guess everybody who has already vacuumed a 2 TB relation will agree
>> here. VACUUM is not a problem for small "my cat Minka" databases.
>> However, it has been a real problem on large, heavy-load databases. I
>> have even seen people splitting large tables and join them with a view
>> to avoid long vacuums and long CREATE INDEX operations (i am not
>> joking - this is serious).
>
> I think this gets away from my point a little. People with 2 TB tables
> can take care of themselves, as can people who've taken the time to
> partition their tables to speed up vacuum. I'm more concerned about the
> majority of people who fall in the middle -- between the hobbiest and
> the high end data centre.
>

My only problemn with what you say is that we should not incorporate AV into
the backend until these things have been solved. This would be one step down
a long raod, and that's how it should be positioned.

I am very concerned that with Feature Freeze 2 weeks away we seem to be in a
similar position to where we were a year ago. I know we don't even promise
anything, but certainly I and others believed that work was being done to
get AV into the backend in 8.1. Not doing this because we think it could be
lots better would not give people a good impression of our processes. I
certainly don't think it will make matters worse, especially if it's not on
by default.

cheers

andrew

From:	Christopher Browne <cbbrowne(at)acm(dot)org>
To:	pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-16 13:06:55
Message-ID:	m3ekb2zc0g.fsf@knuth.cbbrowne.com
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

swm(at)linuxworld(dot)com(dot)au (Gavin Sherry) wrote:
> I guess the main point is, if something major like this ships in the
> backend it says to users that the problem has gone away. pg_autovacuum is
> a good contrib style solution: it addresses a problem users have and
> attempts to solve it the way other users might try and solve it. When you
> consider it in the backend, it looks like a workaround. I think users are
> better served by solving the real problem.

Hear, hear!

It seems to me that the point in time at which it is *really*
appropriate to put this into the backend is when the new GUC variable
"dead_tuple_map_size" (akin to FSM) is introduced, and there is a new
sort of 'VACUUM DEAD TUPLES' command which goes through the DTPM (Dead
Tuple Page Map).

In THAT case, there would be the ability to do a VACUUM on the "dead
bits" of the table that consists of 50M rows without having to go
through the 49M rows that haven't been touched in months.
--
"cbbrowne","@","gmail.com"
http://linuxfinances.info/info/languages.html
"I can't escape the sensation that I have already been thinking in
Lisp all my programming career, but forcing the ideas into the
constraints of bad languages, which explode those ideas into a
bewildering array of details, most of which are workarounds for the
language." -- Kaz Kylheku

From:	"Matthew T(dot) O'Connor" <matthew(at)zeut(dot)net>
To:	Andrew Dunstan <andrew(at)dunslane(dot)net>
Cc:	swm(at)linuxworld(dot)com(dot)au, postgres(at)cybertec(dot)at, alvherre(at)surnet(dot)cl, pgman(at)candle(dot)pha(dot)pa(dot)us, pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-16 13:47:24
Message-ID:	42B182EC.10808@zeut.net
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

Andrew Dunstan wrote:

>Gavin Sherry said:
>
>
>>I think this gets away from my point a little. People with 2 TB tables
>>can take care of themselves, as can people who've taken the time to
>>partition their tables to speed up vacuum. I'm more concerned about the
>>majority of people who fall in the middle -- between the hobbiest and
>>the high end data centre.
>>
>>
>>
>
>My only problemn with what you say is that we should not incorporate AV into
>the backend until these things have been solved. This would be one step down
>a long raod, and that's how it should be positioned.
>
>

Right, I think if VACUUM is improved than the semantics of AV in the
backend might change, but I think there will always be a need for some
maintenance, and a daemon that monitors the maintenance needs of your
database and fires off appropriate maintenance commands for you is
good. No it doesn't solve all problems, but I think it solves a lot of
problems for a lot of people. Besides VACUUM isn't the only the AV
does, it also does ANALYZE to keep your stats up-to-date and it watches
for XID wraparound. It could also look for REINDEX opportunities and
who knows what else in the future.

>I am very concerned that with Feature Freeze 2 weeks away we seem to be in a
>similar position to where we were a year ago. I know we don't even promise
>anything, but certainly I and others believed that work was being done to
>get AV into the backend in 8.1. Not doing this because we think it could be
>lots better would not give people a good impression of our processes. I
>certainly don't think it will make matters worse, especially if it's not on
>by default.
>
>

I agree. Also, some people in this thread have been making noises about
wanting AV on by default. This might be nice, but I am still leaning
towards off by default at least in 8.1.

From:	"Matthew T(dot) O'Connor" <matthew(at)zeut(dot)net>
To:	Gavin Sherry <swm(at)linuxworld(dot)com(dot)au>
Cc:	Alvaro Herrera <alvherre(at)surnet(dot)cl>, Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>, pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-16 14:01:53
Message-ID:	42B18651.5000700@zeut.net
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

Gavin Sherry wrote:

>On Thu, 16 Jun 2005, Matthew T. O'Connor wrote:
>
>
>>Would you mind expounding on why you think autovacuum isn't suitable for
>>the general public? I know it's not a silver bullet, but I think in
>>general, it will be helpful for most people.
>>
>>
>
>As I said, this is largely the fault of VACUUM. The main thing I'd like to
>see is a complete solution to the problem. I'm not picking on autovacuum.
>However, I will elaborate a little on why I think autovacuum should not
>be a feature of the backend:
>
>

Don't worry, I don't think you are picking on AV.

>1) The main argument so far is that autovacuum will ensure that users who
>do not read the maintenance section of the manual will not notice a
>deterioration of performance. This means that we anticipate autovacuum
>being on by default. This suggests that the default autovacuum
>configuration will not need tuning. I do not think that will be the case.
>
>

I disagree with this. I think the newbie protection benefits of AV are
not it's primary goal, though I do think it's an important one. The
main thing AV brings is the ability to control bloating in your database
and keep your stats up-to-date no matter what your work load. It is
possible for an Admin to setup cron scripts to run VACUUM or ANALYZE on
particularly needy tables at appropriate intervals, but I guarantee that
the cron script is going to either fire too many, or too few VACUUMS.
Also when the workload changes, or a new table is added, the Admin then
needs to update his cron scripts. This all goes away with AV and I
believe this is a much bigger goal than the newbie problem.

>2) By no fault of its own, autovacuum's level of granularity is the table
>level. For people dealing with non-trivial amounts of data (and we're not
>talking gigabytes or terabytes here), this is a serious drawback. Vacuum
>at peak times can cause very intense IO bursts -- even with the
>enhancements in 8.0. I don't think the solution to the problem is to give
>users the impression that it is solved and then vacuum their tables during
>peak periods. I cannot stress this enough.
>
>

I agree this is a major problem with VACUUM, but I also think it's a
different problem. One advantage of integrated AV is that you will be
able to set per-table thresholds, which include the ability to turn off
AV for any given table. If you are running a database with tables this
big, I think you will be able to figure out how to customize integrated
AV to your needs.

>3) autovacuum on by default means row level stats are on by default. This
>will have a non-trivial performance impact on users, IMHO. For right or
>wrong, our users take the postgresql.conf defaults pretty seriously and
>this level of stats collection could and will remain enabled in some
>non-trivial percentage of users who turn autovacuum off (consider many
>users' reluctance to change shared_buffers in previous releases). To quote
>from the README:
>
>"The overhead of the stats system has been shown to be significant under
>certain workloads. For instance, a tight loop of queries performing
>"select 1" was found to run nearly 30% slower when row-level stats were
>enabled."
>
>I'm not one for "select 1" benchmarks but this is a problem that hasn't
>even been mentioned, as far as I recall.
>
>

I mentioned this in the README because I thought I should, not because I
think it's a real problem in practice. I think a real production
database doing queries that are any more complicated than "select 1"
will probably not notice the difference.

>4) Related to this, I guess, is that a user's FSM settings might be
>completely inappropriate. The 'Just read the manual' or 'Just read the
>logs' argument doesn't cut it, because the main argument for autovacuum in
>the backend is that people do not and will not.
>
>

Agreed, it doesn't solve all problems, and I'm not arguing that the
integration of AV makes PostgreSQL newbie safe it just helps reduce the
newbie problem. Again if the default FSM settings are inappropriate
for a database then the user is probably doing something more
complicated that a "my cat minka" database and will need to learn some
tuning skills anyway.

>5) It doesn't actually shrink tables -- ie, there's no VACUUM FULL. If
>we're telling users about VACUUM less often than we are now, there's bound
>to be bloating issues (see 4).
>
>

Not totally true, regular VACUUM can shrink tables a little (I think
only if there is free space at the end of the table it can cutoff
without moving data around). But if AV is on and the settings are
reasonable, then a table shouldn't bloat much or at all. Also, I don't
think we are telling people to VACUUM less, in fact tables that need it
will usually get VACUUM'd more, we are just telling the users that if
they turn AV on, they don't have to manage all the VACUUMing.

>I guess the main point is, if something major like this ships in the
>backend it says to users that the problem has gone away. pg_autovacuum is
>a good contrib style solution: it addresses a problem users have and
>attempts to solve it the way other users might try and solve it. When you
>consider it in the backend, it looks like a workaround. I think users are
>better served by solving the real problem.
>

Which problem goes away? The problem of users forgetting to VACUUM does
go away, the problem of the VACUUM command being problematic on large
tables doesn't but that is a different question.

My basic position is that with integrated AV, there will always (or at
least for the foreseeable future) be some maintenance that users will
need to do to their databases by hand (or by cron) and that AV does this
better than cron does. When VACUUM is improved, the semantics of AV
might change, but the maintenance work will still need to be done.

Matt

From:	"Matthew T(dot) O'Connor" <matthew(at)zeut(dot)net>
To:	Hans-Jürgen Schönig <postgres(at)cybertec(dot)at>
Cc:	Gavin Sherry <swm(at)linuxworld(dot)com(dot)au>, Alvaro Herrera <alvherre(at)surnet(dot)cl>, Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>, pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-16 14:04:00
Message-ID:	42B186D0.2020606@zeut.net
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

Hans-Jürgen Schönig wrote:

> I completly agree with Gavin - integrating this kind of thing into the
> backend writer or integrate it with FSM would be the ideal solution.

Yes AV should look at FSM data, and it will eventually. I'm not sure
how you would integrate AV with the backend writer, but again if
improvements are made to vacuum, AV might have to change along with it,
but I still think it will be needed or at least helpful.

From:	Steve Atkins <steve(at)blighty(dot)com>
To:	pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-16 14:05:01
Message-ID:	20050616140501.GB11384@gp.word-to-the-wise.com
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

On Wed, Jun 15, 2005 at 09:07:30PM -0700, Josh Berkus wrote:
> Josh,
>
> > Just my own two cents. First I am not knocking the work that has been on
> > autovacuum. I am sure that it was a leap on its own to get it to work.
> > However I will say that I just don't see the reason for it.
>
> I've personally seen at least a dozen user requests for "autovacuum in the
> backend", and had this conversation about 1,100 times:
>
> NB: "After a week, my database got really slow."
> Me: "How often are you running VACUUM ANALYZE?"
> NB: "Running what?"

Yes, me too.

I always understood autovacuum to be a way to avoid having newbies get
burned by not vacuuming, and for simplifying the maintenance of lower
traffic databases.

I don't see people with high-traffic databases (relative to the hardware
they're running on) ever using autovacuum with the current state of
vacuum and autovacuum.

If improvements to vacuum (unrelated to autovacuum) reduce the IO load
that would be a great thing, especially for those of us dealing with
24x7 databases. (I really like the dirty bitmap suggestion - it sounds
a clean way to reduce the amount of work needed). If autovacuum were
extended to allow more flexible scheduling (or even to be aware of the
other IO going on) then it would be of wider use - but I think the real
value of autovacuum is to make sure that new users (Windows...) don't
have a bad experience when they first try PG.

Cheers,
Steve

From:	Andreas Pflug <pgadmin(at)pse-consulting(dot)de>
To:	Dave Page <dpage(at)vale-housing(dot)co(dot)uk>
Cc:	"Matthew T(dot) O'Connor" <matthew(at)zeut(dot)net>, Andrew Dunstan <andrew(at)dunslane(dot)net>, swm(at)linuxworld(dot)com(dot)au, postgres(at)cybertec(dot)at, alvherre(at)surnet(dot)cl, pgman(at)candle(dot)pha(dot)pa(dot)us, pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-16 14:13:49
Message-ID:	42B1891D.1020609@pse-consulting.de
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

Matthew T. O'Connor wrote:

>
> Right, I think if VACUUM is improved than the semantics of AV in the
> backend might change, but I think there will always be a need for some
> maintenance, and a daemon that monitors the maintenance needs of your
> database and fires off appropriate maintenance commands for you is
> good. No it doesn't solve all problems, but I think it solves a lot of
> problems for a lot of people. Besides VACUUM isn't the only the AV
> does, it also does ANALYZE to keep your stats up-to-date and it watches
> for XID wraparound. It could also look for REINDEX opportunities and
> who knows what else in the future.

Dave,

i wonder if we should aim to have pgAgent in the backend which was one
of the reasons why I considered to have it converted from C++ to pure C.

There are many regular maintenance issues that AV can cover, some more
it could cover and many more we can't even think of right now. Having an
sql executing agent freely at hand (consistent on _every_ platform,
without cron/anacron/at/younameit dependencies) should be helpful for that.

Regards,
Andreas

From:	Alvaro Herrera <alvherre(at)surnet(dot)cl>
To:	Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
Cc:	"Matthew T(dot) O'Connor" <matthew(at)zeut(dot)net>, Josh Berkus <josh(at)agliodbs(dot)com>, Qingqing Zhou <zhouqq(at)cs(dot)toronto(dot)edu>, pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-16 15:08:33
Message-ID:	20050616150833.GB16044@surnet.cl
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

On Thu, Jun 16, 2005 at 01:32:16AM -0400, Tom Lane wrote:
> Alvaro Herrera <alvherre(at)surnet(dot)cl> writes:
> > A question for interested parties. I'm thinking in handling the
> > user/password issue by reading the flat files (the copies of pg_shadow,
> > pg_database, etc).
>
> Er, what "user/password issue"? Context please.
>
> > The only thing that I'd need to modify is add the datdba field to
> > pg_database, so we can figure out an appropiate user for vacuuming each
> > database.
>
> The datdba is not necessarily a superuser, and therefore is absolutely
> not the right answer for any question related to autovacuum. But in
> any case, I would expect that an integrated-into-the-backend autovac
> implementation would be operating at a level below any permission checks
> --- so this question shouldn't be relevant anyway.

Ok, seems things are quite a bit out of context. What I did was take
Matthew's patch for integrating contrib pg_autovacuum into the
postmaster. This patch was posted several times as of July and August
2004. This patch had several issues, like an incorrect shutdown
sequence, forcing libpq to be statically linked into the backend, not
correctly using ereport(), not using the backend's memory management
infrastructure.

There were several suggestions. One was to separate it in two parts,
one which would be a process launched by postmaster, and another which
would be a shared library, loaded by that other process, which would in
turn load libpq and issue SQL queries (including but not limited to
VACUUM and ANALYZE queries) to a regular backend, using a regular
connection.

Now, the user/password issue is which user and password combination is
used to connect to the regular backend. Matthew had created a password
file, to be used in a similar fashion to libpq's password file. This
works but has the drawback that the user has to set the file correctly.
What I'm proposing is using the flatfiles for this.

Now, I'm hearing people don't like using libpq. This means the whole
thing turn a lot more complicated; for one thing, because it will need
to "connect" to every database in some fashion. Also, you want it to
"skip" normal permission checks, which would be doable only if it's not
using libpq. On the other hand, if there were multiple autovacuum
processes, one per database, it'd be all much easier, without using
libpq.

Could we clarify what scenario is people envisioning? I don't want to
waste time fixing code that in the end is going to be declared as
fundamentally flawed -- I'd rather work on shared dependencies.

Some people say "keep it simple and have one process per cluster." I
think they don't realize it's actually more complex, not the other way
around. The only additional complexity is how to handle concurrent
vacuuming, but the code turns out to be simpler because we have access
to system catalogs and standard backend infrastructure in a simple
fashion.

A wholly separate approach is what should the autovacuum daemon be
doing. At present we only have "full vacuum", "vacuum" and "analyze".
In the future this can be extended and autovacuum can launch partial
vacuums, nappy vacuums, bitmapped vacuums, coffee-with-cream vacuums.
But we need to start somewhere.

--
Alvaro Herrera (<alvherre[a]surnet.cl>)
"¿Qué importan los años? Lo que realmente importa es comprobar que
a fin de cuentas la mejor edad de la vida es estar vivo" (Mafalda)

From:	Alvaro Herrera <alvherre(at)surnet(dot)cl>
To:	Gavin Sherry <swm(at)linuxworld(dot)com(dot)au>
Cc:	"Matthew T(dot) O'Connor" <matthew(at)zeut(dot)net>, Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>, pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-16 15:11:20
Message-ID:	20050616151120.GC16044@surnet.cl
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

On Thu, Jun 16, 2005 at 04:20:34PM +1000, Gavin Sherry wrote:

People running systems with petabyte-sized tables can disable autovacuum
for those tables, and leave it running for the rest. Then they can
schedule whatever maintenance they see fit on their gigantic tables.
Trying to run a database with more than a dozen gigabytes of data
without expert advice (or at least reading the manual) would be
extremely stupid anyway.

--
Alvaro Herrera (<alvherre[a]surnet.cl>)
"¿Cómo puedes confiar en algo que pagas y que no ves,
y no confiar en algo que te dan y te lo muestran?" (Germán Poo)

From:	Hans-Jürgen Schönig <postgres(at)cybertec(dot)at>
To:	Alvaro Herrera <alvherre(at)surnet(dot)cl>
Cc:	Gavin Sherry <swm(at)linuxworld(dot)com(dot)au>, "Matthew T(dot) O'Connor" <matthew(at)zeut(dot)net>, Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>, pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-16 15:29:47
Message-ID:	42B19AEB.6040502@cybertec.at
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

Alvaro Herrera wrote:
> On Thu, Jun 16, 2005 at 04:20:34PM +1000, Gavin Sherry wrote:
>
>
>>2) By no fault of its own, autovacuum's level of granularity is the table
>>level. For people dealing with non-trivial amounts of data (and we're not
>>talking gigabytes or terabytes here), this is a serious drawback. Vacuum
>>at peak times can cause very intense IO bursts -- even with the
>>enhancements in 8.0. I don't think the solution to the problem is to give
>>users the impression that it is solved and then vacuum their tables during
>>peak periods. I cannot stress this enough.
>
>
> People running systems with petabyte-sized tables can disable autovacuum
> for those tables, and leave it running for the rest. Then they can
> schedule whatever maintenance they see fit on their gigantic tables.
> Trying to run a database with more than a dozen gigabytes of data
> without expert advice (or at least reading the manual) would be
> extremely stupid anyway.
>

professional advice won't help you here because you still have to vacuum
this giant table. this is especially critical in case of 24x7 systems
(which are quite frequent). in many cases there is no maintenance window
anymore (e.g. a wastewater system will be only 24x7).

reducing the impact of vacuum and "create index" would be important to
many people. to me improving vacuum it is as important as Jan's bgwriter
patch (it reduces the troubles people had with checkpoints).

best regards,

hans

--
Cybertec Geschwinde u Schoenig
Schoengrabern 134, A-2020 Hollabrunn, Austria
Tel: +43/664/393 39 74
www.cybertec.at, www.postgresql.at

From:	Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To:	Alvaro Herrera <alvherre(at)surnet(dot)cl>
Cc:	"Matthew T(dot) O'Connor" <matthew(at)zeut(dot)net>, Josh Berkus <josh(at)agliodbs(dot)com>, Qingqing Zhou <zhouqq(at)cs(dot)toronto(dot)edu>, pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-16 15:42:01
Message-ID:	16313.1118936521@sss.pgh.pa.us
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

Alvaro Herrera <alvherre(at)surnet(dot)cl> writes:
> Now, I'm hearing people don't like using libpq.

Yeah --- a libpq-based solution is not what I think of as integrated at
all, because it cannot do anything that couldn't be done by the existing
external autovacuum process. About all you can buy there is having the
postmaster spawn the autovacuum process, which is slightly more
convenient to use but doesn't buy any real new functionality.

> Some people say "keep it simple and have one process per cluster." I
> think they don't realize it's actually more complex, not the other way
> around.

Agreed. If you aren't connected to a specific database, then you cannot
use any of the normal backend infrastructure for catalog access, which
is pretty much a killer limitation.

A simple approach would be a persistent autovac background process for
each database, but I don't think that's likely to be acceptable because
of the amount of resources tied up (PGPROC slots, open files, etc).

One thing that might work is to have the postmaster spawn an autovac
process every so often. The first thing the autovac child does is pick
up the current statistics dump file (which it can find without being
connected to any particular database). It looks through that to
determine which database is most in need of work, then connects to that
database and does some "reasonable" amount of work there, and finally
quits. Awhile later the postmaster spawns another autovac process that
can connect to a different database and do work there.

This design would mean that the autovac process could not have any
long-term state of its own: any long-term state would have to be in
either system catalogs or the statistics. But I don't see that as
a bad thing really --- exposing the state will be helpful from a
debugging and administrative standpoint.

regards, tom lane

From:	Josh Berkus <josh(at)agliodbs(dot)com>
To:	Alvaro Herrera <alvherre(at)surnet(dot)cl>
Cc:	pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-16 16:13:54
Message-ID:	200506160913.54650.josh@agliodbs.com
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

Alvaro,

> coffee-with-cream vacuums.

I tried this and now my Hoover makes this horrible noise and smokes. ;-)

All:

Seriously, all: when I said that "users" were asking for Autovac in the
backend (AVitB), I wasn't talking just the newbies on #postgresql. I'm also
talking companies like Hyperic, and whole groups like the postgresql.org.br.
This is a feature that people want, and unless there's something
fundamentally unstable about it, it seems really stupid to hold it back
because we're planning VACUUM improvements for 8.2.

AVitB has been on the TODO list for 2 versions. There's been 2 years to
question its position there. Now people are bringing up objections when
there's no time for discussion left? This stinks.

--
Josh Berkus
Aglio Database Solutions
San Francisco

From:	Josh Berkus <josh(at)agliodbs(dot)com>
To:	pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-16 16:24:53
Message-ID:	200506160924.53895.josh@agliodbs.com
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

People,

> AVitB has been on the TODO list for 2 versions. There's been 2 years to
> question its position there. Now people are bringing up objections when
> there's no time for discussion left? This stinks.

Hmmm ... to be specific, I'm referring to the objections to the *idea* of
AVitB, not the problems with the current patch.

--
Josh Berkus
Aglio Database Solutions
San Francisco

From:	Douglas McNaught <doug(at)mcnaught(dot)org>
To:	Josh Berkus <josh(at)agliodbs(dot)com>
Cc:	Alvaro Herrera <alvherre(at)surnet(dot)cl>, pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-16 16:29:43
Message-ID:	m2u0jy6z9k.fsf@Douglas-McNaughts-Powerbook.local
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

Josh Berkus <josh(at)agliodbs(dot)com> writes:

> Seriously, all: when I said that "users" were asking for Autovac in the
> backend (AVitB), I wasn't talking just the newbies on #postgresql. I'm also
> talking companies like Hyperic, and whole groups like the postgresql.org.br.
> This is a feature that people want, and unless there's something
> fundamentally unstable about it, it seems really stupid to hold it back
> because we're planning VACUUM improvements for 8.2.

Agreed, and I don't see AVitB as standing in the way of any of those
proposed improvements--it's just that AVitB has a chance of making it
into 8.1, and none of the proposed improvements do. I don't see why
people are objecting.

Also, count me in the "turn it on by default" crowd--I'd rather not
have newbies see unending file bloat from normal usage, it just looks
bad. Anyone who plans to deploy for large databases and high loads
needs to learn to tune (just as with any other database) and can make
an informed decision about whether AV should be on or not.

-Doug

From:	"Matthew T(dot) O'Connor" <matthew(at)zeut(dot)net>
To:	Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
Cc:	Alvaro Herrera <alvherre(at)surnet(dot)cl>, Josh Berkus <josh(at)agliodbs(dot)com>, Qingqing Zhou <zhouqq(at)cs(dot)toronto(dot)edu>, pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-16 16:54:39
Message-ID:	42B1AECF.9080005@zeut.net
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

Tom Lane wrote:

>Alvaro Herrera <alvherre(at)surnet(dot)cl> writes:
>
>
>>Now, I'm hearing people don't like using libpq.
>>
>>
>
>Yeah --- a libpq-based solution is not what I think of as integrated at
>all, because it cannot do anything that couldn't be done by the existing
>external autovacuum process. About all you can buy there is having the
>postmaster spawn the autovacuum process, which is slightly more
>convenient to use but doesn't buy any real new functionality.
>
>

Yes libpq has to go, I thought this was clear, but perhaps I didn't say
it clearly enough. Anyway, this was the stumbling block which prevented
me from making more progress on autovacuum integration.

>>Some people say "keep it simple and have one process per cluster." I
>>think they don't realize it's actually more complex, not the other way
>>around.
>>
>>
>
>A simple approach would be a persistent autovac background process for
>each database, but I don't think that's likely to be acceptable because
>of the amount of resources tied up (PGPROC slots, open files, etc).
>
>

Agreed, this seems ugly.

>One thing that might work is to have the postmaster spawn an autovac
>process every so often. The first thing the autovac child does is pick
>up the current statistics dump file (which it can find without being
>connected to any particular database). It looks through that to
>determine which database is most in need of work, then connects to that
>database and does some "reasonable" amount of work there, and finally
>quits. Awhile later the postmaster spawns another autovac process that
>can connect to a different database and do work there.
>
>

I don't think you can use a dump to determine who should be connected to
next since you don't really know what happened since the last time you
exited. What was a priority 5 or 10 minutes ago might not be a priority
now.

>This design would mean that the autovac process could not have any
>long-term state of its own: any long-term state would have to be in
>either system catalogs or the statistics. But I don't see that as
>a bad thing really --- exposing the state will be helpful from a
>debugging and administrative standpoint.
>

This is not a problem as my patch, that Alvaro has now taken over,
already created a new system catalog for all autovac data, so autovac
really doesn't contain any static persistent data.

The rough design I had in mind was:
1) On startup postmaster spawns the master autovacuum process
2) The master autovacuum process spawns backends to do the vacuuming
work on a particular database
3) The master autovacuum waits for this process to exit, then spaws the
next backend for the next database
4) Repeat this loop until all databases in the cluster have been
checked, then sleep for a while, and start over again.

I'm not sure if this is feasible, or if this special master autovacuum
process would be able to fork off or request that the postmaster fork
off an autovacuum process for a particular database in the cluster.
Thoughts or comments?

Matthew

From:	Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To:	"Matthew T(dot) O'Connor" <matthew(at)zeut(dot)net>
Cc:	Alvaro Herrera <alvherre(at)surnet(dot)cl>, Josh Berkus <josh(at)agliodbs(dot)com>, Qingqing Zhou <zhouqq(at)cs(dot)toronto(dot)edu>, pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-16 17:12:36
Message-ID:	17465.1118941956@sss.pgh.pa.us
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

"Matthew T. O'Connor" <matthew(at)zeut(dot)net> writes:
> I don't think you can use a dump to determine who should be connected to
> next since you don't really know what happened since the last time you
> exited. What was a priority 5 or 10 minutes ago might not be a priority
> now.

Well, the information necessary to make that decision has to be
available from the statistics file. This doesn't seem like an
insuperable problem.

> The rough design I had in mind was:
> 1) On startup postmaster spawns the master autovacuum process
> 2) The master autovacuum process spawns backends to do the vacuuming
> work on a particular database
> 3) The master autovacuum waits for this process to exit, then spaws the
> next backend for the next database
> 4) Repeat this loop until all databases in the cluster have been
> checked, then sleep for a while, and start over again.

This is unworkable, I believe, because backends have to be direct
children of the postmaster. I don't recall the details at the moment
but there are IPC signaling reasons for it.

> I'm not sure if this is feasible, or if this special master autovacuum
> process would be able to fork off or request that the postmaster fork
> off an autovacuum process for a particular database in the cluster.
> Thoughts or comments?

It's possible that we could add some signaling whereby the autovac
master could request the postmaster to fork a child into a particular
database. I'm not sure why this is a lot better than keeping the
stats out where everyone can see them...

regards, tom lane

From:	"Matthew T(dot) O'Connor" <matthew(at)zeut(dot)net>
To:	Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
Cc:	Alvaro Herrera <alvherre(at)surnet(dot)cl>, Josh Berkus <josh(at)agliodbs(dot)com>, Qingqing Zhou <zhouqq(at)cs(dot)toronto(dot)edu>, pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-16 17:41:38
Message-ID:	42B1B9D2.20407@zeut.net
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

Tom Lane wrote:

>"Matthew T. O'Connor" <matthew(at)zeut(dot)net> writes:
>
>
>>I don't think you can use a dump to determine who should be connected to
>>next since you don't really know what happened since the last time you
>>exited. What was a priority 5 or 10 minutes ago might not be a priority
>>now.
>>
>>
>
>Well, the information necessary to make that decision has to be
>available from the statistics file. This doesn't seem like an
>insuperable problem.
>
>

Interesting, so the postmaster would kick off an autovacuum process,
which would read in data from the stats system by hand ( it can do this
because the stat system writes it's data to flat files?). I don't know
how complicated this might be but perhaps a simpler method is to just
have each autovacuum process write a file for itself noting what
database it should connect to next. This would work find assuming we
want to continue to loop through all the databases in much the same
fashion as pg_autovacuum currently does.

>>The rough design I had in mind was:
>>1) On startup postmaster spawns the master autovacuum process
>>2) The master autovacuum process spawns backends to do the vacuuming
>>work on a particular database
>>3) The master autovacuum waits for this process to exit, then spaws the
>>next backend for the next database
>>4) Repeat this loop until all databases in the cluster have been
>>checked, then sleep for a while, and start over again.
>>
>>
>It's possible that we could add some signaling whereby the autovac
>master could request the postmaster to fork a child into a particular
>database. I'm not sure why this is a lot better than keeping the
>stats out where everyone can see them...
>

Ok.

From:	Hannu Krosing <hannu(at)skype(dot)net>
To:	Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
Cc:	Alvaro Herrera <alvherre(at)surnet(dot)cl>, "Matthew T(dot) O'Connor" <matthew(at)zeut(dot)net>, Josh Berkus <josh(at)agliodbs(dot)com>, Qingqing Zhou <zhouqq(at)cs(dot)toronto(dot)edu>, pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-16 18:42:13
Message-ID:	42B1C805.507@skype.net
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

On N, 2005-06-16 at 11:42 -0400, Tom Lane wrote:
> Alvaro Herrera <alvherre(at)surnet(dot)cl> writes:
...
> > Some people say "keep it simple and have one process per cluster." I
> > think they don't realize it's actually more complex, not the other way
> > around.
>
> Agreed. If you aren't connected to a specific database, then you cannot
> use any of the normal backend infrastructure for catalog access, which
> is pretty much a killer limitation.
>
> A simple approach would be a persistent autovac background process for
> each database, but I don't think that's likely to be acceptable because
> of the amount of resources tied up (PGPROC slots, open files, etc).

In this case it should also be configurable, which databases will get
their own AV processes.

Also, there is probably no need to keep an AV process running very long
after last "real" backend for that database has closed, as there won't
be any changes anyway.

Having one AV process per DB will likely be a problem for only
installations, where there is very many single-user user-always-
connected databases, which I don't expect to be that many.

And I also expect that soon (after my vacuums-dont-step-on-each-other
patch goes in), there will be need for running several vacuums in
parallel on the same database (say one with non-intrusive vacuum_page
settings for a really large table and several more agressive ones for
fast-changing small tables at the same time), AFAIKS this will also need
several backends - at least one for each parallel vacuum.

> One thing that might work is to have the postmaster spawn an autovac
> process every so often.

my fastest manual vacuum does its job in 5 sec and is repeated at 10 sec
inervals - will this design be able to match this ?

> The first thing the autovac child does is pick
> up the current statistics dump file (which it can find without being
> connected to any particular database). It looks through that to
> determine which database is most in need of work, then connects to that
> database and does some "reasonable" amount of work there, and finally
> quits. Awhile later the postmaster spawns another autovac process that
> can connect to a different database and do work there.
>
> This design would mean that the autovac process could not have any
> long-term state of its own: any long-term state would have to be in
> either system catalogs or the statistics. But I don't see that as
> a bad thing really --- exposing the state will be helpful from a
> debugging and administrative standpoint.

--
Hannu Krosing <hannu(at)skype(dot)net>

From:	Gavin Sherry <swm(at)linuxworld(dot)com(dot)au>
To:	Alvaro Herrera <alvherre(at)surnet(dot)cl>
Cc:	"Matthew T(dot) O'Connor" <matthew(at)zeut(dot)net>, Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>, pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-16 22:22:16
Message-ID:	Pine.LNX.4.58.0506170817200.25511@linuxworld.com.au
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

On Thu, 16 Jun 2005, Alvaro Herrera wrote:

> On Thu, Jun 16, 2005 at 04:20:34PM +1000, Gavin Sherry wrote:
>
> > 2) By no fault of its own, autovacuum's level of granularity is the table
> > level. For people dealing with non-trivial amounts of data (and we're not
> > talking gigabytes or terabytes here), this is a serious drawback. Vacuum
> > at peak times can cause very intense IO bursts -- even with the
> > enhancements in 8.0. I don't think the solution to the problem is to give
> > users the impression that it is solved and then vacuum their tables during
> > peak periods. I cannot stress this enough.
>
> People running systems with petabyte-sized tables can disable autovacuum
> for those tables, and leave it running for the rest. Then they can
> schedule whatever maintenance they see fit on their gigantic tables.
> Trying to run a database with more than a dozen gigabytes of data
> without expert advice (or at least reading the manual) would be
> extremely stupid anyway.

As I've said a few times, I'm not concerned about such users. I'm
concerned about users with some busy tables of a few hundred megabytes. I
still don't think VACUUM at arbitary times on such tables is suitable.

Thanks,

Gavin

From:	Tim Allen <tim(at)proximity(dot)com(dot)au>
To:	Alvaro Herrera <alvherre(at)surnet(dot)cl>, pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-17 01:04:33
Message-ID:	42B221A1.8000200@proximity.com.au
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

Josh Berkus wrote:
> Alvaro,
>
>
>>coffee-with-cream vacuums.
>
> I tried this and now my Hoover makes this horrible noise and smokes. ;-)

Probably related to the quality of American coffee ;).

> All:
>
> Seriously, all: when I said that "users" were asking for Autovac in the
> backend (AVitB), I wasn't talking just the newbies on #postgresql. I'm also
> talking companies like Hyperic, and whole groups like the postgresql.org.br.
> This is a feature that people want, and unless there's something
> fundamentally unstable about it, it seems really stupid to hold it back
> because we're planning VACUUM improvements for 8.2.
>
> AVitB has been on the TODO list for 2 versions. There's been 2 years to
> question its position there. Now people are bringing up objections when
> there's no time for discussion left? This stinks.

Complete agreement from me. Incremental improvements are good - pointing
out that there are some other incremental improvements that would also
be good to make is not an argument for delaying the first set of
incremental improvements.

In our case, we want to be able to install postgres at dozens (ideally
hundreds... no, thousands :) ) of customer sites, where the customers in
general are not going to have anyone onsite who has a clue about
postgres. The existing contrib autovacuum gives a good solution to
setting things up to maintain the database in a reasonable state of
health without need for further intervention from us. It's not perfect,
of course, but if it means the difference between having to unleash our
support team on a customer once a month and once a year, that's a good
deal for us. Having it integrated into the backend will make it much
easier for us, we (hopefully...) won't have to fiddle with extra startup
scripts, and we'll have one fewer point of failure (eg some customer
might accidentally turn off the separate pg_autovacuum daemon). Being
able to customise the autovacuum parameters on a per-table basis is also
attractive.

Just my AUD0.02. I realise that keeping _our_ customers happy is not
necessarily anyone else's priority. I'd like to be able to invest some
coding time, but can't. I haven't even gotten around to completing
Gavin's survey form (sorry Gav, I'll get to it soon, I hope! :)), so I
can't demand to be listened to.

But for what it's worth, Alvaro, please keep going, don't be dissuaded.

Tim

--
-----------------------------------------------
Tim Allen tim(at)proximity(dot)com(dot)au
Proximity Pty Ltd http://www.proximity.com.au/

From:	"Qingqing Zhou" <zhouqq(at)cs(dot)toronto(dot)edu>
To:	pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-17 02:12:06
Message-ID:	d8tblj$15mi$1@news.hub.org
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

"Tom Lane" <tgl(at)sss(dot)pgh(dot)pa(dot)us> writes:
>
> Yeah --- a libpq-based solution is not what I think of as integrated at
> all, because it cannot do anything that couldn't be done by the existing
> external autovacuum process. About all you can buy there is having the
> postmaster spawn the autovacuum process, which is slightly more
> convenient to use but doesn't buy any real new functionality.
>

One reason of not using lib-pq is that this one has to wait for the
completion of each vacuum (we don't has async execution in libpq right?),
but by signaling does not.

But by signaling, we have to detect that if the forked backend successfully
done its job. I am not sure how to easily incorporate this into current
signaling framework.

Regards,
Qingqing

From:	Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>
To:	Gavin Sherry <swm(at)linuxworld(dot)com(dot)au>
Cc:	pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-17 02:15:37
Message-ID:	200506170215.j5H2Fb104554@candle.pha.pa.us
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

Gavin Sherry wrote:
> In January I was in Toronto with Jan, Tom and others and some ideas about
> vacuum were being discussed. The basic idea is that when we dirty pages we
> need we set a bit in a bitmap to say that the page has been dirty. A
> convenient place to do this is when we are writing dirty buffers out to
> disk. In many situations, this can happen inside the bgwriter meaning that
> there should be little contention for this bitmap. Of course, individual
> backends may be writing pages out and would have to account for the
> dirty pages at that point.
>
> Now this bitmap can occur on a per heap segment basis (ie, per 1 GB heap
> file). You only need 2 pages for the bitmap to represent all the pages in
> the segment, which is fairly nice. When vacuum is run, instead of visiting
> every page, it would see which pages have been dirtied in the bitmap and
> visit only pages. With large tables and small numbers of modified
> tuples/pages, the effect this change would have would be pretty
> impressive.

Added to TODO:

* Create a bitmap of pages that need vacuuming

Instead of sequentially scanning the entire table, have the background
writer or some other process record pages that have expired rows, then
VACUUM can look at just those pages rather than the entire table. In
the event of a system crash, the bitmap would probably be invalidated.

From:	"Thomas F(dot) O'Connell" <tfo(at)sitening(dot)com>
To:	Gavin Sherry <swm(at)linuxworld(dot)com(dot)au>
Cc:	Alvaro Herrera <alvherre(at)surnet(dot)cl>, "Matthew T(dot) O'Connor" <matthew(at)zeut(dot)net>, Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>, pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-17 04:55:34
Message-ID:	CD96413A-E1FF-465A-B532-0C935DCE8D47@sitening.com
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

Gavin,

For the record, I don't consider myself a PostgreSQL newbie, nor do I
manage any 2 TB databases (much less tables), but I do have an
unusual production use case: thousands (> 10,000) of tables, many of
them inherited, and many of them with hundreds of thousands (a few
with millions) of rows.

Honestly, creating crontab vacuum management for this scenario would
be a nightmare, and pg_autovacuum has been a godsend. Considering the
recent revelations of O(n^2) iterations over table lists in the
current versions and the stated and apparent ease with which this
problem could be solved by integrating the basic functionality of
pg_autovacuum into the backend, I can personally attest to there
being real-world use cases that would benefit tremendously from
integrated autovacuum.

A few months ago, I attempted to solve the wrong problem by
converting a hardcoded threshold into another command-line option. If
I had spotted the O(n^2) problem, I might've spent the time working
on it then instead of the new command-line option. I suppose it's
possible that I'll head down this road anyway if it looks like
integrated pg_autovacuum is going to be put on hold indefinitely
after this discussion.

Anyway, just wanted to throw out some food for thought for the
practicality of a tool like pg_autovacuum.

--
Thomas F. O'Connell
Co-Founder, Information Architect
Sitening, LLC

Strategic Open Source: Open Your i™

http://www.sitening.com/
110 30th Avenue North, Suite 6
Nashville, TN 37203-6320
615-260-0005

On Jun 16, 2005, at 5:22 PM, Gavin Sherry wrote:

> On Thu, 16 Jun 2005, Alvaro Herrera wrote:
>
>
>> On Thu, Jun 16, 2005 at 04:20:34PM +1000, Gavin Sherry wrote:
>>
>>
>>> 2) By no fault of its own, autovacuum's level of granularity is
>>> the table
>>> level. For people dealing with non-trivial amounts of data (and
>>> we're not
>>> talking gigabytes or terabytes here), this is a serious drawback.
>>> Vacuum
>>> at peak times can cause very intense IO bursts -- even with the
>>> enhancements in 8.0. I don't think the solution to the problem is
>>> to give
>>> users the impression that it is solved and then vacuum their
>>> tables during
>>> peak periods. I cannot stress this enough.
>>>
>>
>> People running systems with petabyte-sized tables can disable
>> autovacuum
>> for those tables, and leave it running for the rest. Then they can
>> schedule whatever maintenance they see fit on their gigantic tables.
>> Trying to run a database with more than a dozen gigabytes of data
>> without expert advice (or at least reading the manual) would be
>> extremely stupid anyway.
>>
>
> As I've said a few times, I'm not concerned about such users. I'm
> concerned about users with some busy tables of a few hundred
> megabytes. I
> still don't think VACUUM at arbitary times on such tables is suitable.
>
> Thanks,
>
> Gavin

From:	"Joshua D(dot) Drake" <jd(at)commandprompt(dot)com>
To:	Josh Berkus <josh(at)agliodbs(dot)com>
Cc:	pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-17 07:00:59
Message-ID:	42B2752B.6090402@commandprompt.com
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

Josh Berkus wrote:
> Josh,
>
>
>>Just my own two cents. First I am not knocking the work that has been on
>>autovacuum. I am sure that it was a leap on its own to get it to work.
>>However I will say that I just don't see the reason for it.
>
>
> I've personally seen at least a dozen user requests for "autovacuum in the
> backend", and had this conversation about 1,100 times:
>
> NB: "After a week, my database got really slow."
> Me: "How often are you running VACUUM ANALYZE?"
> NB: "Running what?"

Can't argue that except... RTFM ;). I am not saying it doesn't have a
validity. I am just saying that if you actually pay attention to
PostgreSQL and maintain it, you don't need it ;)

Sincerely,

Joshua D. Drake

From:	"Joshua D(dot) Drake" <jd(at)commandprompt(dot)com>
To:	"Matthew T(dot) O'Connor" <matthew(at)zeut(dot)net>
Cc:	Russell Smith <mr-russ(at)pws(dot)com(dot)au>, Alvaro Herrera <alvherre(at)surnet(dot)cl>, Gavin Sherry <swm(at)linuxworld(dot)com(dot)au>, Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>, pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-17 07:03:26
Message-ID:	42B275BE.1000701@commandprompt.com
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

Matthew T. O'Connor wrote:
> Joshua D. Drake wrote:
>
>> Just my own two cents. First I am not knocking the work that has been
>> on autovacuum. I am sure that it was a leap on its own to get it to
>> work. However I will say that I just don't see the reason for it.
>
>
>
> The major reasons for autovacuum as I see it are as follows:
>
> * Reduces administrative overhead having to keep track of what tables
> need to be vacuumed how often.

Creates more overhead and thus reduces performance.

> * Reduces the total amount of time the system spends vacuuming since it
> only vacuums when needed.

Can be easily done with cron.

> * Keeps stats up-to-date automatically

Which can be done with cron

> * Eliminates newbie confusion

RTFM

> * Eliminates one of the criticisms that the public has against
> PostgreSQL (justifed or not)

Agreed.

Just so everyone knows from the get go here. I am purposely playing a
little devils advocate. Autovacuum has some drawbacks. I think we should
be **publicly** aware of them before we pursue integration.

Heaven knows it would make my life easier if it was integrated but anyway...

Sincerely,

Joshua D. Drake

>
> Also, as VACUUM improves, autovacuum will improve with it.
>

From:	Hans-Jürgen Schönig <postgres(at)cybertec(dot)at>
To:	"Joshua D(dot) Drake" <jd(at)commandprompt(dot)com>
Cc:	"Matthew T(dot) O'Connor" <matthew(at)zeut(dot)net>, Russell Smith <mr-russ(at)pws(dot)com(dot)au>, Alvaro Herrera <alvherre(at)surnet(dot)cl>, Gavin Sherry <swm(at)linuxworld(dot)com(dot)au>, Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>, pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-17 07:16:16
Message-ID:	42B278C0.7050703@cybertec.at
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

>> * Reduces the total amount of time the system spends vacuuming since
>> it only vacuums when needed.
>
>
> Can be easily done with cron.
>
>> * Keeps stats up-to-date automatically
>
>
> Which can be done with cron
>
>> * Eliminates newbie confusion
>
>
> RTFM
>
>> * Eliminates one of the criticisms that the public has against
>> PostgreSQL (justifed or not)
>
>
> Agreed.

I few weeks ago I have set up a database with more than 1.800 tables
(some complex business thing). inventing a clever cron-vacuum strategy
is almost impossible (or at least very painful). there should be a
mechanism (fortunately there is pg_autovacuum) to make this a bit more
practical.

in case of small databases this is not an issue.
small is always simple. complex and large are the major challenges.

best regards,

hans

--
Cybertec Geschwinde u Schoenig
Schoengrabern 134, A-2020 Hollabrunn, Austria
Tel: +43/664/393 39 74
www.cybertec.at, www.postgresql.at

From:	Andreas Pflug <pgadmin(at)pse-consulting(dot)de>
To:	Qingqing Zhou <zhouqq(at)cs(dot)toronto(dot)edu>
Cc:	pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-17 08:26:46
Message-ID:	42B28946.8080002@pse-consulting.de
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

Qingqing Zhou wrote:
> "Tom Lane" <tgl(at)sss(dot)pgh(dot)pa(dot)us> writes:
>
>>Yeah --- a libpq-based solution is not what I think of as integrated at
>>all, because it cannot do anything that couldn't be done by the existing
>>external autovacuum process. About all you can buy there is having the
>>postmaster spawn the autovacuum process, which is slightly more
>>convenient to use but doesn't buy any real new functionality.
>>
>
>
> One reason of not using lib-pq is that this one has to wait for the
> completion of each vacuum (we don't has async execution in libpq right?),

There *is* async execution in libpq, and it works.

Regards,
Andreas

From:	Russell Smith <mr-russ(at)pws(dot)com(dot)au>
To:	"Joshua D(dot) Drake" <jd(at)commandprompt(dot)com>
Cc:	"Matthew T(dot) O'Connor" <matthew(at)zeut(dot)net>, Alvaro Herrera <alvherre(at)surnet(dot)cl>, Gavin Sherry <swm(at)linuxworld(dot)com(dot)au>, Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>, pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-17 08:56:17
Message-ID:	200506171856.17584.mr-russ@pws.com.au
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

> > The major reasons for autovacuum as I see it are as follows:
> >
> > * Reduces administrative overhead having to keep track of what tables
> > need to be vacuumed how often.
>
> Creates more overhead and thus reduces performance.
Or reduces vacuum overhead because the vacuum strategy is much better than
it was when you used cron. Especially as people get a chance to improve autovac.

> > * Reduces the total amount of time the system spends vacuuming since it
> > only vacuums when needed.
>
> Can be easily done with cron.
Can you do partial table vacuums with CRON?
You can work out the smartest time to vacuum with cron? I thought it just scheduled tasks at certain times.

>
> > * Keeps stats up-to-date automatically
>
> Which can be done with cron
An what is the management strategy for adjusting analyze when things change that you weren't aware of? (eg, big table changes that were unexpected)

>
> > * Eliminates newbie confusion
>
> RTFM

RTFM = MySQL in a lot of cases to be honest.

>
> > * Eliminates one of the criticisms that the public has against
> > PostgreSQL (justifed or not)
>
> Agreed.
This is really the same as the previous RTFM question/response. People criticise because vacuum is foreign to them,
and for newbie's that equals too hard, next db please. As much as it is a technical issue, it's an advocacy issue too.

Plus we finally get XID wraparound protection. We finally decided that for 8.1 we needed some protection, which I think
Tom committed. This again may be a newbie thing. But there are a lot of newbies out there then. We've see on the lists
and on IRC this problem pop up a number of times. And people say "Why didn't it tell me", RTFM it's exactly what they want
to hear, or the fact they thought they read the manual, and missed understanding that bit.

>
>
> Just so everyone knows from the get go here. I am purposely playing a
> little devils advocate. Autovacuum has some drawbacks. I think we should
> be **publicly** aware of them before we pursue integration.

It does have a number of issues. But I feel the integration issue is being addressed with a very short term view.
Once it's integrated there are a lot of patches, tweaks and changes that just can't be made until it is integrated.
The usefulness of some of the vacuum ideas that have been presented in the past will be able to become a reality.
The dead space map is a perfect example. People have talked about it for most of the time I've been around.
But until we have an integrated vacuum none of that can really happen.
>
> Heaven knows it would make my life easier if it was integrated but anyway...
>
I understand these are not nessecarily Josh's view, but I thought I would offer comments on them.

> Sincerely,
>
> Joshua D. Drake
>
Regards

Russell Smith
>
>
>
> >
> > Also, as VACUUM improves, autovacuum will improve with it.
> >
Or because of autovacuum, vacuum and autovacuum will improve.

From:	Russell Smith <mr-russ(at)pws(dot)com(dot)au>
To:	Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>
Cc:	Gavin Sherry <swm(at)linuxworld(dot)com(dot)au>, pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-17 09:01:15
Message-ID:	200506171901.16148.mr-russ@pws.com.au
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

> Added to TODO:
>
> * Create a bitmap of pages that need vacuuming
>
> Instead of sequentially scanning the entire table, have the background
> writer or some other process record pages that have expired rows, then
> VACUUM can look at just those pages rather than the entire table. In
> the event of a system crash, the bitmap would probably be invalidated.
>
Further to this, is there any use case for allowing FSM, or this DSM to spill to disk
if the space fills up. It would allow the possibility of unusual changes to the db
to not loose space. You could just load part of the overflow from the disk back
int the FSM in memory and continue using free space.

Regards

Russell Smith

From:	Russell Smith <mr-russ(at)pws(dot)com(dot)au>
To:	"Matthew T(dot) O'Connor" <matthew(at)zeut(dot)net>
Cc:	Gavin Sherry <swm(at)linuxworld(dot)com(dot)au>, Alvaro Herrera <alvherre(at)surnet(dot)cl>, Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>, pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-17 09:05:01
Message-ID:	200506171905.01784.mr-russ@pws.com.au
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

> >4) Related to this, I guess, is that a user's FSM settings might be
> >completely inappropriate. The 'Just read the manual' or 'Just read the
> >logs' argument doesn't cut it, because the main argument for autovacuum in
> >the backend is that people do not and will not.
> >
> >
>
> Agreed, it doesn't solve all problems, and I'm not arguing that the
> integration of AV makes PostgreSQL newbie safe it just helps reduce the
> newbie problem. Again if the default FSM settings are inappropriate
> for a database then the user is probably doing something more
> complicated that a "my cat minka" database and will need to learn some
> tuning skills anyway.
>
> >5) It doesn't actually shrink tables -- ie, there's no VACUUM FULL. If
> >we're telling users about VACUUM less often than we are now, there's bound
> >to be bloating issues (see 4).
> >
> >
>
But what's stopping the implementation of a Partial VACUUM FULL, where we lock the table,
move enough blocks to shorten the relation so that there is say < 10% bloat, or whatever is
appropriate for that table. Or even just short the table a few block, and repeat the process
when you have some time too.

> Not totally true, regular VACUUM can shrink tables a little (I think
> only if there is free space at the end of the table it can cutoff
> without moving data around). But if AV is on and the settings are
> reasonable, then a table shouldn't bloat much or at all. Also, I don't
> think we are telling people to VACUUM less, in fact tables that need it
> will usually get VACUUM'd more, we are just telling the users that if
> they turn AV on, they don't have to manage all the VACUUMing.

Regards

Russell Smith

From:	Russell Smith <mr-russ(at)pws(dot)com(dot)au>
To:	Andreas Pflug <pgadmin(at)pse-consulting(dot)de>
Cc:	Qingqing Zhou <zhouqq(at)cs(dot)toronto(dot)edu>, pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-17 09:08:07
Message-ID:	200506171908.07610.mr-russ@pws.com.au
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

On Fri, 17 Jun 2005 06:26 pm, Andreas Pflug wrote:
> Qingqing Zhou wrote:
> > "Tom Lane" <tgl(at)sss(dot)pgh(dot)pa(dot)us> writes:
> >
> >>Yeah --- a libpq-based solution is not what I think of as integrated at
> >>all, because it cannot do anything that couldn't be done by the existing
> >>external autovacuum process. About all you can buy there is having the
> >>postmaster spawn the autovacuum process, which is slightly more
> >>convenient to use but doesn't buy any real new functionality.
> >>
> >
> >
> > One reason of not using lib-pq is that this one has to wait for the
> > completion of each vacuum (we don't has async execution in libpq right?),
>
> There *is* async execution in libpq, and it works.

I would have thought the main reasons for not using libpq means you are locked
into only using commands that are available to all users via SQL. If you don't use
libpq, you open up the ability to use functions that can make use of information available
to the backend, and to also run functions in a way that it is not possible to do via SQL.

Regards

Russell Smith.

From:	Gavin Sherry <swm(at)linuxworld(dot)com(dot)au>
To:	Russell Smith <mr-russ(at)pws(dot)com(dot)au>
Cc:	Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>, pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-17 09:16:14
Message-ID:	Pine.LNX.4.58.0506171913170.31550@linuxworld.com.au
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

On Fri, 17 Jun 2005, Russell Smith wrote:

> > Added to TODO:
> >
> > * Create a bitmap of pages that need vacuuming
> >
> > Instead of sequentially scanning the entire table, have the background
> > writer or some other process record pages that have expired rows, then
> > VACUUM can look at just those pages rather than the entire table. In
> > the event of a system crash, the bitmap would probably be invalidated.
> >
> Further to this, is there any use case for allowing FSM, or this DSM to spill to disk
> if the space fills up. It would allow the possibility of unusual changes to the db
> to not loose space. You could just load part of the overflow from the disk back
> int the FSM in memory and continue using free space.

FSM splilling to disk would be a problem. The reason is that when we need
to allocate an empty page, we hit the FSM first. If that operation becomes
disk bound, large updates and inserts are going to really suck from a
performance point of view.

The idea I discussed is disk backed, because its the first few pages of
every heap segment. This map doesn't mean that pages are free. It means
they've been modified.

Gavin

From:	Gavin Sherry <swm(at)linuxworld(dot)com(dot)au>
To:	Russell Smith <mr-russ(at)pws(dot)com(dot)au>
Cc:	"Matthew T(dot) O'Connor" <matthew(at)zeut(dot)net>, Alvaro Herrera <alvherre(at)surnet(dot)cl>, Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>, pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-17 09:18:30
Message-ID:	Pine.LNX.4.58.0506171917180.31550@linuxworld.com.au
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

On Fri, 17 Jun 2005, Russell Smith wrote:

> > >4) Related to this, I guess, is that a user's FSM settings might be
> > >completely inappropriate. The 'Just read the manual' or 'Just read the
> > >logs' argument doesn't cut it, because the main argument for autovacuum in
> > >the backend is that people do not and will not.
> > >
> > >
> >
> > Agreed, it doesn't solve all problems, and I'm not arguing that the
> > integration of AV makes PostgreSQL newbie safe it just helps reduce the
> > newbie problem. Again if the default FSM settings are inappropriate
> > for a database then the user is probably doing something more
> > complicated that a "my cat minka" database and will need to learn some
> > tuning skills anyway.
> >
> > >5) It doesn't actually shrink tables -- ie, there's no VACUUM FULL. If
> > >we're telling users about VACUUM less often than we are now, there's bound
> > >to be bloating issues (see 4).
> > >
> > >
> >
> But what's stopping the implementation of a Partial VACUUM FULL, where we lock the table,
> move enough blocks to shorten the relation so that there is say < 10% bloat, or whatever is
> appropriate for that table. Or even just short the table a few block, and repeat the process
> when you have some time too.

Its a question of where you start off from again. You cannot just say
'I've vacuumed the first 100 pages' because it could well have changed
underneath you.

Gavin

From:	"Matthew T(dot) O'Connor" <matthew(at)zeut(dot)net>
To:	"Joshua D(dot) Drake" <jd(at)commandprompt(dot)com>
Cc:	Josh Berkus <josh(at)agliodbs(dot)com>, pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-17 11:58:53
Message-ID:	42B2BAFD.6000001@zeut.net
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

Joshua D. Drake wrote:

> Josh Berkus wrote:
>
>> I've personally seen at least a dozen user requests for "autovacuum
>> in the backend", and had this conversation about 1,100 times:
>>
>> NB: "After a week, my database got really slow."
>> Me: "How often are you running VACUUM ANALYZE?"
>> NB: "Running what?"
>
>
> Can't argue that except... RTFM ;). I am not saying it doesn't have a
> validity. I am just saying that if you actually pay attention to
> PostgreSQL and maintain it, you don't need it ;)

I think everyone on this list would agree with you. The only reason I
think the newbie protection is important (and I don't think it's the
most important reason for autovacuum) is that perception is reality to
some extent. Valid or not we still suffer from a reputation of being
more complicated and slower than mysql. Steps towards reducing /
eliminating that perception can only be good for us as I think lots of
developers make their first database decision based solely on their
perceptions and then just stick with what they know.

From:	"Matthew T(dot) O'Connor" <matthew(at)zeut(dot)net>
To:	"Joshua D(dot) Drake" <jd(at)commandprompt(dot)com>
Cc:	Russell Smith <mr-russ(at)pws(dot)com(dot)au>, Alvaro Herrera <alvherre(at)surnet(dot)cl>, Gavin Sherry <swm(at)linuxworld(dot)com(dot)au>, Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>, pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-17 12:04:35
Message-ID:	42B2BC53.9060507@zeut.net
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

Joshua D. Drake wrote:

> Matthew T. O'Connor wrote:
>
>> The major reasons for autovacuum as I see it are as follows:
>>
>> * Reduces administrative overhead having to keep track of what tables
>> need to be vacuumed how often.
>
>
> Creates more overhead and thus reduces performance.

In the general case, I disagree. Overall having your tables vacuumed
and analyzed only when needed and never when not needed can only reduce
system overhead. Granted there are limitations in the contrib version
of autovacuum, some of which go away in the integrated case.

>> * Reduces the total amount of time the system spends vacuuming since
>> it only vacuums when needed.
>
> Can be easily done with cron.

Really? What happens when your load / usage patterns change? When a
table is added that gets heavily used?

>> * Keeps stats up-to-date automatically
>
>
> Which can be done with cron

Same response as above.

>> * Eliminates newbie confusion
>
>
> RTFM

;-)

>> * Eliminates one of the criticisms that the public has against
>> PostgreSQL (justifed or not)
>
>
> Agreed.
>
>
> Just so everyone knows from the get go here. I am purposely playing a
> little devils advocate. Autovacuum has some drawbacks. I think we should
> be **publicly** aware of them before we pursue integration.

Understood.

From:	"Matthew T(dot) O'Connor" <matthew(at)zeut(dot)net>
To:	Russell Smith <mr-russ(at)pws(dot)com(dot)au>
Cc:	"Joshua D(dot) Drake" <jd(at)commandprompt(dot)com>, Alvaro Herrera <alvherre(at)surnet(dot)cl>, Gavin Sherry <swm(at)linuxworld(dot)com(dot)au>, Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>, pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-17 12:11:50
Message-ID:	42B2BE06.1080409@zeut.net
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

Russell Smith wrote:

>>>* Reduces the total amount of time the system spends vacuuming since it
>>>only vacuums when needed.
>>>
>>>
>>Can be easily done with cron.
>>
>>
>Can you do partial table vacuums with CRON?
>You can work out the smartest time to vacuum with cron? I thought it just scheduled tasks at certain times.
>
>

To be fair, autovacuum can't do partial table vacuums either, in fact
nothing can right now. Perhaps someday something like this will be
feasible.

>>>* Eliminates one of the criticisms that the public has against
>>>PostgreSQL (justifed or not)
>>>
>>>
>>Agreed.
>>
>>
>This is really the same as the previous RTFM question/response. People criticise because vacuum is foreign to them,
>and for newbie's that equals too hard, next db please. As much as it is a technical issue, it's an advocacy issue too.
>
>

This bullet point is absolutely an advocacy issue. Every developer that
says "next db please" will probably not come back to PostgreSQL for
quite some time, thus bolstering the userbase of the competition and
reducing the userbase of PostgreSQL.

>Plus we finally get XID wraparound protection. We finally decided that for 8.1 we needed some protection, which I think
>Tom committed. This again may be a newbie thing. But there are a lot of newbies out there then. We've see on the lists
>and on IRC this problem pop up a number of times. And people say "Why didn't it tell me", RTFM it's exactly what they want
>to hear, or the fact they thought they read the manual, and missed understanding that bit.
>
>

I think this point hasn't been stressed enough. With nested
transactions these days (not to mention faster hardware) I can see XID
wraparound becoming a much bigger issue.

From:	"Matthew T(dot) O'Connor" <matthew(at)zeut(dot)net>
To:	Russell Smith <mr-russ(at)pws(dot)com(dot)au>
Cc:	Andreas Pflug <pgadmin(at)pse-consulting(dot)de>, Qingqing Zhou <zhouqq(at)cs(dot)toronto(dot)edu>, pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-17 12:21:23
Message-ID:	42B2C043.9060400@zeut.net
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

Russell Smith wrote:

>On Fri, 17 Jun 2005 06:26 pm, Andreas Pflug wrote:
>
>
>>Qingqing Zhou wrote:
>>
>>
>>>One reason of not using lib-pq is that this one has to wait for the
>>>completion of each vacuum (we don't has async execution in libpq right?),
>>>
>>>
>>There *is* async execution in libpq, and it works.
>>
>>
>
>I would have thought the main reasons for not using libpq means you are locked
>into only using commands that are available to all users via SQL. If you don't use
>libpq, you open up the ability to use functions that can make use of information available
>to the backend, and to also run functions in a way that it is not possible to do via SQL.
>

Right now we don't really want to fire off more than one VACUUM at a
time since it will create a more substantial IO storm issue than we
already have with vacuum. Perhaps with the introduction of vacuum delay
settings and table spaces we could / should rethink this, but for now
it's the easiest way to go.

As for the standard SQL issue, FSM data (or anything else we might want)
could be exported via regular SQL via some type of super-user only
system function. So that isn't really the issue. I don't remember all
the details but you can look at the discussion when my patch was
rejected (around July of 2004). People just didn't like including libpq
into the backend for reasons I don't remember. I don't *think* this is
up for discussion, I *think* autovacuum has to work without libpq if it
is going to be accepted.

Matthew

From:	Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To:	"Matthew T(dot) O'Connor" <matthew(at)zeut(dot)net>
Cc:	Russell Smith <mr-russ(at)pws(dot)com(dot)au>, Andreas Pflug <pgadmin(at)pse-consulting(dot)de>, Qingqing Zhou <zhouqq(at)cs(dot)toronto(dot)edu>, pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-17 13:54:52
Message-ID:	20092.1119016492@sss.pgh.pa.us
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

"Matthew T. O'Connor" <matthew(at)zeut(dot)net> writes:
> ... People just didn't like including libpq
> into the backend for reasons I don't remember.

One reason I can think of is that there would be global-symbol conflicts
--- libpq has copies of some backend routines, but they are not
identical.

In any case, the argument that's being made here is that an integrated
autovac would be simple and newbie-friendly. I think it's impossible
for a libpq-using autovac to ever be invisible to the DBA, if only
because he has to configure it with an appropriate username/password,
make sure pg_hba.conf will actually let it into every database, etc.
There are way too many foot-guns in that scenario for my taste.

regards, tom lane

From:	Rod Taylor <pg(at)rbt(dot)ca>
To:	"Joshua D(dot) Drake" <jd(at)commandprompt(dot)com>
Cc:	pgsql-hackers(at)postgresql(dot)org, Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>, Gavin Sherry <swm(at)linuxworld(dot)com(dot)au>, Alvaro Herrera <alvherre(at)surnet(dot)cl>, Russell Smith <mr-russ(at)pws(dot)com(dot)au>, "Matthew T(dot) O'Connor" <matthew(at)zeut(dot)net>
Subject:	Re: Autovacuum in the backend
Date:	2005-06-17 14:35:48
Message-ID:	1119018948.78366.263.camel@home
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

On Fri, 2005-06-17 at 00:03 -0700, Joshua D. Drake wrote:
> Matthew T. O'Connor wrote:
> > Joshua D. Drake wrote:
> >
> >> Just my own two cents. First I am not knocking the work that has been
> >> on autovacuum. I am sure that it was a leap on its own to get it to
> >> work. However I will say that I just don't see the reason for it.
> >
> >
> >
> > The major reasons for autovacuum as I see it are as follows:
> >
> > * Reduces administrative overhead having to keep track of what tables
> > need to be vacuumed how often.
>
> Creates more overhead and thus reduces performance.

Agreed, except I have a number of tables that are over scheduled with
cron because about once a week (at a random time) they do go through
heavy enough churn to require the more frequent vacuum, but the rest of
the time they're fairly quiet. It's not ideal, but autovacuum was the
difference between a 10 minute vacuum 24/7, or a 10 minute vacuum on the
table for the heavy 2 or so hours which randomly appear.

In the case of unexpected or uneven loads, cron isn't particularly
useful.

From:	Josh Berkus <josh(at)agliodbs(dot)com>
To:	"Joshua D(dot) Drake" <jd(at)commandprompt(dot)com>
Cc:	pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-17 16:18:53
Message-ID:	200506170918.53983.josh@agliodbs.com
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

Josh,

> Just so everyone knows from the get go here. I am purposely playing a
> little devils advocate.

Well, please stop it. We discussed AV over a year ago when we ran out of time
to integrate it with 8.0. This disucussion now is hindering any discussion
of what needs to be *done* to integrate it. This isn't a debating society.

Folks, I'm sorry to be so grumpy about this, but so far 80% of the posts on
this thread have been re-arguing a discussion we had in 2004. Which isn't
helping Alvaro get anything done.

--
Josh Berkus
Aglio Database Solutions
San Francisco

From:	"Matthew T(dot) O'Connor" <matthew(at)zeut(dot)net>
To:	Christopher Browne <cbbrowne(at)acm(dot)org>
Cc:	pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-17 16:21:44
Message-ID:	42B2F898.7060806@zeut.net
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

Christopher Browne wrote:

>swm(at)linuxworld(dot)com(dot)au (Gavin Sherry) wrote:
>
>
>>I guess the main point is, if something major like this ships in the
>>backend it says to users that the problem has gone away. pg_autovacuum is
>>a good contrib style solution: it addresses a problem users have and
>>attempts to solve it the way other users might try and solve it. When you
>>consider it in the backend, it looks like a workaround. I think users are
>>better served by solving the real problem.
>>
>>
>
>Hear, hear!
>
>It seems to me that the point in time at which it is *really*
>appropriate to put this into the backend is when the new GUC variable
>"dead_tuple_map_size" (akin to FSM) is introduced, and there is a new
>sort of 'VACUUM DEAD TUPLES' command which goes through the DTPM (Dead
>Tuple Page Map).
>
>In THAT case, there would be the ability to do a VACUUM on the "dead
>bits" of the table that consists of 50M rows without having to go
>through the 49M rows that haven't been touched in months.
>
>

This will make VACUUM less painful, but it doesn't eliminate the need /
desire for autovacuum. I agree this would be good, but I see it as a
separate issue.

From:	"Jim C(dot) Nasby" <decibel(at)decibel(dot)org>
To:	"Matthew T(dot) O'Connor" <matthew(at)zeut(dot)net>
Cc:	Christopher Browne <cbbrowne(at)acm(dot)org>, pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Autovacuum in the backend
Date:	2005-06-18 00:04:36
Message-ID:	20050618000436.GL44623@decibel.org
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

On Fri, Jun 17, 2005 at 12:21:44PM -0400, Matthew T. O'Connor wrote:
<snip more stuff about how we need to track pages with dead tuples>
>
> This will make VACUUM less painful, but it doesn't eliminate the need /
> desire for autovacuum. I agree this would be good, but I see it as a
> separate issue.

Not only is it a seperate issue, but there's also no way it could
possibly be done for 8.1, whereas autovacuum most likely will make it
into 8.1. Additionally, there are noted improvements that come about by
putting autovacuum in the backend instead of leaving it in contrib. And
as others have mentioned numerous times, any improvements made to vacuum
will help out vacuum as well. There simply isn't a downside to putting
it in the backend that anyone's brought up.

Autovacuum was originally scheduled for 8.0. There's been plans to put
it in the backend for close to 2 years now. There's no reason at all to
push it out any farther.
--
Jim C. Nasby, Database Consultant decibel(at)decibel(dot)org
Give your computer some brain candy! www.distributed.net Team #1828

Windows: "Where do you want to go today?"
Linux: "Where do you want to go tomorrow?"
FreeBSD: "Are you guys coming, or what?"

From:	Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>
To:	Gavin Sherry <swm(at)linuxworld(dot)com(dot)au>
Cc:	pgsql-hackers(at)postgresql(dot)org
Subject:	Space reuse and autovacuum
Date:	2005-06-21 20:06:52
Message-ID:	200506212006.j5LK6qR23759@candle.pha.pa.us
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-general pgsql-hackers

Let's look at what TODO has for vacuum and how autovacuum fits that:

Vacuum
======

* Improve speed with indexes

For large table adjustements during vacuum, it is faster to reindex
rather than update the index.

This is something we should figure out how to do automatically.

* Reduce lock time by moving tuples with read lock, then write
lock and truncate table

Moved tuples are invisible to other backends so they don't require a
write lock. However, the read lock promotion to write lock could lead
to deadlock situations.

The deadlock problem here seems bad.

* -Add a warning when the free space map is too small

Done.

* Maintain a map of recently-expired rows

This allows vacuum to target specific pages for possible free space
without requiring a sequential scan.

I think of this as a secondary Free-space-map (FSM), where instead of
recording rows/pages that have free space, we records rows/pages that
have expired rows that might be free for reuse if all transactions where
the are visible are completed.

* Auto-fill the free space map by scanning the buffer cache or by
checking pages written by the background writer

This could be used to populate the secondary FSM above.

This is an alternative to the FSM that tracks _all_ possible free space
rather than just a limited amount like a seconary FSM in shared memory.

* Auto-vacuum
o Move into the backend code
o Use free-space map information to guide refilling
o Do VACUUM FULL if table is nearly empty?

It seems no matter what TODO items we complete above, we will need some
type of automatic vacuum to direct filling the free space map. It might
be done using a different method than a sequential scan vacuum, but it
will be needed, so we are good to integrate autovacuum then improve how
it does its job in future releases.