Re: Expose checkpoint start/finish times into SQL.

From: Greg Smith <gsmith(at)gregsmith(dot)com>
To: pgsql-patches(at)postgresql(dot)org
Subject: Re: Expose checkpoint start/finish times into SQL.
Date: 2008-04-04 07:01:06
Message-ID: Pine.GSO.4.64.0804040251110.9874@westnet.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-patches

On Fri, 4 Apr 2008, Tom Lane wrote:

> The actual advice I'd give to a DBA faced with such a case is to
> kill -ABRT the bgwriter and send the stack trace to -hackers.

And that's a perfect example of where they're trying to get to. They
didn't notice the problem until after the crash. The server didn't come
back up (busy processing WALs) and that downtime was caught by a
monitoring system. At that point it was too late to collect debugging
information on what was wrong inside the server processes that might have
given a clue what happened.

If they'd have noticed it while the server was up, perhaps because the
"last checkpoint" value hadn't changed in a long time (which seems like it
might be available via stats even if, as you say, the background writer is
out of its mind at that point), they could have done such a kill and
collected some actual useful information here. That's the theory at
least.

--
* Greg Smith gsmith(at)gregsmith(dot)com http://www.gregsmith.com Baltimore, MD

In response to

Responses

Browse pgsql-patches by date

  From Date Subject
Next Message Tom Lane 2008-04-04 07:14:28 Re: Expose checkpoint start/finish times into SQL.
Previous Message Tom Lane 2008-04-04 06:36:35 Re: Expose checkpoint start/finish times into SQL.