Quick Links

Deadlocks in HS (on 9.0 :( )

From:	Greg Stark <stark(at)mit(dot)edu>
To:	PostgreSQL-development <pgsql-hackers(at)postgresql(dot)org>
Subject:	Deadlocks in HS (on 9.0 :( )
Date:	2014-07-15 15:54:05
Message-ID:	CAM-w4HOfjp_Q2Qu20CFJt88fkq0b3xaKgD5e8JyYECT3JvsizA@mail.gmail.com
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

We've observed a 9.0 database have undetected deadlocks repeatedly in
hot standby mode.

I think what's happening is that autovacuum is kicking off a VACUUM of
some system catalogs -- it seems to usually be pg_statistics' toast
table actually. At the end of the vacuum it briefly gets the exclusive
lock to truncate the table. On the standby it replays that and records
the exclusive lock being taken. It then sees a cleanup record that
pauses replay because a HS standby transaction is running that can see
the xid being cleaned up. That transaction then blocks against the
exclusive lock and deadlocks against recovery.

We expect upgrading to 9.3 to fix the problem for us due to the xid
feedback mechanism. But is this still a known problem when feedback is
not enabled? And is it a problem we should try to find a backpatchable
fix for?

I'm pondering whether we really need to log the exclusive lock taken
by vacuum when truncating. Worst case is a scan is in progress,
perhaps we can make scans understand how to handle tables that have
been truncated concurrently? We could always make the truncate replay
command acquire the lock and release it itself right away.

--
greg

Responses

Re: Deadlocks in HS (on 9.0 :( ) at 2014-07-16 04:25:55 from Noah Misch
Re: Deadlocks in HS (on 9.0 :( ) at 2014-07-16 08:00:32 from Andres Freund

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	Tom Lane	2014-07-15 16:03:35	Re: SSL compression info in psql header
Previous Message	MauMau	2014-07-15 15:27:29	Re: [bug fix] pg_ctl always uses the same event source