Re: Completely broken replica after PANIC: WAL contains references to invalid pages

From: Andres Freund <andres(at)2ndquadrant(dot)com>
To: Sergey Konoplev <gray(dot)ru(at)gmail(dot)com>
Cc: Simon Riggs <simon(at)2ndquadrant(dot)com>, pgsql-bugs <pgsql-bugs(at)postgresql(dot)org>, Maxim Boguk <maxim(dot)boguk(at)gmail(dot)com>, Максим Панченко <Panchenko(at)gw(dot)tander(dot)ru>, Толстенко Илья <tolstenko_iv(at)gw(dot)tander(dot)ru>, Сизов Сергей Павлович <sizov_sp(at)gw(dot)tander(dot)ru>, Соболев Виталий Анатольевич <sobolev_va(at)gw(dot)tander(dot)ru>
Subject: Re: Completely broken replica after PANIC: WAL contains references to invalid pages
Date: 2013-11-04 09:48:42
Message-ID: 20131104094842.GH3567@awork2.anarazel.de
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-bugs

On 2013-11-01 00:35:18 -0700, Sergey Konoplev wrote:
> On Tue, Apr 2, 2013 at 11:26 AM, Andres Freund <andres(at)2ndquadrant(dot)com> wrote:
> > The attached patch fixes this although I don't like the way it knowledge of the
> > point up to which StartupSUBTRANS zeroes pages is handled.
>
> So, after half a year the same failure has happened again on the same
> replica, but now patched with the Andres' patch (9.2.4 + the patch)
> that was supposed to fix it.

I think that's actually a different bug - what the patch was fixing was
the clog/subtrans errors, not the "reference to invalid pages".

I've submitted a fix for at least one cause causing that independently
from this bug which is included in 9.2.5
(17fa4c321ccf9693de406faffe6b235e949aa25f). Note thought that that fix
needed a fix iff using a cassert enabled build
(4da24f12e63313b7dbb6b3e3d0317e04045df636) which is *not* in 9.2.5.

Greetings,

Andres Freund

--
Andres Freund http://www.2ndQuadrant.com/
PostgreSQL Development, 24x7 Support, Training & Services

In response to

Responses

Browse pgsql-bugs by date

  From Date Subject
Next Message Andres Freund 2013-11-04 10:12:21 Re: [BUGS] BUG #8573: int4range memory consumption
Previous Message Tom Lane 2013-11-03 20:41:52 Re: psql security fail?