Re: Detecting corrupt table

Lists: pgsql-general
From: David Larochelle <dlarochelle(at)cyber(dot)law(dot)harvard(dot)edu>
To: pgsql-general(at)postgresql(dot)org
Subject: Detecting corrupt table
Date: 2012-04-17 22:05:38
Message-ID: CABipw1mtbhkYJZwo2QZ7HLc3+YadW2XA_Jc5XD6=-8dKh0Uj4w@mail.gmail.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Lists: pgsql-general

Our database has some corrupt tables and I'm trying to figure out what data
can be salvaged and what needs to be restored from backup or regenerated.

Initially I tried running select count(*) on all user tables. While this
did detect some corrupt tables, it missed others. For example, I was able
to run count(*) on a table but then got an error while trying to back it up.

pg_dump: Error message from server: ERROR: missing chunk number 0 for
toast value 368243665 in pg_toast_284730161
pg_dump: The command was: COPY public.stories (stories_id, media_id, url,
guid, title, description, publish_date, collect_date, story_texts_id,
full_text_rss) TO stdout;

Is there a simple way to determine which parts of the database are corrupt?
I'm currently running a script to back up each table individually using
something like the following:

psql -c "select tablename from pg_tables where tableowner = 'db_user' ORDER
by tablename " | tail -n +3 | head -n -2 | xargs -n 1 -i pg_dump --verbose
--table={} --file={}_.dump

But I'm worried that this approach will also miss database corruption and
was wondering if anyone has other suggestions.

Thanks,

David


From: Willy-Bas Loos <willybas(at)gmail(dot)com>
To: David Larochelle <dlarochelle(at)cyber(dot)law(dot)harvard(dot)edu>
Cc: pgsql-general(at)postgresql(dot)org
Subject: Re: Detecting corrupt table
Date: 2012-04-19 07:38:25
Message-ID: CAHnozTgX4gZsWzXs7jQ2F9QkXp93ZgQGziG5xA0x81hgRweK2A@mail.gmail.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Lists: pgsql-general

you might want to log any errors resulting from pg_dump and then grep
through them to verify. or you could record the exit status ( $? ) for each
pg_dump command.

I was also thinking about how to check if something malformed your data on
disk. I could think of some ways to do that, but it doesn't look like you
are looking for that.

hth

WBL

On Wed, Apr 18, 2012 at 12:05 AM, David Larochelle <
dlarochelle(at)cyber(dot)law(dot)harvard(dot)edu> wrote:

> Our database has some corrupt tables and I'm trying to figure out what
> data can be salvaged and what needs to be restored from backup or
> regenerated.
>
> Initially I tried running select count(*) on all user tables. While this
> did detect some corrupt tables, it missed others. For example, I was able
> to run count(*) on a table but then got an error while trying to back it up.
>
>
> pg_dump: Error message from server: ERROR: missing chunk number 0 for
> toast value 368243665 in pg_toast_284730161
> pg_dump: The command was: COPY public.stories (stories_id, media_id, url,
> guid, title, description, publish_date, collect_date, story_texts_id,
> full_text_rss) TO stdout;
>
>
>
> Is there a simple way to determine which parts of the database are
> corrupt? I'm currently running a script to back up each table individually
> using something like the following:
>
> psql -c "select tablename from pg_tables where tableowner = 'db_user'
> ORDER by tablename " | tail -n +3 | head -n -2 | xargs -n 1 -i pg_dump
> --verbose --table={} --file={}_.dump
>
>
> But I'm worried that this approach will also miss database corruption and
> was wondering if anyone has other suggestions.
>
> Thanks,
>
>
> David
>
>
>

--
"Patriotism is the conviction that your country is superior to all others
because you were born in it." -- George Bernard Shaw