8.0.0 gmake check fails if on disk, passes on ram disk....

Lists: pgsql-bugs
From: Jeff Ross <jross(at)openvistas(dot)net>
To: pgsql-bugs(at)postgresql(dot)org
Subject: 8.0.0 gmake check fails if on disk, passes on ram disk....
Date: 2005-01-22 00:03:08
Message-ID: 41F1983C.4070003@openvistas.net
Views: Raw Message | Whole Thread | Download mbox | Resend email
Lists: pgsql-bugs

Hi,

If I put the source for 8.0.0 on disk (RAID 1) , configure, compile and
run gmake check, it fails with 33 errors, 30 of which are

! psql: could not send startup packet: Broken pipe

If I put the same source code up on a ram disk, configure and compile it
the same way, all 96 tests pass.

OS is OpenBSD 3.6.

I'm currently running 7.3.5, which I'd love to upgrade but I'm a little
leary until I can determine what is causing this error.

Any thoughts greatly appreciated!

Jeff Ross


From: Michael Fuhr <mike(at)fuhr(dot)org>
To: Jeff Ross <jross-dated-1106784195(dot)b5de5a(at)openvistas(dot)net>
Cc: pgsql-bugs(at)postgresql(dot)org
Subject: Re: 8.0.0 gmake check fails if on disk, passes on ram disk....
Date: 2005-01-24 09:00:30
Message-ID: 20050124090030.GA40495@winnie.fuhr.org
Views: Raw Message | Whole Thread | Download mbox | Resend email
Lists: pgsql-bugs

On Fri, Jan 21, 2005 at 05:03:08PM -0700, Jeff Ross wrote:

> If I put the source for 8.0.0 on disk (RAID 1) , configure, compile and
> run gmake check, it fails with 33 errors, 30 of which are
>
> ! psql: could not send startup packet: Broken pipe

Somebody else using OpenBSD reported a similar problem recently.
Here's part of that thread:

http://archives.postgresql.org/pgsql-bugs/2005-01/msg00099.php

> If I put the same source code up on a ram disk, configure and compile it
> the same way, all 96 tests pass.

Interesting. Is this behavior consistent? What's different 'twixt
the RAID disk and the RAM disk?

--
Michael Fuhr
http://www.fuhr.org/~mfuhr/


From: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To: Michael Fuhr <mike(at)fuhr(dot)org>
Cc: Jeff Ross <jross-dated-1106784195(dot)b5de5a(at)openvistas(dot)net>, pgsql-bugs(at)postgresql(dot)org
Subject: Re: 8.0.0 gmake check fails if on disk, passes on ram disk....
Date: 2005-01-24 15:24:56
Message-ID: 21305.1106580296@sss.pgh.pa.us
Views: Raw Message | Whole Thread | Download mbox | Resend email
Lists: pgsql-bugs

Michael Fuhr <mike(at)fuhr(dot)org> writes:
> On Fri, Jan 21, 2005 at 05:03:08PM -0700, Jeff Ross wrote:
>> If I put the same source code up on a ram disk, configure and compile it
>> the same way, all 96 tests pass.

> Interesting. Is this behavior consistent? What's different 'twixt
> the RAID disk and the RAM disk?

If the problem is at bottom a too low processes-per-user limit, as it
was for Jean-Gerard, then maybe the RAM-disk case passes because of
different timing details. This theory is a bit of a stretch though.

In any case, we're being shown the wrong output. What I want to know is
what appears in the postmaster log when these failures happen?

regards, tom lane


From: Jeff Ross <jross(at)openvistas(dot)net>
To: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>, pgsql-bugs(at)postgresql(dot)org
Subject: Re: 8.0.0 gmake check fails if on disk, passes on ram disk....
Date: 2005-01-24 16:03:04
Message-ID: 41F51C38.3000704@openvistas.net
Views: Raw Message | Whole Thread | Download mbox | Resend email
Lists: pgsql-bugs

Tom Lane wrote:
> Michael Fuhr <mike(at)fuhr(dot)org> writes:
>
>>On Fri, Jan 21, 2005 at 05:03:08PM -0700, Jeff Ross wrote:
>>
>>>If I put the same source code up on a ram disk, configure and compile it
>>>the same way, all 96 tests pass.
>
>
>>Interesting. Is this behavior consistent? What's different 'twixt
>>the RAID disk and the RAM disk?
>
>
> If the problem is at bottom a too low processes-per-user limit, as it
> was for Jean-Gerard, then maybe the RAM-disk case passes because of
> different timing details. This theory is a bit of a stretch though.
>
> In any case, we're being shown the wrong output. What I want to know is
> what appears in the postmaster log when these failures happen?
>
> regards, tom lane
>
> ---------------------------(end of broadcast)---------------------------
> TIP 9: the planner will ignore your desire to choose an index scan if your
> joining column's datatypes do not match
>
>
I did a make clean, make, and gmake check this morning. Fewer tests
failed (16 of the 96) on the raid1, and again no tests failed on the ram
disk.

Rather than post it in the e-mail, I've put the postmaster.log at

http://www.openvistas.net/postmaster.log

Thanks!


From: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To: Jeff Ross <jross-dated-1107014591(dot)2716b0(at)openvistas(dot)net>
Cc: pgsql-bugs(at)postgresql(dot)org
Subject: Re: 8.0.0 gmake check fails if on disk, passes on ram disk....
Date: 2005-01-24 16:16:54
Message-ID: 21885.1106583414@sss.pgh.pa.us
Views: Raw Message | Whole Thread | Download mbox | Resend email
Lists: pgsql-bugs

Jeff Ross <jross(at)openvistas(dot)net> writes:
> Rather than post it in the e-mail, I've put the postmaster.log at
> http://www.openvistas.net/postmaster.log

I see multiple occurrences of

LOG: could not fork new process for connection: Resource temporarily unavailable

so indeed your process limit is too low. It's curious that OpenBSD
seems more prone than other platforms to produce a "broken pipe" error
instead of reporting the error message sent back by the postmaster ...
but there is something of a race condition there, so we can't complain
too much.

I have no idea why running the tests off ramdisk would make a difference
in the probability of this failure, but it's academic. Raise your
process limit.

regards, tom lane


From: Jeff Ross <jross(at)openvistas(dot)net>
To: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
Cc: pgsql-bugs(at)postgresql(dot)org
Subject: Re: 8.0.0 gmake check fails if on disk, passes on ram disk....
Date: 2005-01-24 16:33:27
Message-ID: 41F52357.5000306@openvistas.net
Views: Raw Message | Whole Thread | Download mbox | Resend email
Lists: pgsql-bugs

Tom Lane wrote:
> Jeff Ross <jross(at)openvistas(dot)net> writes:
>
>>Rather than post it in the e-mail, I've put the postmaster.log at
>> http://www.openvistas.net/postmaster.log
>
>
> I see multiple occurrences of
>
> LOG: could not fork new process for connection: Resource temporarily unavailable
>
> so indeed your process limit is too low. It's curious that OpenBSD
> seems more prone than other platforms to produce a "broken pipe" error
> instead of reporting the error message sent back by the postmaster ...
> but there is something of a race condition there, so we can't complain
> too much.
>
> I have no idea why running the tests off ramdisk would make a difference
> in the probability of this failure, but it's academic. Raise your
> process limit.
>
> regards, tom lane
>
>

Thank you, Tom! That was indeed the problem.

Jeff Ross