From: Hans-Jürgen Schönig <postgres(at)cybertec(dot)at>
To: pgsql-hackers Hackers <pgsql-hackers(at)postgresql(dot)org>
Subject: ugly locking corner cases ...
Date: 2010-10-04 11:02:04
Message-ID: 2A2C985E-1A64-42A9-BE67-431B3AC8DEC8@cybertec.at
Lists: pgsql-hackers

hello all ...

it seems we have found a fairly nasty problem.
imagine a long transaction which piles up tens of thousands of locks (as counted in pg_locks) inside the same transaction by doing some tasty savepoints, WITH HOLD cursors and so on.
in this case we see that a normal count issued in a second database connection takes ages. in a practical case we did a plain seq scan in connection 2: instead of 262 ms (cached case) the runtime headed north linearly with the number of locks taken by connection 1, and in an extreme case it took around 1.5 hours (at some hundreds of thousands of pg_locks entries).

i tracked down the issue quickly and made the following profile (at 10k locks or so):

Flat profile:

Each sample counts as 0.01 seconds.
  %   cumulative   self               self     total
 time   seconds   seconds     calls   s/call   s/call  name
32.49      6.01      6.01  98809118     0.00     0.00  SimpleLruReadPage_ReadOnly
26.97     11.00      4.99  98837761     0.00     0.00  LWLockAcquire
21.89     15.05      4.05  98837761     0.00     0.00  LWLockRelease
 8.70     16.66      1.61  98789370     0.00     0.00  SubTransGetParent
 4.38     17.47      0.81     19748     0.00     0.00  SubTransGetTopmostTransaction
 2.41     17.92      0.45  98851951     0.00     0.00  TransactionIdPrecedes
 0.59     18.03      0.11                               LWLockAssign
 0.54     18.13      0.10                               LWLockConditionalAcquire
 0.46     18.21      0.09     19748     0.00     0.00  TransactionLogFetch
 0.38     18.28      0.07                               SimpleLruReadPage
 0.27     18.33      0.05                               SubTransSetParent
 0.05     18.34      0.01    136778     0.00     0.00  AllocSetAlloc
 0.05     18.35      0.01     42996     0.00     0.00  slot_deform_tuple
 0.05     18.36      0.01     42660     0.00     0.00  TransactionIdIsCurrentTransactionId

it seems we are running into some nasty shared buffer / locking contention here, and the number of calls explodes (this profiling info comes from a seq scan on a 500,000 row table - 400 MB or so).
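
for reference, the top entries line up with roughly this code path - a simplified sketch paraphrased from src/backend/access/transam/subtrans.c, not the literal source, so details may differ by version:

/*
 * Sketch of SubTransGetTopmostTransaction(): for every tuple whose xmin is
 * a subtransaction xid, the scan walks the parent chain one level at a
 * time.  Each SubTransGetParent() call reads one pg_subtrans page via the
 * SLRU (SimpleLruReadPage_ReadOnly) under an lwlock, which is why those
 * three call counters in the profile track each other so closely.
 */
TransactionId
SubTransGetTopmostTransaction(TransactionId xid)
{
    TransactionId parentXid = xid;
    TransactionId previousXid = xid;

    while (TransactionIdIsValid(parentXid))
    {
        previousXid = parentXid;
        if (TransactionIdPrecedes(parentXid, TransactionXmin))
            break;              /* old enough to be treated as top-level */
        parentXid = SubTransGetParent(parentXid);   /* one SLRU page lookup */
    }

    return previousXid;         /* topmost xid of the chain */
}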

i am thinking about a workaround for this problem.

many thanks,

hans

--
Cybertec Schönig & Schönig GmbH
Gröhrmühlgasse 26
A-2700 Wiener Neustadt
Web: http://www.postgresql-support.de


From: Heikki Linnakangas <heikki(dot)linnakangas(at)enterprisedb(dot)com>
To: Hans-Jürgen Schönig <postgres(at)cybertec(dot)at>
Cc: pgsql-hackers Hackers <pgsql-hackers(at)postgresql(dot)org>
Subject: Re: ugly locking corner cases ...
Date: 2010-10-04 11:23:21
Message-ID: 4CA9B929.7090604@enterprisedb.com
Lists: pgsql-hackers

On 04.10.2010 14:02, Hans-Jürgen Schönig wrote:
> it seems we have found a fairly nasty problem.
> imagine a long transaction which piles up tens of thousands of locks (as counted in pg_locks) inside the same transaction by doing some tasty savepoints, WITH HOLD cursors and so on.
> in this case we see that a normal count issued in a second database connection takes ages. in a practical case we did a plain seq scan in connection 2: instead of 262 ms (cached case) the runtime headed north linearly with the number of locks taken by connection 1, and in an extreme case it took around 1.5 hours (at some hundreds of thousands of pg_locks entries).
>
> i tracked down the issue quickly and made the following profile (at 10k locks or so):
>
> Flat profile:
>
> Each sample counts as 0.01 seconds.
>   %   cumulative   self               self     total
>  time   seconds   seconds     calls   s/call   s/call  name
> 32.49      6.01      6.01  98809118     0.00     0.00  SimpleLruReadPage_ReadOnly
> 26.97     11.00      4.99  98837761     0.00     0.00  LWLockAcquire
> 21.89     15.05      4.05  98837761     0.00     0.00  LWLockRelease
>  8.70     16.66      1.61  98789370     0.00     0.00  SubTransGetParent
>  4.38     17.47      0.81     19748     0.00     0.00  SubTransGetTopmostTransaction
>  2.41     17.92      0.45  98851951     0.00     0.00  TransactionIdPrecedes
>  0.59     18.03      0.11                               LWLockAssign
>  0.54     18.13      0.10                               LWLockConditionalAcquire
>  0.46     18.21      0.09     19748     0.00     0.00  TransactionLogFetch
>  0.38     18.28      0.07                               SimpleLruReadPage
>  0.27     18.33      0.05                               SubTransSetParent
>  0.05     18.34      0.01    136778     0.00     0.00  AllocSetAlloc
>  0.05     18.35      0.01     42996     0.00     0.00  slot_deform_tuple
>  0.05     18.36      0.01     42660     0.00     0.00  TransactionIdIsCurrentTransactionId
>
> it seems we are running into some nasty shared buffer / locking contention here, and the number of calls explodes (this profiling info comes from a seq scan on a 500,000 row table - 400 MB or so).

That doesn't seem related to the lock manager. Is the long-running
transaction inserting a lot of tuples (by INSERT or UPDATE) into the
same table that the seqscan scans, with a lot of different
subtransaction xids? That profile looks like the seqscan is spending a
lot of time swapping pg_subtrans pages in and out of the slru buffers.
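
To illustrate, each of those pg_subtrans lookups goes through roughly this
path - a simplified sketch paraphrased from SimpleLruReadPage_ReadOnly() in
src/backend/access/transam/slru.c, not the literal code:

int
SimpleLruReadPage_ReadOnly(SlruCtl ctl, int pageno, TransactionId xid)
{
    SlruShared  shared = ctl->shared;
    int         slotno;

    /* Fast path: look for the page while holding the control lock shared */
    LWLockAcquire(shared->ControlLock, LW_SHARED);
    for (slotno = 0; slotno < shared->num_slots; slotno++)
    {
        if (shared->page_number[slotno] == pageno &&
            shared->page_status[slotno] == SLRU_PAGE_VALID)
            return slotno;      /* caller releases the lock */
    }

    /* Miss: retake the lock exclusively and do a real read/replacement */
    LWLockRelease(shared->ControlLock);
    LWLockAcquire(shared->ControlLock, LW_EXCLUSIVE);
    return SimpleLruReadPage(ctl, pageno, true, xid);
}

With only a handful of slru slots for pg_subtrans, a scan that has to
resolve xids spread over many pages misses constantly, which matches the
LWLockAcquire/LWLockRelease and SimpleLruReadPage_ReadOnly counts in your
profile.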

Increasing NUM_SUBTRANS_BUFFERS should help. A more sophisticated
solution would be to allocate slru buffers (for clog and other slru
caches as well) dynamically from shared_buffers. That's been discussed
before but no-one has gotten around to it.
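
(For reference, a sketch of what that means in practice - assuming the
constant still lives where I remember it, in src/include/access/subtrans.h;
please check your tree:)

/* src/include/access/subtrans.h - approximate location and stock value */
#define NUM_SUBTRANS_BUFFERS    32      /* shared SLRU buffers for pg_subtrans */

/*
 * To experiment, bump the value, e.g.
 *
 *     #define NUM_SUBTRANS_BUFFERS    128
 *
 * and rebuild the server.  There is no GUC for this; it is a compile-time
 * constant.
 */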

--
Heikki Linnakangas
EnterpriseDB http://www.enterprisedb.com


From: Hans-Jürgen Schönig <postgres(at)cybertec(dot)at>
To: Heikki Linnakangas <heikki(dot)linnakangas(at)enterprisedb(dot)com>
Cc: pgsql-hackers Hackers <pgsql-hackers(at)postgresql(dot)org>
Subject: Re: ugly locking corner cases ...
Date: 2010-10-04 12:37:13
Message-ID: 05FC981A-2BC9-474F-A9C4-5081E93CAF52@cybertec.at
Lists: pgsql-hackers

On Oct 4, 2010, at 1:23 PM, Heikki Linnakangas wrote:

> On 04.10.2010 14:02, Hans-Jürgen Schönig wrote:
>> [... original report and profile snipped - quoted in full above ...]
>
> That doesn't seem related to the lock manager. Is the long-running transaction inserting a lot of tuples (by INSERT or UPDATE) into the same table that the seqscan scans, with a lot of different subtransaction xids? That profile looks like the seqscan is spending a lot of time swapping pg_subtrans pages in and out of the slru buffers.
>
> Increasing NUM_SUBTRANS_BUFFERS should help. A more sophisticated solution would be to allocate slru buffers (for clog and other slru caches as well) dynamically from shared_buffers. That's been discussed before but no-one has gotten around to it.
>
> --
> Heikki Linnakangas
> EnterpriseDB http://www.enterprisedb.com

hello ...

yeah, it seems this solves the problem.
i had a closer look at the SQL trace and did some more profiling, and this was indeed the case.

many thanks for the quick hint.

hans

--
Cybertec Schönig & Schönig GmbH
Gröhrmühlgasse 26
A-2700 Wiener Neustadt
Web: http://www.postgresql-support.de


From: Alvaro Herrera <alvherre(at)commandprompt(dot)com>
To: Hans-Jürgen Schönig <postgres(at)cybertec(dot)at>
Cc: pgsql-hackers Hackers <pgsql-hackers(at)postgresql(dot)org>
Subject: Re: ugly locking corner cases ...
Date: 2010-10-04 15:15:13
Message-ID: 1286204945-sup-9631@alvh.no-ip.org
Lists: pgsql-hackers

Excerpts from Hans-Jürgen Schönig's message of Mon Oct 04 07:02:04 -0400 2010:

> i tracked down the issue quickly and made the following profile (at 10k locks or so):
>
> Flat profile:
>
> Each sample counts as 0.01 seconds.
>   %   cumulative   self               self     total
>  time   seconds   seconds     calls   s/call   s/call  name
> 32.49      6.01      6.01  98809118     0.00     0.00  SimpleLruReadPage_ReadOnly
> 26.97     11.00      4.99  98837761     0.00     0.00  LWLockAcquire
> 21.89     15.05      4.05  98837761     0.00     0.00  LWLockRelease
>  8.70     16.66      1.61  98789370     0.00     0.00  SubTransGetParent
>  4.38     17.47      0.81     19748     0.00     0.00  SubTransGetTopmostTransaction
>  2.41     17.92      0.45  98851951     0.00     0.00  TransactionIdPrecedes
>  0.59     18.03      0.11                               LWLockAssign
>  0.54     18.13      0.10                               LWLockConditionalAcquire
>  0.46     18.21      0.09     19748     0.00     0.00  TransactionLogFetch
>  0.38     18.28      0.07                               SimpleLruReadPage
>  0.27     18.33      0.05                               SubTransSetParent
>  0.05     18.34      0.01    136778     0.00     0.00  AllocSetAlloc
>  0.05     18.35      0.01     42996     0.00     0.00  slot_deform_tuple
>  0.05     18.36      0.01     42660     0.00     0.00  TransactionIdIsCurrentTransactionId
>
> it seems we are running into some nasty shared buffer / locking contention here, and the number of calls explodes (this profiling info comes from a seq scan on a 500,000 row table - 400 MB or so).

Isn't this really a problem with subtransactions rather than locks,
though? I wonder if the problem would go away if the subtransactions
were cacheable in the PGPROC subxid cache (i.e. try enlarging
PGPROC_MAX_CACHED_SUBXIDS by a couple of orders of magnitude), or if the
pg_subtrans SLRU cache were bigger (i.e. try enlarging
NUM_SUBTRANS_BUFFERS).
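
(For context, a simplified sketch of why the subxid cache matters,
paraphrased from memory of XidInMVCCSnapshot() in
src/backend/utils/time/tqual.c - not the literal code, details may differ:)

/*
 * Once a backend has more than PGPROC_MAX_CACHED_SUBXIDS (64, defined in
 * src/include/storage/proc.h) open subtransaction xids, they no longer fit
 * in its shared PGPROC array.  Snapshots taken while that is the case are
 * marked "suboverflowed", and visibility checks can no longer rely on the
 * snapshot's in-memory subxid list - they fall back to pg_subtrans.
 */
if (snapshot->suboverflowed)
{
    /* slow path: walk pg_subtrans to map the xid to its top-level xid */
    xid = SubTransGetTopmostTransaction(xid);
    /* ... then compare the top-level xid against the snapshot ... */
}
else
{
    /* fast path: search the snapshot's in-memory subxip array,
     * no pg_subtrans access needed */
}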

--
Álvaro Herrera <alvherre(at)commandprompt(dot)com>
The PostgreSQL Company - Command Prompt, Inc.
PostgreSQL Replication, Consulting, Custom Development, 24x7 support