why after increase the hash table partitions, TPMC decrease

Lists: pgsql-hackers
From: Xiaoyulei <xiaoyulei(at)huawei(dot)com>
To: "pgsql-hackers(at)postgresql(dot)org" <pgsql-hackers(at)postgresql(dot)org>
Subject: why after increase the hash table partitions, TPMC decrease
Date: 2014-09-02 08:39:06
Message-ID: E8870A2F6A4B1045B1C292B77EAB207C5BA392E6@SZXEMA501-MBX.china.huawei.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Lists: pgsql-hackers


We use benchmarksql to start tpcc test in postgresql 9.3.3.
Before test we set benchmarksql client number about 800. And we increase the hash partitions from 16 to 1024 , in order to reduce the hash partition locks competition.
We expect that after increase the number of partitions, reduces lock competition, TPMC should be increased. But the test results on the contrary, after modified to 1024, TPMC did not increase, but decrease.
Why such result?

We modify the following macro definition:
NUM_BUFFER_PARTITIONS 1024
LOG2_NUM_PREDICATELOCK_PARTITIONS 10
LOG2_NUM_LOCK_PARTITIONS 10


From: Amit Kapila <amit(dot)kapila16(at)gmail(dot)com>
To: Xiaoyulei <xiaoyulei(at)huawei(dot)com>
Cc: "pgsql-hackers(at)postgresql(dot)org" <pgsql-hackers(at)postgresql(dot)org>
Subject: Re: why after increase the hash table partitions, TPMC decrease
Date: 2014-09-02 11:30:50
Message-ID: CAA4eK1KDTCNsfq=wJTgPb3aJGj-k7SyxwT9jyvPR5P-wQoY3MA@mail.gmail.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Lists: pgsql-hackers

On Tue, Sep 2, 2014 at 2:09 PM, Xiaoyulei <xiaoyulei(at)huawei(dot)com> wrote:
>
>
>
> We use benchmarksql to start tpcc test in postgresql 9.3.3.
>
> Before test we set benchmarksql client number about 800. And we increase
the hash partitions from 16 to 1024 , in order to reduce the hash partition
locks competition.
>
> We expect that after increase the number of partitions, reduces lock
competition, TPMC should be increased.

I think you can expect some increase mainly if your test is
read only and you have sufficient RAM such that it can contain
all the data, for other cases there can be I/O due to which you
might not see any increase.

> But the test results on the contrary, after modified to 1024, TPMC did
not increase, but decrease.
>
> Why such result?
>
> We modify the following macro definition:
>
> NUM_BUFFER_PARTITIONS 1024
>
> LOG2_NUM_PREDICATELOCK_PARTITIONS 10
>
> LOG2_NUM_LOCK_PARTITIONS 10

Increasing these numbers might lead to error
"too many LWLocks taken", unless you increase
MAX_SIMUL_LWLOCKS. Once you can check the server
log if it contains any errors, that might lead to decrease in
performance.

Also another side effect would be that increasing above numbers
will lead to increase in shared memory usage.

With Regards,
Amit Kapila.
EnterpriseDB: http://www.enterprisedb.com


From: Xiaoyulei <xiaoyulei(at)huawei(dot)com>
To: Amit Kapila <amit(dot)kapila16(at)gmail(dot)com>
Cc: "pgsql-hackers(at)postgresql(dot)org" <pgsql-hackers(at)postgresql(dot)org>
Subject: 答复: [HACKERS] why after increase the hash table partitions, TPMC decrease
Date: 2014-09-02 11:50:42
Message-ID: E8870A2F6A4B1045B1C292B77EAB207C5BA39383@SZXEMA501-MBX.china.huawei.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Lists: pgsql-hackers

I already modified MAX_SIMUL_LWLOCKS to make sure it is enough.

Total RAM is 130G, and I set shared_buffers 16G, CPU and IO is not full. 50% CPUs are idle. So I think maybe pg is blocked by some place in itself.

发件人: Amit Kapila [mailto:amit(dot)kapila16(at)gmail(dot)com]
发送时间: 2014年9月2日 19:31
收件人: Xiaoyulei
抄送: pgsql-hackers(at)postgresql(dot)org
主题: Re: [HACKERS] why after increase the hash table partitions, TPMC decrease

On Tue, Sep 2, 2014 at 2:09 PM, Xiaoyulei <xiaoyulei(at)huawei(dot)com<mailto:xiaoyulei(at)huawei(dot)com>> wrote:
>
>
>
> We use benchmarksql to start tpcc test in postgresql 9.3.3.
>
> Before test we set benchmarksql client number about 800. And we increase the hash partitions from 16 to 1024 , in order to reduce the hash partition locks competition.
>
> We expect that after increase the number of partitions, reduces lock competition, TPMC should be increased.

I think you can expect some increase mainly if your test is
read only and you have sufficient RAM such that it can contain
all the data, for other cases there can be I/O due to which you
might not see any increase.

> But the test results on the contrary, after modified to 1024, TPMC did not increase, but decrease.
>
> Why such result?
>
> We modify the following macro definition:
>
> NUM_BUFFER_PARTITIONS 1024
>
> LOG2_NUM_PREDICATELOCK_PARTITIONS 10
>
> LOG2_NUM_LOCK_PARTITIONS 10

Increasing these numbers might lead to error
"too many LWLocks taken", unless you increase
MAX_SIMUL_LWLOCKS. Once you can check the server
log if it contains any errors, that might lead to decrease in
performance.

Also another side effect would be that increasing above numbers
will lead to increase in shared memory usage.

With Regards,
Amit Kapila.
EnterpriseDB: http://www.enterprisedb.com<http://www.enterprisedb.com/>


From: Amit Kapila <amit(dot)kapila16(at)gmail(dot)com>
To: Xiaoyulei <xiaoyulei(at)huawei(dot)com>
Cc: "pgsql-hackers(at)postgresql(dot)org" <pgsql-hackers(at)postgresql(dot)org>
Subject: Re: 答复: [HACKERS] why after increase the hash table partitions, TPMC decrease
Date: 2014-09-02 14:44:25
Message-ID: CAA4eK1LJ+Wii20Yrz5P_X47Hm9GRvCKdOpm=_4ipBoq3RwpDxg@mail.gmail.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Lists: pgsql-hackers

On Tue, Sep 2, 2014 at 5:20 PM, Xiaoyulei <xiaoyulei(at)huawei(dot)com> wrote:
>
> I already modified MAX_SIMUL_LWLOCKS to make sure it is enough.

Okay.

>
>
> Total RAM is 130G, and I set shared_buffers 16G, CPU and IO is not full.
50% CPUs are idle.

As far as I understand, benchmarkSQL measures an OLTP
workload performance which means it contains mix of reads
and writes, now I am not sure how you have identified that
increasing buffer partitions can improve the performance.
Have you used any profiling?

> So I think maybe pg is blocked by some place in itself.

Yeah, there's another lock BufFreelistLock which is a major
cause of contention in buffer allocation and for which already
work is in progress for 9.5. However as mentioned previously,
that will be useful mainly for Read only loads.

With Regards,
Amit Kapila.
EnterpriseDB: http://www.enterprisedb.com


From: Jeff Janes <jeff(dot)janes(at)gmail(dot)com>
To: Xiaoyulei <xiaoyulei(at)huawei(dot)com>
Cc: "pgsql-hackers(at)postgresql(dot)org" <pgsql-hackers(at)postgresql(dot)org>
Subject: Re: why after increase the hash table partitions, TPMC decrease
Date: 2014-09-02 22:49:57
Message-ID: CAMkU=1ym9DaKKS-ABZRh=vRJs9GLvgNQH9Un1vMiL6JWOWa0gg@mail.gmail.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Lists: pgsql-hackers

On Tue, Sep 2, 2014 at 1:39 AM, Xiaoyulei <xiaoyulei(at)huawei(dot)com> wrote:

>
>
> We use benchmarksql to start tpcc test in postgresql 9.3.3.
>
> Before test we set benchmarksql client number about 800. And we increase
> the hash partitions from 16 to 1024 , in order to reduce the hash
> partition locks competition.
>

Can you give the complete invocation parameters for benchmarksql?

How many CPUs/cores do you have?

> We expect that after increase the number of partitions, reduces
> lock competition, TPMC should be increased. But the test results on the
> contrary, after modified to 1024, TPMC did not increase, but decrease.
>
> Why such result?
>

Increasing the partition numbers would only help with contention that is
due to hash collision. If all of the contention is on, for example, the
root block of one index, then increasing the number partitions won't change
that, because that block is always going to map to a single partition.

Can you "perf" or some other profiling tool to find where your bottleneck
is?

Cheers,

Jeff


From: Xiaoyulei <xiaoyulei(at)huawei(dot)com>
To: Amit Kapila <amit(dot)kapila16(at)gmail(dot)com>
Cc: "pgsql-hackers(at)postgresql(dot)org" <pgsql-hackers(at)postgresql(dot)org>
Subject: RE: 答复: [HACKERS] why after increase the hash table partitions, TPMC decrease
Date: 2014-09-03 03:02:51
Message-ID: E8870A2F6A4B1045B1C292B77EAB207C5BA39739@SZXEMA501-MBX.china.huawei.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Lists: pgsql-hackers

benchmarSQL has about half reads. So I think it should be effective.

I don't think BufFreelistLock take much time, it just get a buffer from list. It should be very fast.

The test server has 2 CPUs and 12 cores in each CPU. 24 processor totally. CPU Idle time is over 50%. IO only 10%(data is in SSD)

I perf one process of pg. The hot spot is hash search. Attachment is perf data file.

3.63% postgres postgres [.] hash_search_with_hash_value
3.10% postgres postgres [.] AllocSetAlloc
3.04% postgres postgres [.] LWLockAcquire
2.73% postgres postgres [.] _bt_compare
2.66% postgres postgres [.] SearchCatCache
2.18% postgres postgres [.] ExecInitExpr
2.11% postgres postgres [.] GetSnapshotData
1.57% postgres postgres [.] PinBuffer
1.41% postgres postgres [.] XLogInsert
1.36% postgres libc-2.11.3.so [.] _int_malloc
1.31% postgres postgres [.] LWLockRelease
1.09% postgres libc-2.11.3.so [.] __GI_memcpy
0.89% postgres postgres [.] _bt_checkkeys
0.82% postgres libc-2.11.3.so [.] __strncpy_ssse3
0.81% postgres postgres [.] palloc
0.81% postgres postgres [.] fmgr_info_cxt_security
0.76% postgres postgres [.] equal
0.75% postgres postgres [.] s_lock
0.73% postgres postgres [.] heap_hot_search_buffer

>From: Amit Kapila [mailto:amit(dot)kapila16(at)gmail(dot)com]
>Sent: Tuesday, September 02, 2014 10:44 PM
>To: Xiaoyulei
>Cc: pgsql-hackers(at)postgresql(dot)org
>Subject: Re: 答复: [HACKERS] why after increase the hash table partitions, TPMC decrease
>
>On Tue, Sep 2, 2014 at 5:20 PM, Xiaoyulei <xiaoyulei(at)huawei(dot)com> wrote:
>>
>> I already modified MAX_SIMUL_LWLOCKS to make sure it is enough.
>
>Okay.
>
>>  
>>
>> Total RAM is 130G, and I set shared_buffers 16G, CPU and IO is not full. 50% CPUs are idle.
>
>As far as I understand, benchmarkSQL measures an OLTP
>workload performance which means it contains mix of reads
>and writes, now I am not sure how you have identified that
>increasing buffer partitions can improve the performance.
>Have you used any profiling?
>
>> So I think maybe pg is blocked by some place in itself.
>
>Yeah, there's another lock BufFreelistLock which is a major
>cause of contention in buffer allocation and for which already
>work is in progress for 9.5.  However as mentioned previously,
>that will be useful mainly for Read only loads.
>
>
>
>
>With Regards,
>Amit Kapila.
>EnterpriseDB: http://www.enterprisedb.com

Attachment Content-Type Size
perf.data.bz2 application/octet-stream 1.5 MB

From: Robert Haas <robertmhaas(at)gmail(dot)com>
To: Xiaoyulei <xiaoyulei(at)huawei(dot)com>
Cc: Amit Kapila <amit(dot)kapila16(at)gmail(dot)com>, "pgsql-hackers(at)postgresql(dot)org" <pgsql-hackers(at)postgresql(dot)org>
Subject: Re: [HACKERS] RE: 答复: [HACKERS] why after increase the hash table partitions, TPMC decrease
Date: 2014-09-04 13:07:33
Message-ID: CA+Tgmob8WYJ8HKZvTnYedqxfbG9nsA7Bsb-=C3U15dkkjUJtWw@mail.gmail.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Lists: pgsql-hackers

On Tue, Sep 2, 2014 at 11:02 PM, Xiaoyulei <xiaoyulei(at)huawei(dot)com> wrote:
> benchmarSQL has about half reads. So I think it should be effective.
>
> I don't think BufFreelistLock take much time, it just get a buffer from list. It should be very fast.

You're wrong. That list is usually empty right now; so it does a
linear scan of the buffer pool looking for a good eviction candidate.

> The test server has 2 CPUs and 12 cores in each CPU. 24 processor totally. CPU Idle time is over 50%. IO only 10%(data is in SSD)
>
> I perf one process of pg. The hot spot is hash search. Attachment is perf data file.

I think you need to pass -g to perf so that you get a call-graph
profile. Then you should be able to expand the entry for
hash_search_with_hash_value() and see what's calling it.

--
Robert Haas
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company