Re: Poor memory context performance in large hash joins

From: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To: Jeff Janes <jeff(dot)janes(at)gmail(dot)com>
Cc: pgsql-hackers <pgsql-hackers(at)postgresql(dot)org>
Subject: Re: Poor memory context performance in large hash joins
Date: 2017-02-24 18:59:45
Message-ID: 8652.1487962785@sss.pgh.pa.us
Lists: pgsql-hackers

Jeff Janes <jeff(dot)janes(at)gmail(dot)com> writes:
> On Thu, Feb 23, 2017 at 2:28 PM, Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us> wrote:
>> Maybe it's time to convert that to a doubly-linked list.

> I don't think that would help. You would need some heuristic to guess
> whether the chunk you are looking for is near the front, or near the end.

Uh, what? In a doubly-linked list, you can remove an element in O(1)
time; you don't need any searching. It basically becomes
item->prev->next = item->next;
item->next->prev = item->prev;
modulo possible special cases for the head and tail elements.
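
Spelled out as a sketch (hypothetical field and function names here, not
the actual aset.c structures), the unlink with the head case handled
would look about like this:

    /* Hypothetical doubly-linked block header; names are illustrative only. */
    typedef struct BlockHdr
    {
        struct BlockHdr *prev;      /* previous block in the context, or NULL */
        struct BlockHdr *next;      /* next block in the context, or NULL */
    } BlockHdr;

    /* Unlink "item" from the list whose head is *headp, in O(1). */
    static void
    unlink_block(BlockHdr **headp, BlockHdr *item)
    {
        if (item->prev)
            item->prev->next = item->next;
        else
            *headp = item->next;    /* item was the head of the list */
        if (item->next)
            item->next->prev = item->prev;
    }

No walk over the block list is involved at all, which is the point.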

>> Although if the
>> hash code is producing a whole lot of requests that are only a bit bigger
>> than the separate-block threshold, I'd say It's Doing It Wrong. It should
>> learn to aggregate them into larger requests.

> Right now it is using compiled-in 32KB chunks. Should it use something
> like max(32KB, work_mem/128) instead?

I'd say it should double the size of the request each time. That's what
we do in most places.
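
Roughly like this (just a sketch with made-up names and an assumed 32KB
starting size; the real thing would also want a sane upper cap, e.g.
relative to work_mem):

    #include <stddef.h>

    /* Illustrative only: grow each chunk request geometrically. */
    #define CHUNK_START_SIZE  (32 * 1024)   /* assumed 32KB starting point */

    static size_t
    next_chunk_size(size_t prev_size, size_t needed)
    {
        size_t  newsize = (prev_size > 0) ? prev_size * 2 : CHUNK_START_SIZE;

        /* make sure the request is at least big enough for the tuple */
        while (newsize < needed)
            newsize *= 2;
        return newsize;
    }

That keeps the number of separate blocks logarithmic in the total memory
consumed, rather than linear.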

regards, tom lane
