Re: select count() out of memory

From: tfinneid(at)student(dot)matnat(dot)uio(dot)no
To: "Tom Lane" <tgl(at)sss(dot)pgh(dot)pa(dot)us>
Cc: tfinneid(at)student(dot)matnat(dot)uio(dot)no, "Alvaro Herrera" <alvherre(at)commandprompt(dot)com>, "Gregory Stark" <stark(at)enterprisedb(dot)com>, pgsql-general(at)postgresql(dot)org
Subject: Re: select count() out of memory
Date: 2007-10-25 16:34:39
Message-ID: 47727.134.32.140.234.1193330079.squirrel@webmail.uio.no
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-general

> tfinneid(at)student(dot)matnat(dot)uio(dot)no writes:
>>> In that case, why use partitions at all? They are simple independent
>>> tables.
>
>> For two reasons,
>> - the data logically belongs together.
>> - because its more practical to create tables as childs of a parent
>> table
>> than as independent tables.
>> - changes to the table is applied to all partitions, and prohibits
>> tables with different dd.
>> - performing the create operation does not require the source code to
>> contain the ddl of the parent table.
>
> In other words, you really should have only one table; they aren't
> independent. What you need to do is dial down your ideas of how many
> partitions are reasonable to have.

Yes, but no. Each partition represents a chunk of information on a
discrete timeline. So there is no point in grouping it all into a single
table, because the access pattern is to only access data from a specific
point in time, i.e. a single partition, usually the latest. Since the
amount of data is so big, approx 3MB per second, and each partition needs
to be indexed before the clients start reading the data (in the same
second). I find its better to use partitions, even though I am not
actually using it.

regards

thomas

In response to

Responses

Browse pgsql-general by date

  From Date Subject
Next Message Roberts, Jon 2007-10-25 16:35:32 Re: subversion support?
Previous Message Erik Jones 2007-10-25 16:27:09 Re: select count() out of memory