Re: Why DISTINCT ... DESC is slow?

From: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To: "Brandon Aiken" <BAiken(at)winemantech(dot)com>
Cc: "Erik Jones" <erik(at)myemma(dot)com>, pgsql-general(at)postgresql(dot)org
Subject: Re: Why DISTINCT ... DESC is slow?
Date: 2006-12-12 17:30:07
Message-ID: 5163.1165944607@sss.pgh.pa.us
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-general

"Brandon Aiken" <BAiken(at)winemantech(dot)com> writes:
> If you have, say, an index(x, y) then that index will often double as an
> index(x). It will generally not double as an index(y).

It's not hard to understand why, if you think about the sort ordering of
a double-column index:

x y

1 1
1 2
1 3
2 1
2 2
2 3
3 1
...

All similar values of x are brought together, so scanning the index for
x alone works just the same as it would in a one-column index ... the
index entries are bigger so it's marginally less efficient, but only
marginally. On the other hand, the entries for a specific value or
range of y will be scattered all over the index, so it's almost useless
to use the index for a search on y alone.

As of PG 8.1 or 8.2 (I forget) the optimizer will *consider* using such
an index for a y-only query, but it'll nearly always decide it's a bad
idea.

regards, tom lane

In response to

Responses

Browse pgsql-general by date

  From Date Subject
Next Message Belinda M. Giardine 2006-12-12 17:39:00 Re: date comparisons
Previous Message Paul Silveira 2006-12-12 17:28:32 Re: shell script to populate array values