Re: how to implement selectivity injection in postgresql

Lists: pgsql-hackers
From: Rajmohan C <csrajmohan(at)gmail(dot)com>
To: pgsql-hackers(at)postgresql(dot)org
Subject: how to implement selectivity injection in postgresql
Date: 2014-08-13 16:33:31
Message-ID: CAHaqV0gRg3uQgVdF=W9KASSu0bxMtd5Hp1wjzLMsJ8A3DHeRFQ@mail.gmail.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Lists: pgsql-hackers

SELECT c1, c2, c3, FROM T1, T2, T3
WHERE T1.x = T2.x AND
T2.y=T3.y AND
T1.x >= ? selectivity 0.00001 AND
T2.y > ? selectivity 0.5 AND
T3.z = ? selectivity 0.2 AND
T3.w = ?

I need to implement Selectivity injection as shown in above query in
PostgreSQL by which we can inject selectivity of each predicate or at least
selectivity at relation level directly as part of query. Is there any
on-going work on this front? If there is no ongoing work on this, How
should I start implementing this feature?


From: Euler Taveira <euler(at)timbira(dot)com(dot)br>
To: Rajmohan C <csrajmohan(at)gmail(dot)com>, pgsql-hackers(at)postgresql(dot)org
Subject: Re: how to implement selectivity injection in postgresql
Date: 2014-08-13 18:13:43
Message-ID: 53EBAAD7.3050503@timbira.com.br
Views: Raw Message | Whole Thread | Download mbox | Resend email
Lists: pgsql-hackers

On 13-08-2014 13:33, Rajmohan C wrote:
> SELECT c1, c2, c3, FROM T1, T2, T3
> WHERE T1.x = T2.x AND
> T2.y=T3.y AND
> T1.x >= ? selectivity 0.00001 AND
> T2.y > ? selectivity 0.5 AND
> T3.z = ? selectivity 0.2 AND
> T3.w = ?
>
> I need to implement Selectivity injection as shown in above query in
> PostgreSQL by which we can inject selectivity of each predicate or at least
> selectivity at relation level directly as part of query. Is there any
> on-going work on this front? If there is no ongoing work on this, How
> should I start implementing this feature?
>
Do you want to force a selectivity? Why don't you let the optimizer do
it for you? Trust me it can do it better than you. If you want to force
those selectivities for an academic exercise, that information belongs
to catalog or could be SET before query starts.

Start reading backend/optimizer/README.

--
Euler Taveira Timbira - http://www.timbira.com.br/
PostgreSQL: Consultoria, Desenvolvimento, Suporte 24x7 e Treinamento


From: Euler Taveira <euler(at)timbira(dot)com(dot)br>
To: Rajmohan C <csrajmohan(at)gmail(dot)com>
Cc: Pgsql Hackers <pgsql-hackers(at)postgresql(dot)org>
Subject: Re: how to implement selectivity injection in postgresql
Date: 2014-08-13 18:49:44
Message-ID: 53EBB348.5000005@timbira.com.br
Views: Raw Message | Whole Thread | Download mbox | Resend email
Lists: pgsql-hackers

On 13-08-2014 15:28, Rajmohan C wrote:
> Yeah. I have to do it for my academic research. Is it available in
> catalogs? It is to be computed at run time from the predicates in the query
> right?
>
The selectivity information is available at runtime. See
backend/optimizer/path/costsize.c.

--
Euler Taveira Timbira - http://www.timbira.com.br/
PostgreSQL: Consultoria, Desenvolvimento, Suporte 24x7 e Treinamento


From: Jeff Janes <jeff(dot)janes(at)gmail(dot)com>
To: Rajmohan C <csrajmohan(at)gmail(dot)com>
Cc: pgsql-hackers <pgsql-hackers(at)postgresql(dot)org>
Subject: Re: how to implement selectivity injection in postgresql
Date: 2014-08-13 19:58:17
Message-ID: CAMkU=1wYryU1R51u4wZBimH3K90q3NSrugUw2Xka1DOcqwVb5g@mail.gmail.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Lists: pgsql-hackers

On Wed, Aug 13, 2014 at 9:33 AM, Rajmohan C <csrajmohan(at)gmail(dot)com> wrote:

> SELECT c1, c2, c3, FROM T1, T2, T3
> WHERE T1.x = T2.x AND
> T2.y=T3.y AND
> T1.x >= ? selectivity 0.00001 AND
> T2.y > ? selectivity 0.5 AND
> T3.z = ? selectivity 0.2 AND
> T3.w = ?
>
> I need to implement Selectivity injection as shown in above query in
> PostgreSQL by which we can inject selectivity of each predicate or at least
> selectivity at relation level directly as part of query. Is there any
> on-going work on this front? If there is no ongoing work on this, How
> should I start implementing this feature?
>

My plan was to create a boolean operator which always returns true, but
estimates its own selectivity as 0.001 (or better yet, parameterize that
selectivity estimate, if that is possible) which can be inserted into the
place where lower selectivity estimate is needed with an "AND".

And another one that always returns false, but has a selectivity estimate
near 1, for use in OR conditions when the opposite change is needed.

I think that will be much easier to do than to extent the grammar. And
probably more acceptable to the core team.

I think this could be done simply in an extension module without even
needing to change the core code, but I never got around to investigating
exactly how.

Cheers,

Jeff


From: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To: Jeff Janes <jeff(dot)janes(at)gmail(dot)com>
Cc: Rajmohan C <csrajmohan(at)gmail(dot)com>, pgsql-hackers <pgsql-hackers(at)postgresql(dot)org>
Subject: Re: how to implement selectivity injection in postgresql
Date: 2014-08-13 22:25:02
Message-ID: 7883.1407968702@sss.pgh.pa.us
Views: Raw Message | Whole Thread | Download mbox | Resend email
Lists: pgsql-hackers

Jeff Janes <jeff(dot)janes(at)gmail(dot)com> writes:
> On Wed, Aug 13, 2014 at 9:33 AM, Rajmohan C <csrajmohan(at)gmail(dot)com> wrote:
>> I need to implement Selectivity injection as shown in above query in
>> PostgreSQL by which we can inject selectivity of each predicate or at least
>> selectivity at relation level directly as part of query.

> My plan was to create a boolean operator which always returns true, but
> estimates its own selectivity as 0.001 (or better yet, parameterize that
> selectivity estimate, if that is possible) which can be inserted into the
> place where lower selectivity estimate is needed with an "AND".

That doesn't seem especially helpful/convenient, especially not if you're
trying to affect the estimation of a join clause. The last discussion
I remember on this subject was to invent a special dummy function that
would be understood by the planner and would work sort of like
__builtin_expect() in gcc:

selectivity(condition bool, probability float8) returns bool

Semantically the function would just return its first argument (and
the function itself would disappear at runtime) but the planner would
take the value of the second argument as a selectivity estimate overriding
whatever it might've otherwise deduced about the "condition". So
you'd use it like

SELECT ... WHERE selectivity(id = 42, 0.0001)

and get functionally the same results as for

SELECT ... WHERE id = 42

but with a different selectivity estimate for that WHERE condition.

regards, tom lane