Quick Links

Re: Open 7.4 items

Lists:	pgsql-hackerspgsql-patches

From:	Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>
To:	PostgreSQL-development <pgsql-hackers(at)postgreSQL(dot)org>
Subject:	Open 7.4 items
Date:	2003-10-05 04:41:21
Message-ID:	200310050441.h954fLN13943@candle.pha.pa.us
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-hackers pgsql-patches

P O S T G R E S Q L

7 . 4 O P E N I T E M S

Current at ftp://momjian.postgresql.org/pub/postgresql/open_items.

Changes
-------
Fix REVOKE ALL ON FUNCTION error when removing owner permissions
Improve speed of building of constraints during restore
What to do about exposing the list of possible SQLSTATE error codes

Documentation Changes
---------------------
Move release notes to SGML
Freeze message strings

--
Bruce Momjian | http://candle.pha.pa.us
pgman(at)candle(dot)pha(dot)pa(dot)us | (610) 359-1001
+ If your life is a hard drive, | 13 Roberts Road
+ Christ can be your backup. | Newtown Square, Pennsylvania 19073

From:	Stephan Szabo <sszabo(at)megazone(dot)bigpanda(dot)com>
To:	Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>
Cc:	PostgreSQL-development <pgsql-hackers(at)postgresql(dot)org>
Subject:	Re: Open 7.4 items
Date:	2003-10-05 05:50:29
Message-ID:	20031004224857.N57664@megazone.bigpanda.com
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-hackers pgsql-patches

On Sun, 5 Oct 2003, Bruce Momjian wrote:

> Improve speed of building of constraints during restore

Did we get consensus on what to do with this, whether we're doing
only the superuser option to not check, only speeding up fk constraint
checks by using a statement instead of the repeated calls, both, or
something else?

From:	Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To:	Stephan Szabo <sszabo(at)megazone(dot)bigpanda(dot)com>
Cc:	Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>, PostgreSQL-development <pgsql-hackers(at)postgresql(dot)org>
Subject:	Re: Open 7.4 items
Date:	2003-10-05 06:01:18
Message-ID:	28254.1065333678@sss.pgh.pa.us
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-hackers pgsql-patches

Stephan Szabo <sszabo(at)megazone(dot)bigpanda(dot)com> writes:
>> Improve speed of building of constraints during restore

> Did we get consensus on what to do with this,

Not really, it was still up in the air I thought. However, the
discussion will become moot if we don't have an implementation
of the faster-checking alternative to look at pretty soon. Do you
have something nearly ready to show?

regards, tom lane

From:	Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>
To:	Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
Cc:	Stephan Szabo <sszabo(at)megazone(dot)bigpanda(dot)com>, PostgreSQL-development <pgsql-hackers(at)postgresql(dot)org>
Subject:	Re: Open 7.4 items
Date:	2003-10-05 13:34:54
Message-ID:	200310051334.h95DYsQ20221@candle.pha.pa.us
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-hackers pgsql-patches

Tom Lane wrote:
> Stephan Szabo <sszabo(at)megazone(dot)bigpanda(dot)com> writes:
> >> Improve speed of building of constraints during restore
>
> > Did we get consensus on what to do with this,
>
> Not really, it was still up in the air I thought. However, the
> discussion will become moot if we don't have an implementation
> of the faster-checking alternative to look at pretty soon. Do you
> have something nearly ready to show?

Last I remember, there was the idea to make ALTER TABLE use a query to
check all constraints at once, rather than per row, _and_ there was an
idea to turn it off by the superuser. However, if we get the first one,
and it is fast, we might not even use the second one.

From:	Stephan Szabo <sszabo(at)megazone(dot)bigpanda(dot)com>
To:	Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
Cc:	Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>, PostgreSQL-development <pgsql-hackers(at)postgresql(dot)org>
Subject:	Re: Open 7.4 items
Date:	2003-10-05 18:40:10
Message-ID:	20031005112452.T74878@megazone.bigpanda.com
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-hackers pgsql-patches

On Sun, 5 Oct 2003, Tom Lane wrote:

> Stephan Szabo <sszabo(at)megazone(dot)bigpanda(dot)com> writes:
> >> Improve speed of building of constraints during restore
>
> > Did we get consensus on what to do with this,
>
> Not really, it was still up in the air I thought. However, the
> discussion will become moot if we don't have an implementation
> of the faster-checking alternative to look at pretty soon. Do you
> have something nearly ready to show?

It's not cleaned up, but yes. It appears to work for the simple tests I've
done and should fall back if the permissions don't work to do a single
query on both tables.

I wasn't sure what to do about some of the spi error conditions. For many
of them I'm just returning false now so that it will try the other
mechanism in case that might work. I'm not really sure if that's worth it,
or if I should just elog(ERROR) and give up.

Attachment	Content-Type	Size
single_check.diff	text/plain	14.6 KB

From:	Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To:	Stephan Szabo <sszabo(at)megazone(dot)bigpanda(dot)com>
Cc:	Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>, PostgreSQL-development <pgsql-hackers(at)postgresql(dot)org>
Subject:	Re: Open 7.4 items
Date:	2003-10-05 18:59:08
Message-ID:	9614.1065380348@sss.pgh.pa.us
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-hackers pgsql-patches

Stephan Szabo <sszabo(at)megazone(dot)bigpanda(dot)com> writes:
> On Sun, 5 Oct 2003, Tom Lane wrote:
>> Not really, it was still up in the air I thought. However, the
>> discussion will become moot if we don't have an implementation
>> of the faster-checking alternative to look at pretty soon. Do you
>> have something nearly ready to show?

> It's not cleaned up, but yes. It appears to work for the simple tests I've
> done and should fall back if the permissions don't work to do a single
> query on both tables.

Okay, I'll look this over, make any improvements I can think of, and
post another version in a couple of hours. One thing I can see I'd
like to do is merge the error-reporting code with the main line, so
that there's not any difference in the output format (I don't like
the induced change in the regression test outputs...)

regards, tom lane

From:	Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To:	Stephan Szabo <sszabo(at)megazone(dot)bigpanda(dot)com>
Cc:	Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>, Jan Wieck <JanWieck(at)Yahoo(dot)com>, PostgreSQL-development <pgsql-hackers(at)postgresql(dot)org>
Subject:	Re: Open 7.4 items
Date:	2003-10-05 21:03:24
Message-ID:	14775.1065387804@sss.pgh.pa.us
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-hackers pgsql-patches

Stephan Szabo <sszabo(at)megazone(dot)bigpanda(dot)com> writes:
> It's not cleaned up, but yes. It appears to work for the simple tests I've
> done and should fall back if the permissions don't work to do a single
> query on both tables.

Here's my code-reviewed version of the patch. Anyone else want to take
a look?

> I wasn't sure what to do about some of the spi error conditions. For many
> of them I'm just returning false now so that it will try the other
> mechanism in case that might work. I'm not really sure if that's worth it,
> or if I should just elog(ERROR) and give up.

I think you may as well keep it the same as the other RI routines and
just elog() on SPI error. If SPI is broken, the trigger procedure is
gonna fail too.

I changed that, consolidated the error-reporting code, and fixed a couple
other little issues, notably:

* The generated query applied ONLY to the FK table but not the PK table.
I assume this was just an oversight.

* The query is now run using SPI_execp_current and selecting "current"
snapshot. Without this, we could fail in a serializable transaction
if someone else has already committed changes to either relation.
For example:

create pk and fk tables;

begin serializable xact;
insert into pk values(1);
insert into fk values(1);

begin;
insert into fk values(2);
commit;

alter table fk add foreign key ...;

The ALTER will not be blocked from acquiring exclusive lock, since
T2 already committed. But if we run the query in the serializable
snapshot, it won't see the violating row fk=2.

The old trigger-based check avoids this error because the scan loop uses
SnapshotNow to select live rows from the FK table. There is a dual race
condition where T2 deletes a row from the PK table. In current CVS tip
this will be detected and reported as a serialization failure, because
T1 won't be able to get SELECT FOR UPDATE lock on the deleted row. With
the proposed patch you'll instead see a "no such key" failure, which I
think is fine, even though it nominally violates serializability.

Comments? Can anyone else do a code review (Jan??)?

regards, tom lane

Attachment	Content-Type	Size
unknown_filename	text/plain	10.7 KB

From:	Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>
To:	Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
Cc:	Stephan Szabo <sszabo(at)megazone(dot)bigpanda(dot)com>, Jan Wieck <JanWieck(at)Yahoo(dot)com>, PostgreSQL-development <pgsql-hackers(at)postgresql(dot)org>
Subject:	Re: Open 7.4 items
Date:	2003-10-05 21:07:50
Message-ID:	200310052107.h95L7oe23844@candle.pha.pa.us
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-hackers pgsql-patches

Wow, that's a heap of code --- that's my only comment. :-)

---------------------------------------------------------------------------

Tom Lane wrote:
> Stephan Szabo <sszabo(at)megazone(dot)bigpanda(dot)com> writes:
> > It's not cleaned up, but yes. It appears to work for the simple tests I've
> > done and should fall back if the permissions don't work to do a single
> > query on both tables.
>
> Here's my code-reviewed version of the patch. Anyone else want to take
> a look?
>
> > I wasn't sure what to do about some of the spi error conditions. For many
> > of them I'm just returning false now so that it will try the other
> > mechanism in case that might work. I'm not really sure if that's worth it,
> > or if I should just elog(ERROR) and give up.
>
> I think you may as well keep it the same as the other RI routines and
> just elog() on SPI error. If SPI is broken, the trigger procedure is
> gonna fail too.
>
> I changed that, consolidated the error-reporting code, and fixed a couple
> other little issues, notably:
>
> * The generated query applied ONLY to the FK table but not the PK table.
> I assume this was just an oversight.
>
> * The query is now run using SPI_execp_current and selecting "current"
> snapshot. Without this, we could fail in a serializable transaction
> if someone else has already committed changes to either relation.
> For example:
>
> create pk and fk tables;
>
> begin serializable xact;
> insert into pk values(1);
> insert into fk values(1);
>
> begin;
> insert into fk values(2);
> commit;
>
> alter table fk add foreign key ...;
>
> The ALTER will not be blocked from acquiring exclusive lock, since
> T2 already committed. But if we run the query in the serializable
> snapshot, it won't see the violating row fk=2.
>
> The old trigger-based check avoids this error because the scan loop uses
> SnapshotNow to select live rows from the FK table. There is a dual race
> condition where T2 deletes a row from the PK table. In current CVS tip
> this will be detected and reported as a serialization failure, because
> T1 won't be able to get SELECT FOR UPDATE lock on the deleted row. With
> the proposed patch you'll instead see a "no such key" failure, which I
> think is fine, even though it nominally violates serializability.
>
> Comments? Can anyone else do a code review (Jan??)?
>
> regards, tom lane
>

Content-Description: RIcheck.patch

> *** src/backend/commands/tablecmds.c.orig Thu Oct 2 15:24:52 2003
> --- src/backend/commands/tablecmds.c Sun Oct 5 16:29:51 2003
> ***************
> *** 3455,3460 ****
> --- 3455,3467 ----
> int count;
>
> /*
> + * See if we can do it with a single LEFT JOIN query. A FALSE result
> + * indicates we must proceed with the fire-the-trigger method.
> + */
> + if (RI_Initial_Check(fkconstraint, rel, pkrel))
> + return;
> +
> + /*
> * Scan through each tuple, calling RI_FKey_check_ins (insert trigger)
> * as if that tuple had just been inserted. If any of those fail, it
> * should ereport(ERROR) and that's that.
> *** src/backend/utils/adt/ri_triggers.c.orig Wed Oct 1 17:30:52 2003
> --- src/backend/utils/adt/ri_triggers.c Sun Oct 5 16:42:37 2003
> ***************
> *** 40,45 ****
> --- 40,46 ----
> #include "rewrite/rewriteHandler.h"
> #include "utils/lsyscache.h"
> #include "utils/typcache.h"
> + #include "utils/acl.h"
> #include "miscadmin.h"
>
>
> ***************
> *** 164,170 ****
> Datum *vals, char *nulls);
> static void ri_ReportViolation(RI_QueryKey *qkey, const char *constrname,
> Relation pk_rel, Relation fk_rel,
> ! HeapTuple violator, bool spi_err);
>
>
> /* ----------
> --- 165,172 ----
> Datum *vals, char *nulls);
> static void ri_ReportViolation(RI_QueryKey *qkey, const char *constrname,
> Relation pk_rel, Relation fk_rel,
> ! HeapTuple violator, TupleDesc tupdesc,
> ! bool spi_err);
>
>
> /* ----------
> ***************
> *** 2540,2546 ****
> --- 2542,2743 ----
> }
>
>
> + /* ----------
> + * RI_Initial_Check -
> + *
> + * Check an entire table for non-matching values using a single query.
> + * This is not a trigger procedure, but is called during ALTER TABLE
> + * ADD FOREIGN KEY to validate the initial table contents.
> + *
> + * We expect that an exclusive lock has been taken on rel and pkrel;
> + * hence, we do not need to lock individual rows for the check.
> + *
> + * If the check fails because the current user doesn't have permissions
> + * to read both tables, return false to let our caller know that they will
> + * need to do something else to check the constraint.
> + * ----------
> + */
> + bool
> + RI_Initial_Check(FkConstraint *fkconstraint, Relation rel, Relation pkrel)
> + {
> + const char *constrname = fkconstraint->constr_name;
> + char querystr[MAX_QUOTED_REL_NAME_LEN * 2 + 250 +
> + (MAX_QUOTED_NAME_LEN + 32) * ((RI_MAX_NUMKEYS * 4)+1)];
> + char pkrelname[MAX_QUOTED_REL_NAME_LEN];
> + char relname[MAX_QUOTED_REL_NAME_LEN];
> + char attname[MAX_QUOTED_NAME_LEN];
> + char fkattname[MAX_QUOTED_NAME_LEN];
> + const char *sep;
> + List *list;
> + List *list2;
> + int spi_result;
> + void *qplan;
> +
> + /*
> + * Check to make sure current user has enough permissions to do the
> + * test query. (If not, caller can fall back to the trigger method,
> + * which works because it changes user IDs on the fly.)
> + *
> + * XXX are there any other show-stopper conditions to check?
> + */
> + if (pg_class_aclcheck(RelationGetRelid(rel), GetUserId(), ACL_SELECT) != ACLCHECK_OK)
> + return false;
> + if (pg_class_aclcheck(RelationGetRelid(pkrel), GetUserId(), ACL_SELECT) != ACLCHECK_OK)
> + return false;
> +
> + /*----------
> + * The query string built is:
> + * SELECT fk.keycols FROM ONLY relname fk
> + * LEFT OUTER JOIN ONLY pkrelname pk
> + * ON (pk.pkkeycol1=fk.keycol1 [AND ...])
> + * WHERE pk.pkkeycol1 IS NULL AND
> + * For MATCH unspecified:
> + * (fk.keycol1 IS NOT NULL [AND ...])
> + * For MATCH FULL:
> + * (fk.keycol1 IS NOT NULL [OR ...])
> + *----------
> + */
> +
> + sprintf(querystr, "SELECT ");
> + sep="";
> + foreach(list, fkconstraint->fk_attrs)
> + {
> + quoteOneName(attname, strVal(lfirst(list)));
> + snprintf(querystr + strlen(querystr), sizeof(querystr) - strlen(querystr),
> + "%sfk.%s", sep, attname);
> + sep = ", ";
> + }
> +
> + quoteRelationName(pkrelname, pkrel);
> + quoteRelationName(relname, rel);
> + snprintf(querystr + strlen(querystr), sizeof(querystr) - strlen(querystr),
> + " FROM ONLY %s fk LEFT OUTER JOIN ONLY %s pk ON (",
> + relname, pkrelname);
> +
> + sep="";
> + for (list=fkconstraint->pk_attrs, list2=fkconstraint->fk_attrs;
> + list != NIL && list2 != NIL;
> + list=lnext(list), list2=lnext(list2))
> + {
> + quoteOneName(attname, strVal(lfirst(list)));
> + quoteOneName(fkattname, strVal(lfirst(list2)));
> + snprintf(querystr + strlen(querystr), sizeof(querystr) - strlen(querystr),
> + "%spk.%s=fk.%s",
> + sep, attname, fkattname);
> + sep = " AND ";
> + }
> + /*
> + * It's sufficient to test any one pk attribute for null to detect a
> + * join failure.
> + */
> + quoteOneName(attname, strVal(lfirst(fkconstraint->pk_attrs)));
> + snprintf(querystr + strlen(querystr), sizeof(querystr) - strlen(querystr),
> + ") WHERE pk.%s IS NULL AND (", attname);
> +
> + sep="";
> + foreach(list, fkconstraint->fk_attrs)
> + {
> + quoteOneName(attname, strVal(lfirst(list)));
> + snprintf(querystr + strlen(querystr), sizeof(querystr) - strlen(querystr),
> + "%sfk.%s IS NOT NULL",
> + sep, attname);
> + switch (fkconstraint->fk_matchtype)
> + {
> + case FKCONSTR_MATCH_UNSPECIFIED:
> + sep=" AND ";
> + break;
> + case FKCONSTR_MATCH_FULL:
> + sep=" OR ";
> + break;
> + case FKCONSTR_MATCH_PARTIAL:
> + ereport(ERROR,
> + (errcode(ERRCODE_FEATURE_NOT_SUPPORTED),
> + errmsg("MATCH PARTIAL not yet implemented")));
> + break;
> + default:
> + elog(ERROR, "unrecognized match type: %d",
> + fkconstraint->fk_matchtype);
> + break;
> + }
> + }
> + snprintf(querystr + strlen(querystr), sizeof(querystr) - strlen(querystr),
> + ")");
> +
> + if (SPI_connect() != SPI_OK_CONNECT)
> + elog(ERROR, "SPI_connect failed");
> +
> + /*
> + * Generate the plan. We don't need to cache it, and there are no
> + * arguments to the plan.
> + */
> + qplan = SPI_prepare(querystr, 0, NULL);
> +
> + /*
> + * Run the plan. For safety we force a current query snapshot to be
> + * used. (In serializable mode, this arguably violates serializability,
> + * but we really haven't got much choice.) We need at most one tuple
> + * returned, so pass limit = 1.
> + */
> + spi_result = SPI_execp_current(qplan, NULL, NULL, true, 1);
>
> + /* Check result */
> + if (spi_result != SPI_OK_SELECT)
> + elog(ERROR, "SPI_execp_current returned %d", spi_result);
> +
> + /* Did we find a tuple violating the constraint? */
> + if (SPI_processed > 0)
> + {
> + HeapTuple tuple = SPI_tuptable->vals[0];
> + TupleDesc tupdesc = SPI_tuptable->tupdesc;
> + int nkeys = length(fkconstraint->fk_attrs);
> + int i;
> + RI_QueryKey qkey;
> +
> + /*
> + * If it's MATCH FULL, and there are any nulls in the FK keys,
> + * complain about that rather than the lack of a match. MATCH FULL
> + * disallows partially-null FK rows.
> + */
> + if (fkconstraint->fk_matchtype == FKCONSTR_MATCH_FULL)
> + {
> + bool isnull = false;
> +
> + for (i = 1; i <= nkeys; i++)
> + {
> + (void) SPI_getbinval(tuple, tupdesc, i, &isnull);
> + if (isnull)
> + break;
> + }
> + if (isnull)
> + ereport(ERROR,
> + (errcode(ERRCODE_FOREIGN_KEY_VIOLATION),
> + errmsg("insert or update on table \"%s\" violates foreign key constraint \"%s\"",
> + RelationGetRelationName(rel),
> + constrname),
> + errdetail("MATCH FULL does not allow mixing of null and nonnull key values.")));
> + }
> +
> + /*
> + * Although we didn't cache the query, we need to set up a fake
> + * query key to pass to ri_ReportViolation.
> + */
> + MemSet(&qkey, 0, sizeof(qkey));
> + qkey.constr_queryno = RI_PLAN_CHECK_LOOKUPPK;
> + qkey.nkeypairs = nkeys;
> + for (i = 0; i < nkeys; i++)
> + qkey.keypair[i][RI_KEYPAIR_FK_IDX] = i + 1;
> +
> + ri_ReportViolation(&qkey, constrname,
> + pkrel, rel,
> + tuple, tupdesc,
> + false);
> + }
> +
> + if (SPI_finish() != SPI_OK_FINISH)
> + elog(ERROR, "SPI_finish failed");
> +
> + return true;
> + }
>
>
> /* ----------
> ***************
> *** 2905,2910 ****
> --- 3102,3108 ----
> ri_ReportViolation(qkey, constrname ? constrname : "",
> pk_rel, fk_rel,
> new_tuple ? new_tuple : old_tuple,
> + NULL,
> true);
>
> /* XXX wouldn't it be clearer to do this part at the caller? */
> ***************
> *** 2913,2918 ****
> --- 3111,3117 ----
> ri_ReportViolation(qkey, constrname,
> pk_rel, fk_rel,
> new_tuple ? new_tuple : old_tuple,
> + NULL,
> false);
>
> return SPI_processed != 0;
> ***************
> *** 2950,2956 ****
> static void
> ri_ReportViolation(RI_QueryKey *qkey, const char *constrname,
> Relation pk_rel, Relation fk_rel,
> ! HeapTuple violator, bool spi_err)
> {
> #define BUFLENGTH 512
> char key_names[BUFLENGTH];
> --- 3149,3156 ----
> static void
> ri_ReportViolation(RI_QueryKey *qkey, const char *constrname,
> Relation pk_rel, Relation fk_rel,
> ! HeapTuple violator, TupleDesc tupdesc,
> ! bool spi_err)
> {
> #define BUFLENGTH 512
> char key_names[BUFLENGTH];
> ***************
> *** 2958,2964 ****
> char *name_ptr = key_names;
> char *val_ptr = key_values;
> bool onfk;
> - Relation rel;
> int idx,
> key_idx;
>
> --- 3158,3163 ----
> ***************
> *** 2972,2989 ****
> errhint("This is most likely due to a rule having rewritten the query.")));
>
> /*
> ! * rel is set to where the tuple description is coming from.
> */
> onfk = (qkey->constr_queryno == RI_PLAN_CHECK_LOOKUPPK);
> if (onfk)
> {
> - rel = fk_rel;
> key_idx = RI_KEYPAIR_FK_IDX;
> }
> else
> {
> - rel = pk_rel;
> key_idx = RI_KEYPAIR_PK_IDX;
> }
>
> /*
> --- 3171,3191 ----
> errhint("This is most likely due to a rule having rewritten the query.")));
>
> /*
> ! * Determine which relation to complain about. If tupdesc wasn't
> ! * passed by caller, assume the violator tuple came from there.
> */
> onfk = (qkey->constr_queryno == RI_PLAN_CHECK_LOOKUPPK);
> if (onfk)
> {
> key_idx = RI_KEYPAIR_FK_IDX;
> + if (tupdesc == NULL)
> + tupdesc = fk_rel->rd_att;
> }
> else
> {
> key_idx = RI_KEYPAIR_PK_IDX;
> + if (tupdesc == NULL)
> + tupdesc = pk_rel->rd_att;
> }
>
> /*
> ***************
> *** 3008,3015 ****
> char *name,
> *val;
>
> ! name = SPI_fname(rel->rd_att, fnum);
> ! val = SPI_getvalue(violator, rel->rd_att, fnum);
> if (!val)
> val = "null";
>
> --- 3210,3217 ----
> char *name,
> *val;
>
> ! name = SPI_fname(tupdesc, fnum);
> ! val = SPI_getvalue(violator, tupdesc, fnum);
> if (!val)
> val = "null";
>
> *** src/include/commands/trigger.h.orig Sun Aug 3 23:01:31 2003
> --- src/include/commands/trigger.h Sun Oct 5 16:11:44 2003
> ***************
> *** 197,201 ****
> --- 197,204 ----
> * in utils/adt/ri_triggers.c
> */
> extern bool RI_FKey_keyequal_upd(TriggerData *trigdata);
> + extern bool RI_Initial_Check(FkConstraint *fkconstraint,
> + Relation rel,
> + Relation pkrel);
>
> #endif /* TRIGGER_H */

>
> ---------------------------(end of broadcast)---------------------------
> TIP 9: the planner will ignore your desire to choose an index scan if your
> joining column's datatypes do not match

From:	Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To:	Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>
Cc:	Stephan Szabo <sszabo(at)megazone(dot)bigpanda(dot)com>, Jan Wieck <JanWieck(at)Yahoo(dot)com>, PostgreSQL-development <pgsql-hackers(at)postgresql(dot)org>
Subject:	Re: Open 7.4 items
Date:	2003-10-05 21:11:29
Message-ID:	14860.1065388289@sss.pgh.pa.us
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-hackers pgsql-patches

Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us> writes:
> Wow, that's a heap of code --- that's my only comment. :-)

Most of it is pretty direct cribbing of code that already exists in
the other routines in ri_triggers.c, so it's not really completely
new code, just boilerplate.

regards, tom lane

From:	Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>
To:	Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
Cc:	Stephan Szabo <sszabo(at)megazone(dot)bigpanda(dot)com>, Jan Wieck <JanWieck(at)Yahoo(dot)com>, PostgreSQL-development <pgsql-hackers(at)postgresql(dot)org>
Subject:	Re: Open 7.4 items
Date:	2003-10-05 21:19:39
Message-ID:	200310052119.h95LJd505783@candle.pha.pa.us
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-hackers pgsql-patches

Tom Lane wrote:
> Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us> writes:
> > Wow, that's a heap of code --- that's my only comment. :-)
>
> Most of it is pretty direct cribbing of code that already exists in
> the other routines in ri_triggers.c, so it's not really completely
> new code, just boilerplate.

Oh, that makes me feel better. Do we have timings for this code?

From:	"Marc G(dot) Fournier" <scrappy(at)postgresql(dot)org>
To:	Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>
Cc:	Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>, Stephan Szabo <sszabo(at)megazone(dot)bigpanda(dot)com>, Jan Wieck <JanWieck(at)Yahoo(dot)com>, PostgreSQL-development <pgsql-hackers(at)postgresql(dot)org>
Subject:	Re: Open 7.4 items
Date:	2003-10-05 21:33:46
Message-ID:	20031005183320.V37588@ganymede.hub.org
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-hackers pgsql-patches

On Sun, 5 Oct 2003, Bruce Momjian wrote:

>
> Wow, that's a heap of code --- that's my only comment. :-)

And you reposted the *whole* patch for that?? *tsk* *tsk*

From:	Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>
To:	"Marc G(dot) Fournier" <scrappy(at)postgresql(dot)org>
Cc:	Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>, Stephan Szabo <sszabo(at)megazone(dot)bigpanda(dot)com>, Jan Wieck <JanWieck(at)Yahoo(dot)com>, PostgreSQL-development <pgsql-hackers(at)postgresql(dot)org>
Subject:	Re: Open 7.4 items
Date:	2003-10-05 21:46:08
Message-ID:	200310052146.h95Lk8U08649@candle.pha.pa.us
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-hackers pgsql-patches

Marc G. Fournier wrote:
>
>
> On Sun, 5 Oct 2003, Bruce Momjian wrote:
>
> >
> > Wow, that's a heap of code --- that's my only comment. :-)
>
> And you reposted the *whole* patch for that?? *tsk* *tsk*

Oops, sorry.

From:	Stephan Szabo <sszabo(at)megazone(dot)bigpanda(dot)com>
To:	Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
Cc:	Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>, Jan Wieck <JanWieck(at)Yahoo(dot)com>, PostgreSQL-development <pgsql-hackers(at)postgresql(dot)org>
Subject:	Re: Open 7.4 items
Date:	2003-10-05 22:52:34
Message-ID:	20031005152122.M80814@megazone.bigpanda.com
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-hackers pgsql-patches

On Sun, 5 Oct 2003, Tom Lane wrote:

> Stephan Szabo <sszabo(at)megazone(dot)bigpanda(dot)com> writes:
> > I wasn't sure what to do about some of the spi error conditions. For many
> > of them I'm just returning false now so that it will try the other
> > mechanism in case that might work. I'm not really sure if that's worth it,
> > or if I should just elog(ERROR) and give up.
>
> I think you may as well keep it the same as the other RI routines and
> just elog() on SPI error. If SPI is broken, the trigger procedure is
> gonna fail too.

Okay.

> I changed that, consolidated the error-reporting code, and fixed a couple
> other little issues, notably:
>
> * The generated query applied ONLY to the FK table but not the PK table.
> I assume this was just an oversight.

Yep, dumb oversight.

> * The query is now run using SPI_execp_current and selecting "current"
> snapshot. Without this, we could fail in a serializable transaction
> if someone else has already committed changes to either relation.

You'd think I'd have remembered this could happen given the recent
discussions. I was wondering if we could get the serialization failure
with for update, but that's disallowed on the nullable side of the outer
join. It's probably okay to give the no such key error in the delete case
(at some point it'd be nice to make it give serialization failure, but
that might take alot more work than is warrented at this time for 7.4)

From:	Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To:	Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>
Cc:	Stephan Szabo <sszabo(at)megazone(dot)bigpanda(dot)com>, Jan Wieck <JanWieck(at)Yahoo(dot)com>, PostgreSQL-development <pgsql-hackers(at)postgresql(dot)org>
Subject:	Re: Open 7.4 items
Date:	2003-10-05 23:39:59
Message-ID:	15768.1065397199@sss.pgh.pa.us
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-hackers pgsql-patches

Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us> writes:
> Oh, that makes me feel better. Do we have timings for this code?

This is just a single data point, but I made a table of 1 million
rows containing just the int4 primary key column (values 0-1million
in a somewhat random order). Then I copied the same data, sans index,
to produce a foreign key table. Then I tried ALTER ADD PRIMARY KEY.

The results were:

Time to load the 1 million rows: 8 sec

Time to create the PK index: 10 sec

Time to ADD PRIMARY KEY:

with CVS-tip code (fire trigger per row): 78 sec

with proposed patch: anywhere from 5 to 25 sec depending on plan

The default plan if there is no index on the FK table (meaning the
planner will not know its true size) is a nestloop with inner index
scan taking about 17 sec.

If any index has been created on the FK table, you'll probably get a
merge or hash join. I found these took about 20 sec with the default
sort_mem setting, but with sort_mem boosted to 50000 or more, the
hash join got lots faster --- down in the 6-7 second range ---
presumably because it didn't need multiple hash batches.

It'd clearly be worth our while to mention boosting sort_mem as a
helpful thing to do during bulk data load --- it should speed up
btree index creation too. I don't think that tip appears anywhere
in the docs at the moment.

So the patch definitely seems worthwhile, but someone might still care
to argue that there should be a bypass switch available too.

regards, tom lane

From:	Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>
To:	Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
Cc:	Stephan Szabo <sszabo(at)megazone(dot)bigpanda(dot)com>, Jan Wieck <JanWieck(at)Yahoo(dot)com>, PostgreSQL-development <pgsql-hackers(at)postgresql(dot)org>
Subject:	Re: Open 7.4 items
Date:	2003-10-05 23:51:55
Message-ID:	200310052351.h95Nptr04264@candle.pha.pa.us
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-hackers pgsql-patches

Tom Lane wrote:
> It'd clearly be worth our while to mention boosting sort_mem as a
> helpful thing to do during bulk data load --- it should speed up
> btree index creation too. I don't think that tip appears anywhere
> in the docs at the moment.

Added recently, see last sentence:

<term><varname>sort_mem</varname> (<type>integer</type>)</term>
<listitem>
<para>
Specifies the amount of memory to be used by internal sort operations and
hash tables before switching to temporary disk files. The value is
specified in kilobytes, and defaults to 1024 kilobytes (1 MB).
Note that for a complex query, several sort or hash operations might be
running in parallel; each one will be allowed to use as much memory
as this value specifies before it starts to put data into temporary
files. Also, several running sessions could be doing
sort operations simultaneously. So the total memory used could be many
times the value of <varname>sort_mem</varname>. Sort operations are used
by <literal>ORDER BY</>, merge joins, and <command>CREATE INDEX</>.
Hash tables are used in hash joins, hash-based aggregation, and
hash-based processing of <literal>IN</> subqueries. Because
<command>CREATE INDEX</> is used when restoring a database, it might
be good to temporarily increase this value during a restore.
</para>

From:	Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To:	Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>
Cc:	Stephan Szabo <sszabo(at)megazone(dot)bigpanda(dot)com>, Jan Wieck <JanWieck(at)Yahoo(dot)com>, PostgreSQL-development <pgsql-hackers(at)postgresql(dot)org>
Subject:	Re: Open 7.4 items
Date:	2003-10-05 23:58:14
Message-ID:	15910.1065398294@sss.pgh.pa.us
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-hackers pgsql-patches

Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us> writes:
> Tom Lane wrote:
>> It'd clearly be worth our while to mention boosting sort_mem as a
>> helpful thing to do during bulk data load --- it should speed up
>> btree index creation too. I don't think that tip appears anywhere
>> in the docs at the moment.

> Added recently, see last sentence:

That's a fairly useless place to put it, though, since someone would
only think to look at sort_mem if they already had a clue. It should
be mentioned under bulk data load (in performance tips chapter) and
perhaps also in dump/restore procedures.

regards, tom lane

From:	Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>
To:	Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
Cc:	Stephan Szabo <sszabo(at)megazone(dot)bigpanda(dot)com>, Jan Wieck <JanWieck(at)Yahoo(dot)com>, PostgreSQL-development <pgsql-hackers(at)postgresql(dot)org>
Subject:	Re: Open 7.4 items
Date:	2003-10-06 00:12:49
Message-ID:	200310060012.h960CnQ05926@candle.pha.pa.us
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-hackers pgsql-patches

Tom Lane wrote:
> Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us> writes:
> > Tom Lane wrote:
> >> It'd clearly be worth our while to mention boosting sort_mem as a
> >> helpful thing to do during bulk data load --- it should speed up
> >> btree index creation too. I don't think that tip appears anywhere
> >> in the docs at the moment.
>
> > Added recently, see last sentence:
>
> That's a fairly useless place to put it, though, since someone would
> only think to look at sort_mem if they already had a clue. It should
> be mentioned under bulk data load (in performance tips chapter) and
> perhaps also in dump/restore procedures.

There were several places it is needed, so I just hit the one place ---
feel free to add some more.

From:	Neil Conway <neilc(at)samurai(dot)com>
To:	Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
Cc:	Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>, Stephan Szabo <sszabo(at)megazone(dot)bigpanda(dot)com>, Jan Wieck <JanWieck(at)Yahoo(dot)com>, PostgreSQL Hackers <pgsql-hackers(at)postgresql(dot)org>, PostgreSQL Patches <pgsql-patches(at)postgresql(dot)org>
Subject:	Re: [HACKERS] Open 7.4 items
Date:	2003-10-06 00:22:38
Message-ID:	1065399757.466.24.camel@tokyo
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-hackers pgsql-patches

On Sun, 2003-10-05 at 19:58, Tom Lane wrote:
> That's a fairly useless place to put it, though, since someone would
> only think to look at sort_mem if they already had a clue. It should
> be mentioned under bulk data load (in performance tips chapter)

Attached is a doc patch that does this. The way I've worded it may not
be the best, though.

> and perhaps also in dump/restore procedures.

It's already mentioned there.

Should we also suggest turning off fsync when doing restores?

(BTW, is there a reason the docs consistently call them "B-tree
indexes", not "B+-tree indexes"?)

-Neil

Attachment	Content-Type	Size
sort_mem_restore_docs-2.patch	text/x-patch	3.8 KB

From:	Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To:	Neil Conway <neilc(at)samurai(dot)com>
Cc:	Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>, Stephan Szabo <sszabo(at)megazone(dot)bigpanda(dot)com>, Jan Wieck <JanWieck(at)Yahoo(dot)com>, PostgreSQL Hackers <pgsql-hackers(at)postgresql(dot)org>, PostgreSQL Patches <pgsql-patches(at)postgresql(dot)org>
Subject:	Re: [HACKERS] Open 7.4 items
Date:	2003-10-06 02:41:52
Message-ID:	18824.1065408112@sss.pgh.pa.us
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-hackers pgsql-patches

Neil Conway <neilc(at)samurai(dot)com> writes:
> (BTW, is there a reason the docs consistently call them "B-tree
> indexes", not "B+-tree indexes"?)

The latter might be technically more correct, but most people are going
to think it's a typo. I think B-tree is fine for the purposes of our docs.

regards, tom lane

From:	Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To:	Stephan Szabo <sszabo(at)megazone(dot)bigpanda(dot)com>
Cc:	Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>, Jan Wieck <JanWieck(at)Yahoo(dot)com>, PostgreSQL-development <pgsql-hackers(at)postgresql(dot)org>
Subject:	Re: Open 7.4 items
Date:	2003-10-06 03:48:26
Message-ID:	19237.1065412106@sss.pgh.pa.us
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-hackers pgsql-patches

Stephan Szabo <sszabo(at)megazone(dot)bigpanda(dot)com> writes:
> It's probably okay to give the no such key error in the delete case
> (at some point it'd be nice to make it give serialization failure, but
> that might take alot more work than is warrented at this time for 7.4)

Per prior discussion, I think the "no such key" error is more useful
than the serialization error, even if it's the wrong thing according
to a narrow interpretation; so I really don't feel much need to revisit
this later.

regards, tom lane

From:	Jan Wieck <JanWieck(at)Yahoo(dot)com>
To:	Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>
Cc:	Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>, Stephan Szabo <sszabo(at)megazone(dot)bigpanda(dot)com>, PostgreSQL-development <pgsql-hackers(at)postgresql(dot)org>
Subject:	Re: Open 7.4 items
Date:	2003-10-06 13:34:24
Message-ID:	3F816F60.4090802@Yahoo.com
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-hackers pgsql-patches

Bruce Momjian wrote:
> Wow, that's a heap of code --- that's my only comment. :-)

Not really.

I'm not sure what conditions could possibley cause SPI_prepare to return
NULL, but it'd be certainly better to check that. Other than that, looks
good to me.

Jan

>
> ---------------------------------------------------------------------------
>
> Tom Lane wrote:
>> Stephan Szabo <sszabo(at)megazone(dot)bigpanda(dot)com> writes:
>> > It's not cleaned up, but yes. It appears to work for the simple tests I've
>> > done and should fall back if the permissions don't work to do a single
>> > query on both tables.
>>
>> Here's my code-reviewed version of the patch. Anyone else want to take
>> a look?
>>
>> > I wasn't sure what to do about some of the spi error conditions. For many
>> > of them I'm just returning false now so that it will try the other
>> > mechanism in case that might work. I'm not really sure if that's worth it,
>> > or if I should just elog(ERROR) and give up.
>>
>> I think you may as well keep it the same as the other RI routines and
>> just elog() on SPI error. If SPI is broken, the trigger procedure is
>> gonna fail too.
>>
>> I changed that, consolidated the error-reporting code, and fixed a couple
>> other little issues, notably:
>>
>> * The generated query applied ONLY to the FK table but not the PK table.
>> I assume this was just an oversight.
>>
>> * The query is now run using SPI_execp_current and selecting "current"
>> snapshot. Without this, we could fail in a serializable transaction
>> if someone else has already committed changes to either relation.
>> For example:
>>
>> create pk and fk tables;
>>
>> begin serializable xact;
>> insert into pk values(1);
>> insert into fk values(1);
>>
>> begin;
>> insert into fk values(2);
>> commit;
>>
>> alter table fk add foreign key ...;
>>
>> The ALTER will not be blocked from acquiring exclusive lock, since
>> T2 already committed. But if we run the query in the serializable
>> snapshot, it won't see the violating row fk=2.
>>
>> The old trigger-based check avoids this error because the scan loop uses
>> SnapshotNow to select live rows from the FK table. There is a dual race
>> condition where T2 deletes a row from the PK table. In current CVS tip
>> this will be detected and reported as a serialization failure, because
>> T1 won't be able to get SELECT FOR UPDATE lock on the deleted row. With
>> the proposed patch you'll instead see a "no such key" failure, which I
>> think is fine, even though it nominally violates serializability.
>>
>> Comments? Can anyone else do a code review (Jan??)?
>>
>> regards, tom lane
>>
>
> Content-Description: RIcheck.patch
>
>> *** src/backend/commands/tablecmds.c.orig Thu Oct 2 15:24:52 2003
>> --- src/backend/commands/tablecmds.c Sun Oct 5 16:29:51 2003
>> ***************
>> *** 3455,3460 ****
>> --- 3455,3467 ----
>> int count;
>>
>> /*
>> + * See if we can do it with a single LEFT JOIN query. A FALSE result
>> + * indicates we must proceed with the fire-the-trigger method.
>> + */
>> + if (RI_Initial_Check(fkconstraint, rel, pkrel))
>> + return;
>> +
>> + /*
>> * Scan through each tuple, calling RI_FKey_check_ins (insert trigger)
>> * as if that tuple had just been inserted. If any of those fail, it
>> * should ereport(ERROR) and that's that.
>> *** src/backend/utils/adt/ri_triggers.c.orig Wed Oct 1 17:30:52 2003
>> --- src/backend/utils/adt/ri_triggers.c Sun Oct 5 16:42:37 2003
>> ***************
>> *** 40,45 ****
>> --- 40,46 ----
>> #include "rewrite/rewriteHandler.h"
>> #include "utils/lsyscache.h"
>> #include "utils/typcache.h"
>> + #include "utils/acl.h"
>> #include "miscadmin.h"
>>
>>
>> ***************
>> *** 164,170 ****
>> Datum *vals, char *nulls);
>> static void ri_ReportViolation(RI_QueryKey *qkey, const char *constrname,
>> Relation pk_rel, Relation fk_rel,
>> ! HeapTuple violator, bool spi_err);
>>
>>
>> /* ----------
>> --- 165,172 ----
>> Datum *vals, char *nulls);
>> static void ri_ReportViolation(RI_QueryKey *qkey, const char *constrname,
>> Relation pk_rel, Relation fk_rel,
>> ! HeapTuple violator, TupleDesc tupdesc,
>> ! bool spi_err);
>>
>>
>> /* ----------
>> ***************
>> *** 2540,2546 ****
>> --- 2542,2743 ----
>> }
>>
>>
>> + /* ----------
>> + * RI_Initial_Check -
>> + *
>> + * Check an entire table for non-matching values using a single query.
>> + * This is not a trigger procedure, but is called during ALTER TABLE
>> + * ADD FOREIGN KEY to validate the initial table contents.
>> + *
>> + * We expect that an exclusive lock has been taken on rel and pkrel;
>> + * hence, we do not need to lock individual rows for the check.
>> + *
>> + * If the check fails because the current user doesn't have permissions
>> + * to read both tables, return false to let our caller know that they will
>> + * need to do something else to check the constraint.
>> + * ----------
>> + */
>> + bool
>> + RI_Initial_Check(FkConstraint *fkconstraint, Relation rel, Relation pkrel)
>> + {
>> + const char *constrname = fkconstraint->constr_name;
>> + char querystr[MAX_QUOTED_REL_NAME_LEN * 2 + 250 +
>> + (MAX_QUOTED_NAME_LEN + 32) * ((RI_MAX_NUMKEYS * 4)+1)];
>> + char pkrelname[MAX_QUOTED_REL_NAME_LEN];
>> + char relname[MAX_QUOTED_REL_NAME_LEN];
>> + char attname[MAX_QUOTED_NAME_LEN];
>> + char fkattname[MAX_QUOTED_NAME_LEN];
>> + const char *sep;
>> + List *list;
>> + List *list2;
>> + int spi_result;
>> + void *qplan;
>> +
>> + /*
>> + * Check to make sure current user has enough permissions to do the
>> + * test query. (If not, caller can fall back to the trigger method,
>> + * which works because it changes user IDs on the fly.)
>> + *
>> + * XXX are there any other show-stopper conditions to check?
>> + */
>> + if (pg_class_aclcheck(RelationGetRelid(rel), GetUserId(), ACL_SELECT) != ACLCHECK_OK)
>> + return false;
>> + if (pg_class_aclcheck(RelationGetRelid(pkrel), GetUserId(), ACL_SELECT) != ACLCHECK_OK)
>> + return false;
>> +
>> + /*----------
>> + * The query string built is:
>> + * SELECT fk.keycols FROM ONLY relname fk
>> + * LEFT OUTER JOIN ONLY pkrelname pk
>> + * ON (pk.pkkeycol1=fk.keycol1 [AND ...])
>> + * WHERE pk.pkkeycol1 IS NULL AND
>> + * For MATCH unspecified:
>> + * (fk.keycol1 IS NOT NULL [AND ...])
>> + * For MATCH FULL:
>> + * (fk.keycol1 IS NOT NULL [OR ...])
>> + *----------
>> + */
>> +
>> + sprintf(querystr, "SELECT ");
>> + sep="";
>> + foreach(list, fkconstraint->fk_attrs)
>> + {
>> + quoteOneName(attname, strVal(lfirst(list)));
>> + snprintf(querystr + strlen(querystr), sizeof(querystr) - strlen(querystr),
>> + "%sfk.%s", sep, attname);
>> + sep = ", ";
>> + }
>> +
>> + quoteRelationName(pkrelname, pkrel);
>> + quoteRelationName(relname, rel);
>> + snprintf(querystr + strlen(querystr), sizeof(querystr) - strlen(querystr),
>> + " FROM ONLY %s fk LEFT OUTER JOIN ONLY %s pk ON (",
>> + relname, pkrelname);
>> +
>> + sep="";
>> + for (list=fkconstraint->pk_attrs, list2=fkconstraint->fk_attrs;
>> + list != NIL && list2 != NIL;
>> + list=lnext(list), list2=lnext(list2))
>> + {
>> + quoteOneName(attname, strVal(lfirst(list)));
>> + quoteOneName(fkattname, strVal(lfirst(list2)));
>> + snprintf(querystr + strlen(querystr), sizeof(querystr) - strlen(querystr),
>> + "%spk.%s=fk.%s",
>> + sep, attname, fkattname);
>> + sep = " AND ";
>> + }
>> + /*
>> + * It's sufficient to test any one pk attribute for null to detect a
>> + * join failure.
>> + */
>> + quoteOneName(attname, strVal(lfirst(fkconstraint->pk_attrs)));
>> + snprintf(querystr + strlen(querystr), sizeof(querystr) - strlen(querystr),
>> + ") WHERE pk.%s IS NULL AND (", attname);
>> +
>> + sep="";
>> + foreach(list, fkconstraint->fk_attrs)
>> + {
>> + quoteOneName(attname, strVal(lfirst(list)));
>> + snprintf(querystr + strlen(querystr), sizeof(querystr) - strlen(querystr),
>> + "%sfk.%s IS NOT NULL",
>> + sep, attname);
>> + switch (fkconstraint->fk_matchtype)
>> + {
>> + case FKCONSTR_MATCH_UNSPECIFIED:
>> + sep=" AND ";
>> + break;
>> + case FKCONSTR_MATCH_FULL:
>> + sep=" OR ";
>> + break;
>> + case FKCONSTR_MATCH_PARTIAL:
>> + ereport(ERROR,
>> + (errcode(ERRCODE_FEATURE_NOT_SUPPORTED),
>> + errmsg("MATCH PARTIAL not yet implemented")));
>> + break;
>> + default:
>> + elog(ERROR, "unrecognized match type: %d",
>> + fkconstraint->fk_matchtype);
>> + break;
>> + }
>> + }
>> + snprintf(querystr + strlen(querystr), sizeof(querystr) - strlen(querystr),
>> + ")");
>> +
>> + if (SPI_connect() != SPI_OK_CONNECT)
>> + elog(ERROR, "SPI_connect failed");
>> +
>> + /*
>> + * Generate the plan. We don't need to cache it, and there are no
>> + * arguments to the plan.
>> + */
>> + qplan = SPI_prepare(querystr, 0, NULL);
>> +
>> + /*
>> + * Run the plan. For safety we force a current query snapshot to be
>> + * used. (In serializable mode, this arguably violates serializability,
>> + * but we really haven't got much choice.) We need at most one tuple
>> + * returned, so pass limit = 1.
>> + */
>> + spi_result = SPI_execp_current(qplan, NULL, NULL, true, 1);
>>
>> + /* Check result */
>> + if (spi_result != SPI_OK_SELECT)
>> + elog(ERROR, "SPI_execp_current returned %d", spi_result);
>> +
>> + /* Did we find a tuple violating the constraint? */
>> + if (SPI_processed > 0)
>> + {
>> + HeapTuple tuple = SPI_tuptable->vals[0];
>> + TupleDesc tupdesc = SPI_tuptable->tupdesc;
>> + int nkeys = length(fkconstraint->fk_attrs);
>> + int i;
>> + RI_QueryKey qkey;
>> +
>> + /*
>> + * If it's MATCH FULL, and there are any nulls in the FK keys,
>> + * complain about that rather than the lack of a match. MATCH FULL
>> + * disallows partially-null FK rows.
>> + */
>> + if (fkconstraint->fk_matchtype == FKCONSTR_MATCH_FULL)
>> + {
>> + bool isnull = false;
>> +
>> + for (i = 1; i <= nkeys; i++)
>> + {
>> + (void) SPI_getbinval(tuple, tupdesc, i, &isnull);
>> + if (isnull)
>> + break;
>> + }
>> + if (isnull)
>> + ereport(ERROR,
>> + (errcode(ERRCODE_FOREIGN_KEY_VIOLATION),
>> + errmsg("insert or update on table \"%s\" violates foreign key constraint \"%s\"",
>> + RelationGetRelationName(rel),
>> + constrname),
>> + errdetail("MATCH FULL does not allow mixing of null and nonnull key values.")));
>> + }
>> +
>> + /*
>> + * Although we didn't cache the query, we need to set up a fake
>> + * query key to pass to ri_ReportViolation.
>> + */
>> + MemSet(&qkey, 0, sizeof(qkey));
>> + qkey.constr_queryno = RI_PLAN_CHECK_LOOKUPPK;
>> + qkey.nkeypairs = nkeys;
>> + for (i = 0; i < nkeys; i++)
>> + qkey.keypair[i][RI_KEYPAIR_FK_IDX] = i + 1;
>> +
>> + ri_ReportViolation(&qkey, constrname,
>> + pkrel, rel,
>> + tuple, tupdesc,
>> + false);
>> + }
>> +
>> + if (SPI_finish() != SPI_OK_FINISH)
>> + elog(ERROR, "SPI_finish failed");
>> +
>> + return true;
>> + }
>>
>>
>> /* ----------
>> ***************
>> *** 2905,2910 ****
>> --- 3102,3108 ----
>> ri_ReportViolation(qkey, constrname ? constrname : "",
>> pk_rel, fk_rel,
>> new_tuple ? new_tuple : old_tuple,
>> + NULL,
>> true);
>>
>> /* XXX wouldn't it be clearer to do this part at the caller? */
>> ***************
>> *** 2913,2918 ****
>> --- 3111,3117 ----
>> ri_ReportViolation(qkey, constrname,
>> pk_rel, fk_rel,
>> new_tuple ? new_tuple : old_tuple,
>> + NULL,
>> false);
>>
>> return SPI_processed != 0;
>> ***************
>> *** 2950,2956 ****
>> static void
>> ri_ReportViolation(RI_QueryKey *qkey, const char *constrname,
>> Relation pk_rel, Relation fk_rel,
>> ! HeapTuple violator, bool spi_err)
>> {
>> #define BUFLENGTH 512
>> char key_names[BUFLENGTH];
>> --- 3149,3156 ----
>> static void
>> ri_ReportViolation(RI_QueryKey *qkey, const char *constrname,
>> Relation pk_rel, Relation fk_rel,
>> ! HeapTuple violator, TupleDesc tupdesc,
>> ! bool spi_err)
>> {
>> #define BUFLENGTH 512
>> char key_names[BUFLENGTH];
>> ***************
>> *** 2958,2964 ****
>> char *name_ptr = key_names;
>> char *val_ptr = key_values;
>> bool onfk;
>> - Relation rel;
>> int idx,
>> key_idx;
>>
>> --- 3158,3163 ----
>> ***************
>> *** 2972,2989 ****
>> errhint("This is most likely due to a rule having rewritten the query.")));
>>
>> /*
>> ! * rel is set to where the tuple description is coming from.
>> */
>> onfk = (qkey->constr_queryno == RI_PLAN_CHECK_LOOKUPPK);
>> if (onfk)
>> {
>> - rel = fk_rel;
>> key_idx = RI_KEYPAIR_FK_IDX;
>> }
>> else
>> {
>> - rel = pk_rel;
>> key_idx = RI_KEYPAIR_PK_IDX;
>> }
>>
>> /*
>> --- 3171,3191 ----
>> errhint("This is most likely due to a rule having rewritten the query.")));
>>
>> /*
>> ! * Determine which relation to complain about. If tupdesc wasn't
>> ! * passed by caller, assume the violator tuple came from there.
>> */
>> onfk = (qkey->constr_queryno == RI_PLAN_CHECK_LOOKUPPK);
>> if (onfk)
>> {
>> key_idx = RI_KEYPAIR_FK_IDX;
>> + if (tupdesc == NULL)
>> + tupdesc = fk_rel->rd_att;
>> }
>> else
>> {
>> key_idx = RI_KEYPAIR_PK_IDX;
>> + if (tupdesc == NULL)
>> + tupdesc = pk_rel->rd_att;
>> }
>>
>> /*
>> ***************
>> *** 3008,3015 ****
>> char *name,
>> *val;
>>
>> ! name = SPI_fname(rel->rd_att, fnum);
>> ! val = SPI_getvalue(violator, rel->rd_att, fnum);
>> if (!val)
>> val = "null";
>>
>> --- 3210,3217 ----
>> char *name,
>> *val;
>>
>> ! name = SPI_fname(tupdesc, fnum);
>> ! val = SPI_getvalue(violator, tupdesc, fnum);
>> if (!val)
>> val = "null";
>>
>> *** src/include/commands/trigger.h.orig Sun Aug 3 23:01:31 2003
>> --- src/include/commands/trigger.h Sun Oct 5 16:11:44 2003
>> ***************
>> *** 197,201 ****
>> --- 197,204 ----
>> * in utils/adt/ri_triggers.c
>> */
>> extern bool RI_FKey_keyequal_upd(TriggerData *trigdata);
>> + extern bool RI_Initial_Check(FkConstraint *fkconstraint,
>> + Relation rel,
>> + Relation pkrel);
>>
>> #endif /* TRIGGER_H */
>
>>
>> ---------------------------(end of broadcast)---------------------------
>> TIP 9: the planner will ignore your desire to choose an index scan if your
>> joining column's datatypes do not match
>

--
#======================================================================#
# It's easier to get forgiveness for being wrong than for being right. #
# Let's break this rule - forgive me. #
#================================================== JanWieck(at)Yahoo(dot)com #

From:	Jan Wieck <JanWieck(at)Yahoo(dot)com>
To:	Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
Cc:	Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>, Stephan Szabo <sszabo(at)megazone(dot)bigpanda(dot)com>, PostgreSQL-development <pgsql-hackers(at)postgresql(dot)org>
Subject:	Re: Open 7.4 items
Date:	2003-10-06 13:37:42
Message-ID:	3F817026.8000204@Yahoo.com
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-hackers pgsql-patches

Tom Lane wrote:

> Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us> writes:
>> Oh, that makes me feel better. Do we have timings for this code?
>
> This is just a single data point, but I made a table of 1 million
> rows containing just the int4 primary key column (values 0-1million
> in a somewhat random order). Then I copied the same data, sans index,
> to produce a foreign key table. Then I tried ALTER ADD PRIMARY KEY.
>
> The results were:
>
> Time to load the 1 million rows: 8 sec
>
> Time to create the PK index: 10 sec
>
> Time to ADD PRIMARY KEY:
>
> with CVS-tip code (fire trigger per row): 78 sec
>
> with proposed patch: anywhere from 5 to 25 sec depending on plan
>
> The default plan if there is no index on the FK table (meaning the
> planner will not know its true size) is a nestloop with inner index
> scan taking about 17 sec.

Does an ANALYZE run between index creation and bulk FK checking improve
planning? It's not doing a full DB scan, so it shouldn't do too much
harm, does it?

Jan

From:	Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To:	Jan Wieck <JanWieck(at)Yahoo(dot)com>
Cc:	Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>, Stephan Szabo <sszabo(at)megazone(dot)bigpanda(dot)com>, PostgreSQL-development <pgsql-hackers(at)postgresql(dot)org>
Subject:	Re: Open 7.4 items
Date:	2003-10-06 13:45:27
Message-ID:	26555.1065447927@sss.pgh.pa.us
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-hackers pgsql-patches

Jan Wieck <JanWieck(at)Yahoo(dot)com> writes:
> I'm not sure what conditions could possibley cause SPI_prepare to return
> NULL, but it'd be certainly better to check that.

Good thought. I was looking at the other SPI_prepare calls in
ri_triggers.c, which don't check for NULL either, but clearly they
should. Will fix all.

regards, tom lane

From:	Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To:	Jan Wieck <JanWieck(at)Yahoo(dot)com>
Cc:	Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>, Stephan Szabo <sszabo(at)megazone(dot)bigpanda(dot)com>, PostgreSQL-development <pgsql-hackers(at)postgresql(dot)org>
Subject:	Re: Open 7.4 items
Date:	2003-10-06 13:48:46
Message-ID:	26595.1065448126@sss.pgh.pa.us
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-hackers pgsql-patches

Jan Wieck <JanWieck(at)Yahoo(dot)com> writes:
> Does an ANALYZE run between index creation and bulk FK checking improve
> planning?

It almost certainly would, but I was assuming we had to consider this in
the context of loading existing dump files. We could think about having
pg_dump emit an automatic ANALYZE after the data loading step in the
future though.

regards, tom lane

From:	Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>
To:	Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
Cc:	Jan Wieck <JanWieck(at)Yahoo(dot)com>, Stephan Szabo <sszabo(at)megazone(dot)bigpanda(dot)com>, PostgreSQL-development <pgsql-hackers(at)postgresql(dot)org>
Subject:	Re: Open 7.4 items
Date:	2003-10-06 18:26:24
Message-ID:	200310061826.h96IQOO10578@candle.pha.pa.us
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-hackers pgsql-patches

Tom Lane wrote:
> Jan Wieck <JanWieck(at)Yahoo(dot)com> writes:
> > Does an ANALYZE run between index creation and bulk FK checking improve
> > planning?
>
> It almost certainly would, but I was assuming we had to consider this in
> the context of loading existing dump files. We could think about having
> pg_dump emit an automatic ANALYZE after the data loading step in the
> future though.

I don't have a problem with having it be slower for 7.4 upgrades if we
can make it reasonable for future loads.

From:	Rod Taylor <rbt(at)rbt(dot)ca>
To:	Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
Cc:	Jan Wieck <JanWieck(at)Yahoo(dot)com>, Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>, Stephan Szabo <sszabo(at)megazone(dot)bigpanda(dot)com>, PostgreSQL Development <pgsql-hackers(at)postgresql(dot)org>
Subject:	Re: Open 7.4 items
Date:	2003-10-06 18:43:09
Message-ID:	1065465788.69373.12.camel@jester
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-hackers pgsql-patches

> It almost certainly would, but I was assuming we had to consider this in
> the context of loading existing dump files. We could think about having
> pg_dump emit an automatic ANALYZE after the data loading step in the
> future though.

Rather than running ANALYZE, how about simply dumping out and restoring
current statistics? The old stats should be close enough, even with the
lack of empty space.

From:	Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To:	Rod Taylor <rbt(at)rbt(dot)ca>
Cc:	Jan Wieck <JanWieck(at)Yahoo(dot)com>, Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>, Stephan Szabo <sszabo(at)megazone(dot)bigpanda(dot)com>, PostgreSQL Development <pgsql-hackers(at)postgresql(dot)org>
Subject:	Re: Open 7.4 items
Date:	2003-10-06 18:49:51
Message-ID:	303.1065466191@sss.pgh.pa.us
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-hackers pgsql-patches

Rod Taylor <rbt(at)rbt(dot)ca> writes:
> Rather than running ANALYZE, how about simply dumping out and restoring
> current statistics?

Nope. That would assume that the stats are the same across revisions.
Not to mention requiring superuser privileges.

regards, tom lane

From:	Gaetano Mendola <mendola(at)bigfoot(dot)com>
To:	"pgsql-hackers(at)postgresql(dot)org" <pgsql-hackers(at)postgresql(dot)org>
Cc:	Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>
Subject:	Re: Open 7.4 items
Date:	2003-10-06 19:17:18
Message-ID:	3F81BFBE.7020803@bigfoot.com
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-hackers pgsql-patches

Bruce Momjian wrote:
> P O S T G R E S Q L
>
> 7 . 4 O P E N I T E M S
>
>
> Current at ftp://momjian.postgresql.org/pub/postgresql/open_items.
>

On the same folder there is: PITR_20020822_02.gz

tell me that we are near to have it :-)

Regards
Gaetano Mendola

From:	Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>
To:	Neil Conway <neilc(at)samurai(dot)com>
Cc:	Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>, Stephan Szabo <sszabo(at)megazone(dot)bigpanda(dot)com>, Jan Wieck <JanWieck(at)Yahoo(dot)com>, PostgreSQL Hackers <pgsql-hackers(at)postgresql(dot)org>, PostgreSQL Patches <pgsql-patches(at)postgresql(dot)org>
Subject:	Re: [HACKERS] Open 7.4 items
Date:	2003-10-10 02:08:49
Message-ID:	200310100208.h9A28nn28716@candle.pha.pa.us
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Lists:	pgsql-hackers pgsql-patches

Patch applied. Thanks.

---------------------------------------------------------------------------

Neil Conway wrote:
> On Sun, 2003-10-05 at 19:58, Tom Lane wrote:
> > That's a fairly useless place to put it, though, since someone would
> > only think to look at sort_mem if they already had a clue. It should
> > be mentioned under bulk data load (in performance tips chapter)
>
> Attached is a doc patch that does this. The way I've worded it may not
> be the best, though.
>
> > and perhaps also in dump/restore procedures.
>
> It's already mentioned there.
>
> Should we also suggest turning off fsync when doing restores?
>
> (BTW, is there a reason the docs consistently call them "B-tree
> indexes", not "B+-tree indexes"?)
>
> -Neil
>

[ Attachment, skipping... ]

>
> ---------------------------(end of broadcast)---------------------------
> TIP 5: Have you checked our extensive FAQ?
>
> http://www.postgresql.org/docs/faqs/FAQ.html