Re: dollar quoting and pg_dump

Lists: pgsql-hackerspgsql-patches
From: Andrew Dunstan <andrew(at)dunslane(dot)net>
To: Postgresql Hackers <pgsql-hackers(at)postgresql(dot)org>
Subject: dollar quoting and pg_dump
Date: 2004-03-23 15:08:54
Message-ID: 40605306.5040603@dunslane.net
Views: Raw Message | Whole Thread | Download mbox | Resend email
Lists: pgsql-hackers pgsql-patches


I had a brief look at this today. Basically, I thought of adding a new
routine to dumputils.c thus:

void appendStringLiteralDQ(PQExpBuffer buf, const char *str, const
char *dqprefix)

and using it in dumping function bodies and comments on all objects,
with a prefix argument of "function" and "comment" respectively. There
might be other places where we want to use dollar quoting, but this
would be a good start, ISTM.

Basically, this routine would start with $ (+ dqprefix if not null) and
then keep adding characters (in turn "_1234567890") until that string
was not found in str, then appending "$" and using that as the delimiter.

Thoughts?

cheers

andrew


From: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To: Andrew Dunstan <andrew(at)dunslane(dot)net>
Cc: Postgresql Hackers <pgsql-hackers(at)postgresql(dot)org>
Subject: Re: dollar quoting and pg_dump
Date: 2004-03-23 15:28:16
Message-ID: 20521.1080055696@sss.pgh.pa.us
Views: Raw Message | Whole Thread | Download mbox | Resend email
Lists: pgsql-hackers pgsql-patches

Andrew Dunstan <andrew(at)dunslane(dot)net> writes:
> ... using it in dumping function bodies and comments on all objects,
> with a prefix argument of "function" and "comment" respectively. There
> might be other places where we want to use dollar quoting, but this
> would be a good start, ISTM.

Do we really need to be that verbose? Why not start with the minimal $$
and extend only if needed? On the KISS principle, trying "$$", "$X$",
"$XX$", "$XXX$", etc seems sufficient.

For that matter, I'm not convinced we should use $$ for comments. They
don't have nearly the problem that functions do with embedded quotes.

A thought: maybe just put this logic into the regular
appendStringLiteral routine, and trigger it when the string contains any
quotes or backslashes; if it has none, you can just use quotes ...

BTW, I've been holding off making this change myself, realizing that it
will completely break backwards compatibility of pg_dump output to 7.4
and earlier. Not sure if anyone is trying to use CVS tip pg_dump with
older releases, but it seems possible given that the dump ordering issue
is finally solved. Might be a good idea to make it disablable with a
fallback to regular quoting.

regards, tom lane


From: Andrew Dunstan <andrew(at)dunslane(dot)net>
To: Postgresql Hackers <pgsql-hackers(at)postgresql(dot)org>
Subject: Re: dollar quoting and pg_dump
Date: 2004-03-23 16:03:05
Message-ID: 40605FB9.4010306@dunslane.net
Views: Raw Message | Whole Thread | Download mbox | Resend email
Lists: pgsql-hackers pgsql-patches

Tom Lane wrote:

>Andrew Dunstan <andrew(at)dunslane(dot)net> writes:
>
>
>>... using it in dumping function bodies and comments on all objects,
>>with a prefix argument of "function" and "comment" respectively. There
>>might be other places where we want to use dollar quoting, but this
>>would be a good start, ISTM.
>>
>>
>
>Do we really need to be that verbose? Why not start with the minimal $$
>and extend only if needed? On the KISS principle, trying "$$", "$X$",
>"$XX$", "$XXX$", etc seems sufficient.
>
>

It's a matter of taste, I guess. I'm certainly not dogmatic about it.
The function design in my head is flexible enough for either.

>For that matter, I'm not convinced we should use $$ for comments. They
>don't have nearly the problem that functions do with embedded quotes.
>
>

Well, I keep a master schema file and I like to decorate it with fairly
verbose comments, so users can see what the object is for and how it
works. I've been caught a few times with forgetting to double quotes
inside the comments. But again, Maybe you are right, and we should at
least start with just the obvious case.

>A thought: maybe just put this logic into the regular
>appendStringLiteral routine, and trigger it when the string contains any
>quotes or backslashes; if it has none, you can just use quotes ...
>
>

I did think of fallback, but rejected it on the KISS principle :-) I
also prefer consistency in style - I want all my functions dollar quoted
even if they don't currently contain characters in need of escape.

>BTW, I've been holding off making this change myself, realizing that it
>will completely break backwards compatibility of pg_dump output to 7.4
>and earlier. Not sure if anyone is trying to use CVS tip pg_dump with
>older releases, but it seems possible given that the dump ordering issue
>is finally solved. Might be a good idea to make it disablable with a
>fallback to regular quoting.
>
>

Makes sense. "-X disable-dollar-quoting"? Or we could have it turned off
by default and require it to be specifically turned on - that might
conform to the principle of least surprise, at least for now.

For that matter, we could also have a "verbose-dollar-quoting" feature,
and/or a "dollar-quote-objects=functions,comments,......" feature

But let's walk before we start to run ;-)

cheers

andrew


From: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To: Andrew Dunstan <andrew(at)dunslane(dot)net>
Cc: Postgresql Hackers <pgsql-hackers(at)postgresql(dot)org>
Subject: Re: dollar quoting and pg_dump
Date: 2004-03-23 16:22:45
Message-ID: 21156.1080058965@sss.pgh.pa.us
Views: Raw Message | Whole Thread | Download mbox | Resend email
Lists: pgsql-hackers pgsql-patches

Andrew Dunstan <andrew(at)dunslane(dot)net> writes:
> Tom Lane wrote:
>> A thought: maybe just put this logic into the regular
>> appendStringLiteral routine, and trigger it when the string contains any
>> quotes or backslashes; if it has none, you can just use quotes ...

> I did think of fallback, but rejected it on the KISS principle :-) I
> also prefer consistency in style - I want all my functions dollar quoted
> even if they don't currently contain characters in need of escape.

Good point. Never mind that idea then.

>> Might be a good idea to make it disablable with a
>> fallback to regular quoting.

> Makes sense. "-X disable-dollar-quoting"? Or we could have it turned off
> by default and require it to be specifically turned on - that might
> conform to the principle of least surprise, at least for now.

I don't mind if it's on by default; just thinking that some people might
appreciate a way to turn it off. "-X disable-dollar-quoting" sounds
fine.

regards, tom lane


From: Andrew Dunstan <andrew(at)dunslane(dot)net>
To: "Patches (PostgreSQL)" <pgsql-patches(at)postgresql(dot)org>
Subject: dollar quoting and pg_dump
Date: 2004-03-23 19:53:06
Message-ID: 406095A2.7060205@dunslane.net
Views: Raw Message | Whole Thread | Download mbox | Resend email
Lists: pgsql-hackers pgsql-patches

Here's my attempt, as discussed earlier today. As always, comments
welcome. I did provide (and use) a fallback mechanism after all, for
the case of a function with a non-empty probin. A few examples from a
dump of the regression db:

--
-- Name: tg_hub_adjustslots(character, integer, integer); Type:
FUNCTION; Schema
: public; Owner: andrew
--

CREATE FUNCTION tg_hub_adjustslots(character, integer, integer) RETURNS
integer
AS $_$
declare
hname alias for $1;
oldnslots alias for $2;
newnslots alias for $3;
begin
if newnslots = oldnslots then
return 0;
end if;
if newnslots < oldnslots then
delete from HSlot where hubname = hname and slotno > newnslots;
return 0;
end if;
for i in oldnslots + 1 .. newnslots loop
insert into HSlot (slotname, hubname, slotno, slotlink)
values ('HS.dummy', hname, i, '');
end loop;
return 0;
end;
$_$
LANGUAGE plpgsql;

--
-- Name: ttdummy(); Type: FUNCTION; Schema: public; Owner: andrew
--

CREATE FUNCTION ttdummy() RETURNS "trigger"
AS '/home/andrew/pgwork/tip/pgsql/src/test/regress/regress.so',
'ttdummy'
LANGUAGE c;

--
-- Name: user_relns(); Type: FUNCTION; Schema: public; Owner: andrew
--

CREATE FUNCTION user_relns() RETURNS SETOF name
AS $$select relname
from pg_class c, pg_namespace n
where relnamespace = n.oid and
(nspname !~ 'pg_.*' and nspname <> 'information_schema') and
relkind <> 'i' $$
LANGUAGE sql;

cheers

andrew

Attachment Content-Type Size
dumpdq.patch text/plain 6.9 KB

From: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To: Andrew Dunstan <andrew(at)dunslane(dot)net>
Cc: "Patches (PostgreSQL)" <pgsql-patches(at)postgresql(dot)org>
Subject: Re: dollar quoting and pg_dump
Date: 2004-03-23 22:07:35
Message-ID: 7007.1080079655@sss.pgh.pa.us
Views: Raw Message | Whole Thread | Download mbox | Resend email
Lists: pgsql-hackers pgsql-patches

Andrew Dunstan <andrew(at)dunslane(dot)net> writes:
> Here's my attempt, as discussed earlier today. As always, comments
> welcome. I did provide (and use) a fallback mechanism after all, for
> the case of a function with a non-empty probin.

Applied with light editorializing ("fallback" seemed a weird description
of what that function was doing, for instance).

At this point I think we are done with the coding part of this project,
and it's time to get to work on the documentation ...

regards, tom lane


From: Andrew Dunstan <andrew(at)dunslane(dot)net>
To: "Patches (PostgreSQL)" <pgsql-patches(at)postgresql(dot)org>
Subject: Re: dollar quoting and pg_dump
Date: 2004-03-23 22:15:00
Message-ID: 4060B6E4.2040005@dunslane.net
Views: Raw Message | Whole Thread | Download mbox | Resend email
Lists: pgsql-hackers pgsql-patches

Tom Lane wrote:

>Andrew Dunstan <andrew(at)dunslane(dot)net> writes:
>
>
>>Here's my attempt, as discussed earlier today. As always, comments
>>welcome. I did provide (and use) a fallback mechanism after all, for
>>the case of a function with a non-empty probin.
>>
>>
>
>Applied with light editorializing ("fallback" seemed a weird description
>of what that function was doing, for instance).
>
>At this point I think we are done with the coding part of this project,
>and it's time to get to work on the documentation ...
>
>
>

It is being worked on.

cheers

andrew


From: Christopher Kings-Lynne <chriskl(at)familyhealth(dot)com(dot)au>
To: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
Cc: Andrew Dunstan <andrew(at)dunslane(dot)net>, Postgresql Hackers <pgsql-hackers(at)postgresql(dot)org>
Subject: Re: dollar quoting and pg_dump
Date: 2004-03-24 01:36:16
Message-ID: 4060E610.3070908@familyhealth.com.au
Views: Raw Message | Whole Thread | Download mbox | Resend email
Lists: pgsql-hackers pgsql-patches

> I don't mind if it's on by default; just thinking that some people might
> appreciate a way to turn it off. "-X disable-dollar-quoting" sounds
> fine.

Does it _have_ to be dollars? Other languages call this feature
'heretext' IIRC.

Chris


From: Andrew Dunstan <andrew(at)dunslane(dot)net>
To: PostgreSQL-development <pgsql-hackers(at)postgresql(dot)org>
Subject: Re: dollar quoting and pg_dump
Date: 2004-03-24 02:28:17
Message-ID: 4060F241.2040205@dunslane.net
Views: Raw Message | Whole Thread | Download mbox | Resend email
Lists: pgsql-hackers pgsql-patches

Christopher Kings-Lynne wrote:

>> I don't mind if it's on by default; just thinking that some people might
>> appreciate a way to turn it off. "-X disable-dollar-quoting" sounds
>> fine.
>
>
> Does it _have_ to be dollars? Other languages call this feature
> 'heretext' IIRC.
>

This is not heretext, and calling it such would be quite misleading.
Nobody has yet come up with a better name for it.

cheers

andrew


From: Jon Jensen <jon(at)endpoint(dot)com>
To: pgsql-hackers(at)postgresql(dot)org
Subject: Re: dollar quoting and pg_dump
Date: 2004-03-24 02:36:52
Message-ID: Pine.LNX.4.58.0403240235300.30839@louche.swelter.net
Views: Raw Message | Whole Thread | Download mbox | Resend email
Lists: pgsql-hackers pgsql-patches

On Wed, 24 Mar 2004, Christopher Kings-Lynne wrote:

> > I don't mind if it's on by default; just thinking that some people might
> > appreciate a way to turn it off. "-X disable-dollar-quoting" sounds
> > fine.
>
> Does it _have_ to be dollars? Other languages call this feature
> 'heretext' IIRC.

No, but I think the rough consensus was that the behavior of this feature
is different enough from shell and Perl here documents that it deserved a
different name. In particular the $ is part of both the opening and
closing token, and newlines aren't relevant at either the beginning or
end.

Jon


From: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To: Christopher Kings-Lynne <chriskl(at)familyhealth(dot)com(dot)au>
Cc: Andrew Dunstan <andrew(at)dunslane(dot)net>, Postgresql Hackers <pgsql-hackers(at)postgresql(dot)org>
Subject: Re: dollar quoting and pg_dump
Date: 2004-03-24 03:27:57
Message-ID: 10030.1080098877@sss.pgh.pa.us
Views: Raw Message | Whole Thread | Download mbox | Resend email
Lists: pgsql-hackers pgsql-patches

Christopher Kings-Lynne <chriskl(at)familyhealth(dot)com(dot)au> writes:
> Does it _have_ to be dollars? Other languages call this feature
> 'heretext' IIRC.

I'm not in love with the name "dollar quoting" either ... but "here
text" would be quite misleading. See the archives for the discussions
that led us to develop our definition. The features that go by that
name usually have dependencies on formatting (ie newline boundaries)
that we specifically rejected.

Right now would be a fine time to invent a better name if you can think
of one ...

regards, tom lane