Re: Implementing full UTF-8 support (aka supporting 0x00)

From: Geoff Winkless <pgsqladmin(at)geoff(dot)dj>
To: Álvaro Hernández Tortosa <aht(at)8kdata(dot)com>
Cc: PostgreSQL Hackers <pgsql-hackers(at)postgresql(dot)org>
Subject: Re: Implementing full UTF-8 support (aka supporting 0x00)
Date: 2016-08-03 19:42:40
Message-ID: CAEzk6feZ3PcQhSPEitH-5_QuQNGq4FjkXiEO8+ocRQmVtj7Mig@mail.gmail.com
Lists: pgsql-hackers

On 3 August 2016 at 20:36, Álvaro Hernández Tortosa <aht(at)8kdata(dot)com> wrote:
> Isn't the correct syntax something like:
>
> select E'\uc080', U&'\c080';
>
> ?
>
> It is a single character, 16 bit unicode sequence (see
> https://www.postgresql.org/docs/current/static/sql-syntax-lexical.html).

No, what you've done there is create the three-byte UTF-8 sequence \xec8280:

# select U&'\c080'::bytea;
  bytea
----------
 \xec8280

That is the code point U+C080 encoded as UTF-8, not the raw two-byte sequence c0 80.
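
To make the distinction concrete, here is a minimal Python sketch (not from the original message) that reproduces the three bytes above from the code point U+C080 and contrasts them with the raw bytes c0 80. The helper name is_valid_utf8 is made up for illustration, and the assumption that c0 80 refers to the overlong two-byte form of U+0000 comes from the thread subject, not from this message.

```python
# Code point U+C080, which is what U&'\c080' denotes in SQL.
code_point = 0xC080

# Code points in U+0800..U+FFFF take three bytes in UTF-8.
encoded = chr(code_point).encode("utf-8")

# The same result by hand, using the layout 1110xxxx 10xxxxxx 10xxxxxx.
manual = bytes([
    0xE0 | (code_point >> 12),           # top 4 bits
    0x80 | ((code_point >> 6) & 0x3F),   # middle 6 bits
    0x80 | (code_point & 0x3F),          # low 6 bits
])


def is_valid_utf8(data: bytes) -> bool:
    """Hypothetical helper: True if a strict decoder accepts the bytes."""
    try:
        data.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


print(encoded.hex())                   # bytes for U+C080
print(manual == encoded)               # hand-built bytes match
print(is_valid_utf8(b"\xc0\x80"))      # raw c0 80 under strict UTF-8
```

The raw pair c0 80 is an overlong form, so a strict UTF-8 decoder rejects it; an escape that names a code point can never produce those two bytes.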

Geoff
