Re: Differences in UTF8 between 8.0 and 8.1
- From: Christopher Kings-Lynne <chriskl(at)familyhealth(dot)com(dot)au>
- To: Paul Lindner <lindner(at)inuus(dot)com>
- Cc: andrew(at)supernews(dot)com, pgsql-hackers(at)postgresql(dot)org
- Subject: Re: Differences in UTF8 between 8.0 and 8.1
- Date: Thu, 27 Oct 2005 09:49:48 +0800
- Message-id: <4360323C(dot)2020701(at)familyhealth(dot)com(dot)au>
However, I'm running into another problem now. The command:
iconv -c -f UTF8 -t UTF8
does strip out the invalid characters. However, iconv reads the
entire file into memory before it writes out any data. This is not so
good for multi-gigabyte dump files and doesn't allow for it to be used
in a pipe between pg_dump and psql.
Does anyone have any other recommendations? GNU recode might do it, but
I'm a bit stymied by the syntax. A quick Perl script using
Text::Iconv didn't work either. I'm off to look at some other Perl
modules and will try to create a script so I can strip out the invalid
characters.
recode UTF-8..UTF-8 < dump_in.sql > dump_out.sql
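
If a filter that works inside a pipe is needed, something along these lines might do as a rough sketch. It is written in Python rather than Perl, the function name strip_invalid_utf8 and the 1 MB chunk size are just illustrative choices, and it assumes that Python's idea of valid UTF-8 is close enough to what the 8.1 server accepts, which may not hold in every case (a NUL byte, for example, is valid UTF-8 and passes through untouched). It reads fixed-size chunks and uses an incremental decoder, so a multibyte character split across two chunks is not mistaken for garbage.

```python
import codecs
import sys


def strip_invalid_utf8(src, dst, chunk_size=1 << 20):
    """Copy src to dst chunk by chunk, dropping bytes that are not valid UTF-8.

    src and dst are binary file objects. Only one chunk is held in memory
    at a time, so the input can be arbitrarily large.
    """
    # The incremental decoder buffers an incomplete trailing sequence until
    # the next chunk arrives; errors="ignore" silently drops invalid bytes.
    decoder = codecs.getincrementaldecoder("utf-8")(errors="ignore")
    while True:
        chunk = src.read(chunk_size)
        if not chunk:
            break
        dst.write(decoder.decode(chunk).encode("utf-8"))
    # Flush the decoder; a truncated sequence at end of input is dropped.
    dst.write(decoder.decode(b"", final=True).encode("utf-8"))


if __name__ == "__main__":
    strip_invalid_utf8(sys.stdin.buffer, sys.stdout.buffer)
```

Saved under a hypothetical name such as strip_utf8.py, it could sit between the two programs, e.g. pg_dump olddb | python3 strip_utf8.py | psql newdb (database names are placeholders). It is worth comparing the output against the input on a small sample first, since the filter drops bytes without reporting them.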
Chris