Re: jsonb format is pessimal for toast compression

From: Claudio Freire <klaussfreire(at)gmail(dot)com>
To: Larry White <ljw1001(at)gmail(dot)com>
Cc: obartunov(at)gmail(dot)com, Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>, Bruce Momjian <bruce(at)momjian(dot)us>, Peter Geoghegan <pg(at)heroku(dot)com>, Stephen Frost <sfrost(at)snowman(dot)net>, Robert Haas <robertmhaas(at)gmail(dot)com>, Kevin Grittner <kgrittn(at)ymail(dot)com>, "pgsql-hackers(at)postgresql(dot)org" <pgsql-hackers(at)postgresql(dot)org>
Subject: Re: jsonb format is pessimal for toast compression
Date: 2014-08-14 20:12:08
Message-ID: CAGTBQpY2rP3abfyhkA0GzAMoMRp0VDsZdthpR608d9vEM4q2eA@mail.gmail.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-hackers

On Thu, Aug 14, 2014 at 4:24 PM, Claudio Freire <klaussfreire(at)gmail(dot)com> wrote:
> On Thu, Aug 14, 2014 at 3:49 PM, Larry White <ljw1001(at)gmail(dot)com> wrote:
>> I attached a json file of approximately 513K. It contains two repetitions of
>> a single json structure. The values are quasi-random. It might make a decent
>> test case of meaningfully sized data.
>
>
> I have a 59M in plain SQL (10M compressed, 51M on-disk table size)
> collection of real-world JSON data.
>
> This data is mostly counters and anciliary info stored in json for the
> flexibility, more than anything else, since it's otherwise quite
> structured: most values share a lot between each other (in key names)
> but there's not much redundancy within single rows.
>
> Value length stats (in text format):
>
> min: 14
> avg: 427
> max: 23239
>
> If anyone's interested, contact me personally (I gotta anonimize the
> info a bit first, since it's production info, and it's too big to
> attach on the ML).

Oh, that one has a 13k toast, not very interesting.

But I've got another (very similar), 47M table, 40M toast, length distribution:

min: 19
avg: 474
max: 20370

Not sure why it's got a bigger toast having a similar distribution.
Tells just how meaningless min/avg/max stats are :(

In response to

Browse pgsql-hackers by date

  From Date Subject
Next Message Rukh Meski 2014-08-14 21:18:12 LIMIT for UPDATE and DELETE
Previous Message Kevin Grittner 2014-08-14 20:07:38 Re: Immediate standby promotion