(neon.tech)

265 points davidgomes | 3 comments | 17 Oct 24 21:23 UTC | HN request time: 0.736s | source

Show context

paulryanrogers ◴[17 Oct 24 23:59 UTC] No.41875055[source]▶

Upgrades are hard. There was no replication in the before times. The original block-level replication didn't work among different major versions. Slony was a painful workaround based on triggers that amplified writes.

Newer PostgreSQL versions are better. Yet still not quite as robust or easy as MySQL.

At a certain scale even MySQL upgrades can be painful. At least when you cannot spare more than a few minutes of downtime.

replies(7): >>41875126 #>>41876174 #>>41876232 #>>41876375 #>>41877029 #>>41877268 #>>41877959 #

api ◴[18 Oct 24 00:10 UTC] No.41875126[source]▶

>>41875055 #

I've always wondered why Postgres is so insanely popular. I mean it has some nice things like very powerful support for a very comprehensive subset of SQL functionality, but most apps don't need all that.

It really feels like early 1990s vintage Unix software. It's clunky and arcane and it's hard to feel confident doing anything complex with it.

replies(9): >>41875168 #>>41875240 #>>41875306 #>>41875587 #>>41876346 #>>41876770 #>>41877119 #>>41877951 #>>41878394 #

tpmoney ◴[18 Oct 24 00:42 UTC] No.41875306[source]▶

>>41875126 #

> I've always wondered why Postgres is so insanely popular.

In no particular order, my preference for postgres is driven by:

  * Date / time functions that don't suck
  * UTF-8 is really UTF-8
  * 99% of a backup can be done live with nothing more than rsyncing the data directory and the WAL files
  * Really comprehensive documentation
  * LTREE and fuzzy string match extensions
  * Familiarity from using it for years

MySQL/Maria I'm sure is fine, but it's one of hose things where it's just different enough and I haven't encountered a compelling use case for changing my preference.

replies(3): >>41875336 #>>41876296 #>>41879736 #

fhdsgbbcaA ◴[18 Oct 24 00:49 UTC] No.41875336[source]▶

>>41875306 #

UTF-8 is what made me switch. It’s insane MySQL has something called UTF-8 that isn't really UTF-8, but do have a type UTF8MB4 that actually is correct. This means if you use UFT-8 in MySQL, you can’t use emoji for example.

replies(3): >>41875430 #>>41876285 #>>41878032 #

bastawhiz ◴[18 Oct 24 01:01 UTC] No.41875430[source]▶

>>41875336 #

And the fact that adding real utf-8 support limited (limits?) the length of strings that can be indexed

replies(1): >>41875764 #

evanelias ◴[18 Oct 24 02:05 UTC] No.41875764{3}[source]▶

>>41875430 #

Postgres limits btree keys to 2704 bytes, which is actually slightly smaller than MySQL's limit of 3072 bytes, assuming the default InnoDB storage engine.

That said, when using utf8mb4 in an index key, MySQL uses the "worst case" of each character being 4 bytes. So it effectively limits the max key size to 3072/4 = 768 characters, when a column is using the utf8mb4 character set.

For practical purposes, this doesn't cause much pain, as it's generally inadvisable to use complete long-ish strings as a key. And there are various workarounds, like using prefixes or hashes as the key, or using binary strings as keys to get the full 3072 bytes (if you don't need collation behaviors).

replies(1): >>41875901 #

bastawhiz ◴[18 Oct 24 02:36 UTC] No.41875901{4}[source]▶

>>41875764 #

> So it effectively limits the max key size to 3072/4 = 768 characters, when a column is using the utf8mb4 character set.

This is exactly what I mean. 768 characters for an index is woefully bad. And for no obviously great reason: you can just index the encoded UTF-8 text.

This was literally reason why a former company (who will remain nameless) refused to add Unicode support. It's not even an imagined problem.

replies(2): >>41876001 #>>41876347 #

evanelias ◴[18 Oct 24 03:01 UTC] No.41876001{5}[source]▶

>>41875901 #

Why did you need to index fairly long strings in their entirety in a way that preserves collation behaviors?

And why is a 768 character limit woefully bad, but a 2704 character limit is totally fine?

replies(2): >>41876259 #>>41876450 #

bastawhiz ◴[18 Oct 24 04:08 UTC] No.41876259{6}[source]▶

>>41876001 #

A URL, for instance, can't be safely stored in 768 characters, but it can be stored safely in 2704. If you then wanted to sort those URLs so that all URLs for each domain and path within that domain are adjacent, you need an index. Especially if you want to paginate over them with a cursor. Doing that without an index on the raw value is a royal pain in the ass.

Hell, even just being able to sort user-submitted strings up to a kilobyte. Why up to a kilobyte? Some users have strings that are kind of long. If I have to define a second column that's the truncated prefix, that's just a silly waste of space because MySQL decided to use utf-32 under the hood.

replies(3): >>41876474 #>>41876664 #>>41876823 #

1. The_Colonel ◴[18 Oct 24 06:25 UTC] No.41876823[source]▶

>>41876259 #

> it can be stored safely in 2704

No, it can't. URL doesn't have any length limit, regardless of the fact that different software will impose different limits.

replies(1): >>41880612 #

2. bastawhiz ◴[18 Oct 24 15:57 UTC] No.41880612[source]▶

>>41876823 (TP) #

Browser address bars have a limit of 2048, so if that's your use case, yes it's safe.

replies(1): >>41885721 #

3. The_Colonel ◴[19 Oct 24 04:54 UTC] No.41885721[source]▶

>>41880612 #

Safari has 80 000, Firefox 65K.

There are plenty of needs to store URLs which will never go through a browser.

You can only claim that "some URL use cases" can be stored in 2048 characters.

↑

Why does everyone run ancient Postgres versions?