ULID: Universally Unique Lexicographically Sortable Identifier

(packagemain.tech)

68 points der_gopher | 5 comments | 01 Dec 25 20:45 UTC | HN request time: 0.002s | source

Show context

sedatk ◴[09 Dec 25 22:26 UTC] No.46211578[source]▶

Whenever ULID comes up, I need to remind that it has a sequential ID generation mode in its spec which is prone to conflicts on multi-threads, processes or hosts which kills the purpose of a "universal" identifier. If you need a sequential ID, just use an integer, preferably one that's autoincremented by the database.

It's best to stick to UUIDv7 because of such quirks of ULID.

replies(6): >>46212131 #>>46212144 #>>46212658 #>>46212724 #>>46213359 #>>46213609 #

cpburns2009 ◴[09 Dec 25 23:26 UTC] No.46212131[source]▶

>>46211578 #

> I need to remind that it has a sequential ID generation mode in its spec which is prone to conflicts on multi-threads, processes or hosts which kills the purpose of a "universal" identifier.

Can you expand on how this can actually cause a problem? My understanding is different processes and hosts should never conflict because of the 80 bits of random data. The only way I can conceive of a conflict is multiple threads using the same non-thread-safe generator during the same millisecond.

replies(1): >>46212222 #

sedatk ◴[09 Dec 25 23:39 UTC] No.46212222[source]▶

>>46212131 #

You're right, not hosts or processes in that case. I forgot about random part as it's been a while since I looked at it. However, a single instance of a ULID generator must support this mode, which means that on multi-threaded architectures, it must lock the sequence as it still uses a single random value. That again, kills the purpose of a client-side, lock-free generation of universal identifiers as you said.

replies(2): >>46212315 #>>46212388 #

0x457 ◴[10 Dec 25 00:03 UTC] No.46212388[source]▶

>>46212222 #

You only need to lock sequence if you care about IDs being ordered within a millisecond. That generally only matters when you create a batch of IDs at once, in that case you don't need to lock anything: generate ULID, keep incrementing sequence in that batch either by doing on the same thread, or by moving it from thread to thread. Kinda like creating an iterator and zip'ing it with iterator of thing you need IDs for.

I've switched to using UUIDv7 tho. It made sense to use ULID before v7, but now ULID only has one thing going on - smaller string representation. That doesn't matter if your storage can store UUIDs natively (i.e. as 128 bit integer)

If your goal is to have global order intact, then neither ULID nor UUIDv7 is going to work for you.

replies(2): >>46212558 #>>46212830 #

1. vbezhenar ◴[10 Dec 25 00:25 UTC] No.46212558{3}[source]▶

>>46212388 #

I hope that's not literally incrementing a sequence. Because it would lead to trivial neighbor ID guessing attacks.

I've implemented this thing, though not called it ULID. I've dedicated some bits for timestamp, some bits for counter within millisecond and rest for randomness. So they always ordered and always unpredictable.

Another approach is to keep latest generated UUID and if new UUID requested within the same timestamp - generate random part until it's greater than previous one. I think that's pretty good approach as well.

replies(2): >>46212967 #>>46212980 #

2. sedatk ◴[10 Dec 25 01:24 UTC] No.46212967[source]▶

>>46212558 (TP) #

> I hope that's not literally incrementing a sequence

It's literally incrementing it by one:

https://github.com/ulid/javascript/blob/11c2067821ee19e4dc78...

replies(1): >>46214757 #

3. jasonwatkinspdx ◴[10 Dec 25 01:26 UTC] No.46212980[source]▶

>>46212558 (TP) #

> I hope that's not literally incrementing a sequence. Because it would lead to trivial neighbor ID guessing attacks.

It is and it does.

Also the ULID spec suggests you use a CSPRNG, but doesn't mandate that or provide specific advice on appropriate algorithms. So in practice people may reach for whatever hash function is convenient in their project, which may just be FNV or similar with considerably weaker randomness too.

4. vbezhenar ◴[10 Dec 25 06:34 UTC] No.46214757[source]▶

>>46212967 #

Well, that makes little sense for me, you can just use numeric identifier instead. Bulk inserts which generate identifiers in bulk are commonly used.

But that's easy to fix, so just implementation quirk for this particular library, the idea is sound.

replies(1): >>46215297 #

5. sedatk ◴[10 Dec 25 08:07 UTC] No.46215297{3}[source]▶

>>46214757 #

> But that's easy to fix, so just implementation quirk for this particular library, the idea is sound.

It's in ULID spec.

↑