353 points by HunOL | 34 comments
1. asa400 ◴[] No.45781519[source]
In SQLite, transactions by default start in “deferred” mode. This means they do not take a write lock until they attempt to perform a write.

You get SQLITE_BUSY when transaction #1 starts in read mode, transaction #2 starts in write mode, and then transaction #1 attempts to upgrade from read to write mode while transaction #2 still holds the write lock.

The fix is to set a busy_timeout and to begin any transaction that does a write (any write, even if it is not the first operation in the transaction) in “immediate” mode rather than “deferred” mode.

https://zeroclarkthirty.com/2024-10-19-sqlite-database-is-lo...
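
For illustration, a minimal sketch of that fix in Go, assuming the mattn/go-sqlite3 driver (which exposes the busy timeout and the transaction locking mode as DSN parameters); the counters table is just a stand-in, and for simplicity every transaction is opened immediate here:

    package main

    import (
        "database/sql"
        "log"

        _ "github.com/mattn/go-sqlite3" // assumed driver; registers the "sqlite3" name
    )

    func main() {
        // _busy_timeout: wait up to 5s for a lock instead of failing immediately.
        // _txlock=immediate: every transaction takes the write lock at BEGIN, so a
        // read can never fail when it later tries to upgrade to a write.
        db, err := sql.Open("sqlite3",
            "file:app.db?_journal_mode=WAL&_busy_timeout=5000&_txlock=immediate")
        if err != nil {
            log.Fatal(err)
        }
        defer db.Close()

        tx, err := db.Begin() // issued as BEGIN IMMEDIATE because of _txlock
        if err != nil {
            log.Fatal(err)
        }
        if _, err := tx.Exec(`UPDATE counters SET n = n + 1 WHERE id = ?`, 1); err != nil {
            tx.Rollback()
            log.Fatal(err)
        }
        if err := tx.Commit(); err != nil {
            log.Fatal(err)
        }
    }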

replies(9): >>45781577 #>>45781605 #>>45781634 #>>45781639 #>>45782026 #>>45782543 #>>45783088 #>>45787431 #>>45789444 #
2. tlaverdure ◴[] No.45781577[source]
Yes, these are both important points. I didn't see any mention of SQLITE_BUSY in the blog post, and I wonder whether a busy_timeout was ever configured - something people miss quite often.
3. BobbyTables2 ◴[] No.45781605[source]
That's the best explanation I’ve seen of this issue.

However, it screams of a broken implementation.

Imagine if Linux PAM logins randomly failed if someone else was concurrently changing their password or vice versa.

In no other application would random failures due to concurrency be tolerated.

SQLite is broken by design; the world shouldn’t give them a free pass.

replies(1): >>45781628 #
4. asa400 ◴[] No.45781628[source]
SQLite is a truly remarkable piece of software that is a victim both of its own success and its unwavering commitment to backward compatibility. It has its quirks. There are definitely things we can learn from it.
5. mickeyp ◴[] No.45781634[source]
Indeed. Everyone who uses sqlite will get burnt by this one day and spend a lot of time chasing down errant write-upgraded transactions that cling on for a little bit longer than intended.
replies(1): >>45789496 #
6. summarity ◴[] No.45781639[source]
I've always tried to avoid situations that could lead to SQLITE_BUSY. SQLITE_BUSY is an architecture smell. For standard SQLite in WAL, I usually structure an app with a read "connection" pool, and a single-entry write connection pool. Making the application aware of who _actually_ holds the write lock gives you the ability to proactively design access patterns instead of reacting in the moment, and to get observability into lock contention, etc.
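
One possible shape for that single-entry write pool, sketched with Go's database/sql (the Writer type, the capacity-1 channel, and the 10 ms logging threshold are all just illustrative):

    package appdb

    import (
        "context"
        "database/sql"
        "log"
        "time"
    )

    // Writer wraps the single write connection so the application, not SQLite,
    // decides who holds the write lock, and so contention becomes observable.
    type Writer struct {
        db   *sql.DB       // opened separately from the read pool
        slot chan struct{} // single-entry "pool": capacity 1
    }

    func NewWriter(db *sql.DB) *Writer {
        db.SetMaxOpenConns(1) // one underlying write connection
        return &Writer{db: db, slot: make(chan struct{}, 1)}
    }

    // Do runs fn in a transaction while holding the application-level write slot,
    // and logs how long the caller waited for it - a direct measure of contention.
    func (w *Writer) Do(ctx context.Context, fn func(*sql.Tx) error) error {
        start := time.Now()
        select {
        case w.slot <- struct{}{}:
        case <-ctx.Done():
            return ctx.Err()
        }
        defer func() { <-w.slot }()
        if wait := time.Since(start); wait > 10*time.Millisecond {
            log.Printf("waited %v for the write slot", wait)
        }

        tx, err := w.db.BeginTx(ctx, nil)
        if err != nil {
            return err
        }
        if err := fn(tx); err != nil {
            tx.Rollback()
            return err
        }
        return tx.Commit()
    }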
replies(3): >>45781655 #>>45782042 #>>45788855 #
7. mickeyp ◴[] No.45781655[source]
I mean, you're not wrong, and that is one way to solve it, but the whole point of a sensibly-designed WAL -- never mind database engine -- is that you do not need to commit to some sort of actor model to get your db to serialise writes.
replies(1): >>45782616 #
8. simonw ◴[] No.45782026[source]
Yeah I read the OP and my first instinct was that this is SQLITE_BUSY. I've been collecting posts about that here: https://simonwillison.net/tags/sqlite-busy/
replies(3): >>45786047 #>>45788998 #>>45789479 #
9. simonw ◴[] No.45782042[source]
Even with that pattern (which I use too) you still need to ensure those write operations always start a transaction at the beginning in order to avoid SQLITE_BUSY.
replies(1): >>45783006 #
10. kijin ◴[] No.45782543[source]
Wouldn't that "fix" make the problem worse on the whole, by making transactions hold onto write locks longer than necessary? (Not trying to disagree, just curious about potential downsides.)
replies(1): >>45785130 #
11. sethev ◴[] No.45782616{3}[source]
These are performance optimizations. SQLite does serialize writes. Avoiding concurrent writes to begin with just avoids some overhead on locking.
replies(2): >>45782671 #>>45782743 #
12. mickeyp ◴[] No.45782671{4}[source]
"performance optimisation" --- yeees, well, if you don't care about data integrity between your reads and writes. Who knows when those writes you scheduled really get written. And what of rollbacks due to constraint violations? There's we co-locate transactions with code: they are intertwined. But yes, a queue-writer is fine for a wide range of tasks, but not everything.

It's that we need to contort our software to make sqlite not suck at writes that is the problem.

replies(2): >>45782838 #>>45785847 #
13. ncruces ◴[] No.45782743{4}[source]
SQLite, for the most part, uses polling locks. That means it checks if a lock is available to be taken, and if it's not, it sleeps for a bit, then checks again, until this times out.

This becomes increasingly inefficient as contention increases, as you can easily get into a situation where everyone is sleeping, waiting for others, for a few milliseconds.

Ensuring all, or most, writes are serialized improves this.

14. sethev ◴[] No.45782838{5}[source]
This is just FUD. The reason SQLite does locking to begin with is to avoid data corruption. Almost every statement this blog post makes about concurrency in SQLite is wrong, so it's little surprise that their application doesn't do what they expect.

>Who knows when those writes you scheduled really get written

When a commit completes for a transaction, that transaction has been durably written. No mystery. That's true whether you decide to restrict writes to a single thread in your application or not.

replies(2): >>45782989 #>>45783020 #
15. mickeyp ◴[] No.45782989{6}[source]
You are talking about low-level stuff like syncing to the filesystem, where data is journalled and atomicity is maintained; I am, in actual fact, not.

Dislocating DML from the code that triggers it creates many problems around ensuring proper data integrity, and it divorces you from consistent reads of uncommitted data that you may want to tightly control before committing. By punting it to a dedicated writer you're removing the ability to ensure serialised modification of your data and the ability to cleanly react to integrity errors that may arise. If you don't need that? Go ahead. But it's not FUD. We build relational, ACID-compliant databases this way for a reason.

replies(1): >>45783202 #
16. summarity ◴[] No.45783006{3}[source]
Yes, indeed. In my apps, which are mostly Nim, my pool manager ensures this always happens - along with a host of other optimizations. I often start with barebones SQLite and then later switch to LiteSync (distributed SQLite with multi-master replication), so I keep the lock management at the app level to adapt to whatever backend I'm using.
replies(1): >>45785054 #
17. catlifeonmars ◴[] No.45783020{6}[source]
> When a commit completes for a transaction, that transaction has been durably written. No mystery. That's true whether you decide to restrict writes to a single thread in your application or not.

Usually this is true but there are edge cases for certain journaled file systems. IIRC sqlite.org has a discussion on this.

replies(1): >>45783274 #
18. chasil ◴[] No.45783088[source]
In an Oracle database, there is only one process that is allowed to write to tablespace datafiles, the DBWR (or its slaves). Running transactions can write only to RAM buffers and the redo logs.

A similar design for SQLite would allow only one writer, with all other processes passing their SQL to it.
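
Within a single process, a toy sketch of that funnel in Go could look like the following (writeReq and Serve are hypothetical names; the multi-process version Oracle uses would need an IPC layer on top):

    package writer

    import "database/sql"

    // writeReq is one unit of work that other goroutines hand to the single
    // writer, loosely mirroring the "only DBWR touches the datafiles" idea.
    type writeReq struct {
        query string
        args  []any
        done  chan error
    }

    // Serve owns the only handle that ever writes; everything else sends
    // requests over the channel and waits on done for the result.
    func Serve(db *sql.DB, reqs <-chan writeReq) {
        for r := range reqs {
            _, err := db.Exec(r.query, r.args...)
            r.done <- err
        }
    }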

19. sethev ◴[] No.45783202{7}[source]
Oh, I think you're picturing executing your transaction logic and then sending writes off to a background queue. I agree, that's not a general strategy - it only works for certain cases.

I just meant that if you can structure your application to run write transactions in a single thread (the whole transaction and its associated logic, not just deferring writing the end result to a separate thread) then you minimize contention at the SQLite level.

20. zimpenfish ◴[] No.45783274{7}[source]
> there are edge cases for certain journaled file systems. IIRC sqlite.org has a discussion on this.

Can't currently find it but I guess it comes under the "if the OS or hardware lies to SQLite, what can it do?" banner?

replies(1): >>45783539 #
21. catlifeonmars ◴[] No.45783539{8}[source]
That might have been it. Overall, the whole “How to corrupt your database” article was quite a good read:

https://sqlite.org/howtocorrupt.html

22. probst ◴[] No.45785054{4}[source]
I am really curious about LiteSync. Any chance you could share a bit on your experiences with it (recognising it’s somewhat off-topic…). Do you run with multiple primaries? What sort of use cases do you reach to it for? Conflict resolution seems a bit simplistic at first glance (from the perspective of someone very into CRDTs), have you experienced any issues as a result of that?
23. asa400 ◴[] No.45785130[source]
It’s a reasonable question!

In WAL mode, writers and readers don’t interfere with each other, so you can still do pure read queries in parallel.

Only one writer is allowed at a time no matter what, so writers queue up and you have to take the write lock at some point anyway.

In general, it’s hard to say without benchmarking your own application. It will, however, get rid of SQLITE_BUSY errors firing immediately in the read-then-upgrade-to-write scenario I described. You’d be retrying the transactions that fail with SQLITE_BUSY anyway, so that retry path is what you’d need to benchmark against.

It’s a subtle problem, but I’d rather queue up writes than have to write the code that retries failed transactions that shouldn’t really be failing.
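
For reference, the retry boilerplate being avoided here might look roughly like this in Go, assuming the mattn/go-sqlite3 driver's error type (withRetry and the backoff constants are made up for illustration):

    package appdb

    import (
        "errors"
        "time"

        "github.com/mattn/go-sqlite3" // assumed driver
    )

    // withRetry re-runs fn a few times if it fails with SQLITE_BUSY,
    // backing off a little longer on each attempt.
    func withRetry(attempts int, fn func() error) error {
        var err error
        for i := 0; i < attempts; i++ {
            err = fn()
            var serr sqlite3.Error
            if !errors.As(err, &serr) || serr.Code != sqlite3.ErrBusy {
                return err // success, or a failure that isn't SQLITE_BUSY
            }
            time.Sleep(time.Duration(i+1) * 50 * time.Millisecond)
        }
        return err
    }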

24. jitl ◴[] No.45785847{5}[source]
> Who knows when those writes you scheduled really get written

I await the write to complete before my next read in my application logic, same as any other bit of code that interacts with a database or does other IO. Just because another thread handles interacting with the writer connection, doesn't mean my logic thread just walks away pretending the write finished successfully in 0ms.

25. gwking ◴[] No.45786047[source]
One tidbit that I don't see mentioned here yet is that ATTACH requires a lock. I just went looking for the documentation about this and couldn't find it, especially for WAL mode (https://www.sqlite.org/lockingv3.html mentions the super-journal, but the WAL docs do not mention ATTACH at all).

I have a python web app that creates a DB connection per request (not ideal I know) and immediately attaches 3 auxiliary DBs. This is a low traffic site but we have a serious reliability problem when load increases: the ATTACH calls occasionally fail with "database is locked". I don't know if this is because the ATTACH fails immediately without respecting the normal 5 second database timeout or what. To be honest I haven't implemented connection pooling yet because I want to understand what exactly causes this problem.

replies(1): >>45792829 #
26. liuliu ◴[] No.45787431[source]
Note that busy_timeout does not help in this case (the SQLITE_BUSY is issued immediately, with no wait).

Also, this is because of WAL mode (and I believe only WAL mode, since there are no truly concurrent reads in the other mode).

The reason is that in WAL mode pages are appended to a single log file. Hence, if you read something inside a BEGIN transaction and later want to mutate something else, another page could already have been appended, which would interfere with the strict serializability guarantee of WAL mode. Hence, SQLite has to fail at the point of the lock upgrade.

Immediate mode solves this problem because at BEGIN time (or more correctly, at the time of the first read in that transaction) a write lock is acquired, hence no page can be appended between the read and the write, unlike in deferred mode.
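
A small Go reproduction of that lock-upgrade failure, assuming the mattn/go-sqlite3 driver (the table t is a throwaway example and error handling is elided for brevity):

    package main

    import (
        "database/sql"
        "fmt"

        _ "github.com/mattn/go-sqlite3" // assumed driver
    )

    func main() {
        db, _ := sql.Open("sqlite3", "file:demo.db?_journal_mode=WAL")
        db.Exec(`CREATE TABLE IF NOT EXISTS t (id INTEGER PRIMARY KEY, n INT)`)
        db.Exec(`INSERT OR IGNORE INTO t (id, n) VALUES (1, 0)`)

        tx1, _ := db.Begin() // deferred: no lock taken yet
        var n int
        tx1.QueryRow(`SELECT n FROM t WHERE id = 1`).Scan(&n) // tx1 now reads from a fixed WAL snapshot

        db.Exec(`UPDATE t SET n = n + 1 WHERE id = 1`) // another connection appends a page to the WAL

        // tx1's snapshot is now behind the WAL head, so upgrading it to a write
        // cannot stay serializable; SQLite fails immediately, and busy_timeout
        // doesn't help because waiting cannot un-stale the snapshot.
        _, err := tx1.Exec(`UPDATE t SET n = n + 1 WHERE id = 1`)
        fmt.Println(err) // expected: "database is locked" (SQLITE_BUSY / SQLITE_BUSY_SNAPSHOT)
        tx1.Rollback()
    }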

27. andersmurphy ◴[] No.45788855[source]
Yes! This is the way.

Honestly, it's the key to getting the most out of sqlite. It also allows for transaction batching and various other forms of batching that can massively improve write throughput.

28. apitman ◴[] No.45788998[source]
The necessity of this sort of tribal knowledge kills a lot of the simplicity of sqlite for me. Honestly it seems to have a lot of footguns. I've tried to understand proper concurrent use of sqlite with Golang about 5 times and never come away feeling like I actually get it.
replies(1): >>45790538 #
29. BinaryIgor ◴[] No.45789444[source]
Also worth mentioning - it happens more often when you set journal_mode=WAL, which is not the default.

The default is DELETE mode, where the rollback journal is deleted at the conclusion of each transaction. What's more, in this (non-WAL) mode readers can coexist with each other, but they block the writer (of which there is always at most one) and the writer blocks readers - concurrency is highly limited.

In WAL mode - which you should pretty much always set - there's still at most one writer, but the writer can coexist with readers.

30. BinaryIgor ◴[] No.45789479[source]
Exactly - it's also strange that they didn't find the simple solution of setting PRAGMA busy_timeout=N; if you search for "SQLITE_BUSY: database is locked", there are plenty of solid results describing the problem and its solution.
31. BinaryIgor ◴[] No.45789496[source]
SQLite has its quirks, but in this particular case all you need is to set PRAGMA busy_timeout=<a few seconds> and the problem is solved; if you google it, it's a widely known issue with exactly this solution described.

It's just weird that it's set to 0 by default rather than something reasonable like 3000 or 5000 ms.

32. nasretdinov ◴[] No.45790538{3}[source]
With Go it's quite straightforward actually: use WAL mode + two connection pools, one for reads and the other, with MaxConnections set to 1, for writes. This way you should never encounter any concurrency issues, and Go will serialise writes for you too.
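
A sketch of that setup with database/sql, assuming the mattn/go-sqlite3 driver (other drivers spell the DSN parameters differently):

    package appdb

    import (
        "database/sql"

        _ "github.com/mattn/go-sqlite3" // assumed driver
    )

    // Open returns the two handles described above: a read pool and a write
    // "pool" capped at one connection, both in WAL mode with a busy timeout.
    func Open(path string) (read, write *sql.DB, err error) {
        dsn := "file:" + path + "?_journal_mode=WAL&_busy_timeout=5000"

        if read, err = sql.Open("sqlite3", dsn); err != nil {
            return nil, nil, err
        }

        // _txlock=immediate makes every write transaction take the write lock
        // at BEGIN, avoiding the deferred-upgrade SQLITE_BUSY discussed upthread.
        if write, err = sql.Open("sqlite3", dsn+"&_txlock=immediate"); err != nil {
            read.Close()
            return nil, nil, err
        }
        write.SetMaxOpenConns(1) // the write pool never hands out more than one connection
        return read, write, nil
    }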
replies(1): >>45791553 #
33. johannes1234321 ◴[] No.45791553{4}[source]
I have no experience in Go, but won't that be a full database lock and prevent transactions with more than one operation?
34. sgbeal ◴[] No.45792829{3}[source]
> I have a python web app that creates a DB connection per request (not ideal I know)

FWIW, "one per request per connection is bad" (for SQLite) is FUD, plain and simple. SQLite's own forum software creates one connection per request (it creates a whole forked process per request, for that matter) and we do not have any problems whatsoever with that approach.

Connection pools (with SQLite) are a solution looking for a problem, not a solution to a real problem.