SQLite Index Visualization

(mrsuh.com)

1. edweis ◴[14 Nov 24 14:01 UTC] No.42136200[source]▶

>>42134964 (OP) #

The website is so legible I want to read it.

replies(2): >>42136383 #>>42137479 #

2. IX-103 ◴[14 Nov 24 14:19 UTC] No.42136324[source]▶

>>42134964 (OP) #

> I wanted to see how a database management system (DBMS) stores an index in both disk and memory, and how it searches through an Index...I chose SQLite for my experiments

SQLite is a bit of an outlier in how it handles...everything, but even more so in query processing. SQLite tends to favor simplicity over performance, which causes it to implement things differently than every other DB I've worked with. You have to understand - SQLite isn't competing with other databases. It's competing with JSON and XML files for persistent storage. This means that how it implements anything tells you practically nothing about how a real database would do something.

replies(3): >>42138218 #>>42138240 #>>42138785 #

3. saurik ◴[14 Nov 24 14:25 UTC] No.42136383[source]▶

>>42136200 #

FWIW, I find the font size (I am on an iPhone) way too large, particularly as there is also important text in the diagrams and that text is much smaller. So while I feel a need to shove my phone away from my face to deal with the overly large body text, I then have to keep pulling it back in to feel comfortable reading the diagrams, which feel out of place.

4. srcreigh ◴[14 Nov 24 14:33 UTC] No.42136454[source]▶

>>42134964 (OP) #

Great effort!

> By default, each SQLite table row has a unique rowId, which works like a primary key if one isn’t explicitly defined.

It actually uses rowid even if you have a primary key.

You should try visualizing the primary key index for a WITHOUT ROWID table. Those indexes are my favourite

> Both Indexes look similar, but the second Index, with fewer Pages, should be faster.

Less nodes doesn’t really mean “faster”. The most important is the height of the tree.

The second most important factor is what happens when you find your value in the index. Do you need to load the rest from a separate table (rowid)? Or is the data just there for you (WITHOUT ROWID)? This matters especially for range queries (e.g. WHERE col BETWEEN 50 AND 100).
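A minimal sketch of the two table layouts discussed here, assuming Python's standard `sqlite3` module. The table names (`with_rowid`, `no_rowid`), the column names and the sample keys are hypothetical and chosen only for illustration. It shows that an ordinary table with a TEXT primary key still carries a hidden rowid while a WITHOUT ROWID table does not, and it prints the query plan SQLite reports for the same range query on each. The plan text varies between SQLite versions, so no expected output is given for it.

```python
import sqlite3

RANGE_SQL = "SELECT val FROM {table} WHERE code BETWEEN ? AND ?"


def build_demo():
    """Create an ordinary (rowid) table and a WITHOUT ROWID table with identical rows."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE with_rowid (code TEXT PRIMARY KEY, val INTEGER);
        CREATE TABLE no_rowid   (code TEXT PRIMARY KEY, val INTEGER) WITHOUT ROWID;
        """
    )
    rows = [(f"k{i:03d}", i) for i in range(200)]  # keys k000 .. k199
    conn.executemany("INSERT INTO with_rowid VALUES (?, ?)", rows)
    conn.executemany("INSERT INTO no_rowid VALUES (?, ?)", rows)
    return conn


def has_rowid(conn, table):
    """Return True if the table exposes the implicit rowid column."""
    try:
        conn.execute(f"SELECT rowid FROM {table} LIMIT 1").fetchall()
        return True
    except sqlite3.OperationalError:  # "no such column: rowid"
        return False


def range_values(conn, table, lo, hi):
    """Run the range query and return the matching values."""
    sql = RANGE_SQL.format(table=table)
    return [val for (val,) in conn.execute(sql, (lo, hi))]


def query_plan(conn, table, lo, hi):
    """Return the human-readable detail column of EXPLAIN QUERY PLAN."""
    sql = "EXPLAIN QUERY PLAN " + RANGE_SQL.format(table=table)
    return [row[-1] for row in conn.execute(sql, (lo, hi))]


conn = build_demo()
for table in ("with_rowid", "no_rowid"):
    print(table, "has rowid:", has_rowid(conn, table))
    print("  plan:", query_plan(conn, table, "k050", "k100"))
```

Both tables return the same rows for the range query; the difference is in where the row data lives relative to the primary key, which is what the printed plans hint at.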

replies(2): >>42137197 #>>42147378 #

5. kevincox ◴[14 Nov 24 15:34 UTC] No.42137197[source]▶

>>42136454 #

> Less nodes doesn’t really mean “faster”. The most important is the height of the tree.

For a single access in isolation, yes. But when an index is accessed frequently, its overall size can be very important for the cache hit rate.
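A rough back-of-the-envelope sketch of both points, in Python. It assumes an idealised B-tree in which every page is full and holds `fanout` entries; the fanout values and the key count are made-up numbers, not measurements from SQLite. Under that assumption, two indexes can have the same height (the same number of page reads per lookup) while one of them needs half as many leaf pages, which is the part that matters for cache footprint.

```python
def btree_height(n_keys, fanout):
    """Levels needed to address n_keys when every page holds `fanout` entries (idealised)."""
    height, capacity = 1, fanout
    while capacity < n_keys:
        capacity *= fanout
        height += 1
    return height


def leaf_pages(n_keys, fanout):
    """Number of full leaf pages needed for n_keys (ceiling division)."""
    return -(-n_keys // fanout)


N_KEYS = 1_000_000  # hypothetical index size
for fanout in (100, 200):  # hypothetical entries per page
    print(fanout, btree_height(N_KEYS, fanout), leaf_pages(N_KEYS, fanout))
# prints:
# 100 3 10000
# 200 3 5000
```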

6. bgalbs ◴[14 Nov 24 15:57 UTC] No.42137479[source]▶

>>42136200 #

Yeah, such a relief to see content w/o super dense ad loads, etc. Very cool article.

7. salviati ◴[14 Nov 24 16:51 UTC] No.42138105[source]▶

>>42134964 (OP) #

The term "indexes" serves both as the third-person singular present tense of the verb "to index" and as a plural noun form of "index." In contrast, "indices" is the traditional plural form of "index," particularly prevalent in mathematical and scientific contexts. While "indexes" is commonly used in general English, "indices" is often preferred in technical fields to maintain linguistic precision. Employing "indices" in such contexts helps distinguish between the action of indexing and the plural form of index, thereby enhancing clarity.

replies(4): >>42138511 #>>42138522 #>>42138564 #>>42140104 #

8. cogman10 ◴[14 Nov 24 17:00 UTC] No.42138218[source]▶

>>42136324 #

Meh, it isn't really too far off from the way other DBMS servers handle storage and indexes. The principles are pretty identical (especially when sqlite operates in WAL mode).

9. ngrilly ◴[14 Nov 24 17:01 UTC] No.42138240[source]▶

>>42136324 #

SQLite is a real database engine. I guess what you mean is that SQLite is not competing with database servers.

replies(1): >>42148397 #

10. gloflo ◴[14 Nov 24 17:20 UTC] No.42138511[source]▶

>>42138105 #

Says who with what authority?

All major RDBMS use the term "indexes".

11. CharlesW ◴[14 Nov 24 17:20 UTC] No.42138522[source]▶

>>42138105 #

FWIW, both are fine (https://www.nasdaq.com/articles/indexes-or-indices-whats-the...), and SQLite and PostgreSQL documentation (as two popular examples) use "indexes".

12. orthecreedence ◴[14 Nov 24 17:23 UTC] No.42138564[source]▶

>>42138105 #

It depends on your audience. If you're catering to academics, use "indices." If you're catering to the general person, "indices" comes off as pompous.

replies(1): >>42144528 #

13. graemep ◴[14 Nov 24 17:39 UTC] No.42138785[source]▶

>>42136324 #

> SQLite isn't competing with other databases. It's competing with JSON and XML files for persistent storage

It competes with both. It's clearly used for local persistent storage. So are quite a lot of other things. It also competes with other RDBMSes where a separate server process is not a requirement.

That does mean it serves very different requirements; it's just that its use cases are a lot wider than replacing JSON and XML files and the like.

replies(1): >>42140986 #

14. vivzkestrel ◴[14 Nov 24 17:55 UTC] No.42138995[source]▶

>>42134964 (OP) #

would be real nice to see how postgres does the same thing, compare and take notes

15. euroderf ◴[14 Nov 24 19:25 UTC] No.42140104[source]▶

>>42138105 #

Try pluralizing "time series". You won't get far.

So what I've seen in Finland is people using "time series" for the plural and "time serie" for the singular.

replies(1): >>42144626 #

16. threatofrain ◴[14 Nov 24 20:42 UTC] No.42140986{3}[source]▶

>>42138785 #

> It also competes with other RDBMSes where a separate server process is not a requirement.

If you casually list off the top DBs either by usage or by recent hotness, then almost all of them will have a server, but you'll also find that basically none of them are embedded DBs, with the exception of RocksDB.

replies(2): >>42147626 #>>42248153 #

17. w10-1 ◴[14 Nov 24 21:16 UTC] No.42141312[source]▶

>>42134964 (OP) #

or emit tgf for yEd, for more layout variants with less work

18. srcreigh ◴[15 Nov 24 07:00 UTC] No.42144528{3}[source]▶

>>42138564 #

Nope. Academics prefer “indexes” when discussing databases.

19. Terr_ ◴[15 Nov 24 07:23 UTC] No.42144626{3}[source]▶

>>42140104 #

I wonder if one could make a grammar-argument that it's like "Attorneys General." :p

20. lyxell ◴[15 Nov 24 14:46 UTC] No.42147378[source]▶

>>42136454 #

> It actually uses rowid even if you have a primary key.

This is true with one exception: if you create an INTEGER PRIMARY KEY, SQLite will use this instead [1].

[1]: https://sqlite.org/rowidtable.html
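A small sketch of that exception, assuming Python's standard `sqlite3` module; the table `t`, its columns and the helper `rowid_and_key` are hypothetical. With a column declared exactly `INTEGER PRIMARY KEY`, the column is an alias for the rowid, so both report the same value. With `INT PRIMARY KEY` the declared type does not match exactly, so the table keeps a separate rowid.

```python
import sqlite3


def rowid_and_key(pk_type):
    """Insert one row with id=42 and return (rowid, id) for the given declared key type."""
    conn = sqlite3.connect(":memory:")
    conn.execute(f"CREATE TABLE t (id {pk_type} PRIMARY KEY, name TEXT)")
    conn.execute("INSERT INTO t (id, name) VALUES (42, 'example')")
    row = conn.execute("SELECT rowid, id FROM t").fetchone()
    conn.close()
    return row


print(rowid_and_key("INTEGER"))  # (42, 42): id is an alias for the rowid
print(rowid_and_key("INT"))      # (1, 42): separate rowid, id lives in its own index
```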

21. e28eta ◴[15 Nov 24 15:14 UTC] No.42147626{4}[source]▶

>>42140986 #

I’m familiar with this embedded DB, used in Quickbooks desktop: https://en.m.wikipedia.org/wiki/SQL_Anywhere

So… large usage, but probably not very high on the hotness scale

22. ASalazarMX ◴[15 Nov 24 16:29 UTC] No.42148397{3}[source]▶

>>42138240 #

And even that is questionable, since many web applications offer SQLite as another DB back end, and it works just fine for a wider range of workloads than one would expect.

replies(1): >>42149065 #

23. ngrilly ◴[15 Nov 24 17:40 UTC] No.42149065{4}[source]▶

>>42148397 #

Agreed. SQLite is becoming popular on the server-side as well. The latest version of Rails making SQLite the default is particularly interesting.

replies(1): >>42150363 #

24. baq ◴[15 Nov 24 20:05 UTC] No.42150363{5}[source]▶

>>42149065 #

the problem with sqlite has never been performance, it's always been extreme (dead)locking when writing concurrently - how does Rails get around that assuming this is actually recommended for prod deployments?
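A minimal sketch of the single-writer behaviour behind this question, assuming Python's standard `sqlite3` module and a throwaway database file; the table name `events` and the helper names are hypothetical. It says nothing about how Rails configures SQLite. The busy timeout is deliberately set to zero so that a second writer fails immediately instead of waiting, which makes the conflict visible. In WAL mode a reader on another connection still succeeds while the write transaction is open.

```python
import os
import sqlite3
import tempfile


def connect(path):
    # timeout=0: report a lock conflict at once instead of retrying.
    # isolation_level=None: autocommit mode, so BEGIN/COMMIT below are explicit.
    return sqlite3.connect(path, timeout=0, isolation_level=None)


def demo_single_writer():
    results = {}
    with tempfile.TemporaryDirectory() as tmp:
        path = os.path.join(tmp, "demo.db")
        a = connect(path)
        b = None
        try:
            results["journal_mode"] = a.execute("PRAGMA journal_mode=WAL").fetchall()[0][0]
            a.execute("CREATE TABLE events (n INTEGER)")
            b = connect(path)

            # Connection a takes the write lock and leaves its transaction open.
            a.execute("BEGIN IMMEDIATE")
            a.execute("INSERT INTO events VALUES (1)")

            # A reader on b is not blocked and sees only committed data.
            results["read_during_write"] = b.execute(
                "SELECT count(*) FROM events"
            ).fetchall()[0][0]

            # A second writer is refused while a holds the write lock.
            try:
                b.execute("BEGIN IMMEDIATE")
                b.execute("ROLLBACK")
                results["second_writer_blocked"] = False
            except sqlite3.OperationalError:  # "database is locked"
                results["second_writer_blocked"] = True

            # Once a commits, b can write.
            a.execute("COMMIT")
            b.execute("BEGIN IMMEDIATE")
            b.execute("INSERT INTO events VALUES (2)")
            b.execute("COMMIT")
            results["rows_after"] = a.execute(
                "SELECT count(*) FROM events"
            ).fetchall()[0][0]
        finally:
            a.close()
            if b is not None:
                b.close()
    return results


print(demo_single_writer())
```

With a non-zero busy timeout the second writer would wait for the lock instead of failing, which is the usual way to soften this behaviour for short write transactions.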

replies(1): >>42150482 #

25. ngrilly ◴[15 Nov 24 20:18 UTC] No.42150482{6}[source]▶

>>42150363 #

https://fractaledmind.github.io/2024/04/15/sqlite-on-rails-t...

26. akx ◴[26 Nov 24 18:03 UTC] No.42248153{4}[source]▶

>>42140986 #

https://duckdb.org/ begs to differ.
