Most active commenters

The_Colonel(4)
capitainenemo(3)

←back to thread

Software development topics I've changed my mind on

(chriskiehl.com)

Show context

imjonse ◴[05 Feb 25 12:12 UTC] No.42947407[source]▶

>>42946281 (OP) #

> ORMs are the devil in all languages and all implementations. Just write the damn SQL

It depends on what you're writing. I've seen enough projects writing raw SQL because of aversion to ORMs being bogged down in reinventing a lot of what ORMs offer. Like with other choices it is too often a premature optimization (for perf or DX) and a sign of prioritizing a sense of craftsmanship at the expense of the deliverables and the sanity of other team members.

replies(8): >>42947752 #>>42947978 #>>42948317 #>>42948504 #>>42953164 #>>42954817 #>>42956074 #>>42956308 #

qaq ◴[05 Feb 25 12:47 UTC] No.42947752[source]▶

>>42947407 #

It's not so much optimization but experience that on any sufficiently large project you gonna run into ORM limitation and end up with mix of ORM and direct queries. So might as well...

replies(2): >>42948245 #>>42954198 #

1. The_Colonel ◴[05 Feb 25 13:35 UTC] No.42948245[source]▶

>>42947752 #

Starting with raw SQL is fun. But at some point you find out you need some caching here, then there, then you have a bunch of custom disconnected caches having bugs with invalidation. Then you need lazy loading and fetch graphs. Step by step you'll build your own (shitty) ORM.

Same thing for people claiming they don't need any frameworks.

replies(4): >>42948519 #>>42950241 #>>42951262 #>>42953407 #

2. wesselbindt ◴[05 Feb 25 13:55 UTC] No.42948519[source]▶

>>42948245 (TP) #

> you find out you need some caching here, then there

Forgive my ignorance, but how do ORMs help with adding caching? Or are you implying they obviate or reduce the need for caching?

replies(2): >>42948686 #>>42948792 #

3. ivan_gammel ◴[05 Feb 25 14:10 UTC] No.42948686[source]▶

>>42948519 #

They do caching themselves so that some of your queries via ORM won’t hit the actual db.

replies(1): >>42949394 #

4. The_Colonel ◴[05 Feb 25 14:19 UTC] No.42948792[source]▶

>>42948519 #

JPA implementations have "managed entities", sometimes called session or 1st level cache which is making sure that every entity is loaded at max. one time within a transaction. Like e.g. checking user/user permissions is something which typically has to be done in several places in course of a single request - you don't want to keep loading them for every check, you don't want to keep passing them across 20 layers, so some form of caching is needed. JPA implementations do it for you automatically (assuming you're fine with transaction-scoped cache) since this is such a core concept to how JPA works (the fact it's also a cache is kind of secondary consequence). JPA implementations typically provide more advanced caching capabilities, caching query results, distributed cache (with proper invalidation) etc.

replies(1): >>42948809 #

5. wesselbindt ◴[05 Feb 25 14:20 UTC] No.42948809{3}[source]▶

>>42948792 #

Ah yep, of course! Thanks!

6. capitainenemo ◴[05 Feb 25 15:03 UTC] No.42949394{3}[source]▶

>>42948686 #

So, given the main issue with ORM is the object/relational part... (https://web.archive.org/web/20160301022121/http://www.revisi...)

Why is caching not a feature in DB connection pools? I mean, most databases have it on their side, why not have it as an option for the same query sets prior to hitting the db, with configurable invalidations? Or is it, and I've just never thought to look for it.

replies(2): >>42949808 #>>42952224 #

7. The_Colonel ◴[05 Feb 25 15:26 UTC] No.42949808{4}[source]▶

>>42949394 #

Integrating cache into connection pools brings little added value since connection pools don't have enough context/information to manage the cache intelligently. You'd have to do all the hard work (like invalidation) yourself anyway.

Example: if you execute "UPDATE orders SET x = 5 WHERE id = 10", the connection pool has no idea what entries to invalidate. ORM knows that since it tracks all managed entities, understands their structure, identity.

replies(1): >>42950152 #

8. capitainenemo ◴[05 Feb 25 15:45 UTC] No.42950152{5}[source]▶

>>42949808 #

I guess I was thinking more of frequently run queries against infrequently modified data or where stale data doesn't matter so much. The sort of things that are ideal cache targets. You'd think you could tag queries like that. Sort of the things that a CDN caches but more granular. Sure if it's stuff that's frequently changed, an ORM could reason about it just like, well, the database does, but then you're back into all the bad things about running your shadow database with a badly fitting model, and you'd be better off just ensuring all or part of the database ran closer to your app with replication, say, in memory on same server.

9. qaq ◴[05 Feb 25 15:50 UTC] No.42950241[source]▶

>>42948245 (TP) #

caching is orthogonal to using or not using ORM. You might opt to have caching with or without ORM in a consistent manner. You can also opt to add read replicas fronted by say pgcat in Postgres case without having separate caching layer.

replies(1): >>42950714 #

10. The_Colonel ◴[05 Feb 25 16:18 UTC] No.42950714[source]▶

>>42950241 #

I guess this is a point where terminology matters. If you work with SQL database in an OOP language, you pretty much always do some object-relational mapping, no matter if you have a big framework or just raw SQL connection.

But this is not what people usually call as ORMs. All the "bad kind of ORM" (JPA impls, Entity Framework, SQLAlchemy, Doctrine, Active Record...) have some concept of an entity session which is tracking the entities being processed. To me, this is a central feature of an ORM, one of its major benefits. It is, incidentally, also serving as a transaction-scoped cache.

I won't of course dispute that you can have caching on other levels as well (which may perform differently, for different use cases).

11. stackskipton ◴[05 Feb 25 16:48 UTC] No.42951262[source]▶

>>42948245 (TP) #

As SRE who dealt with more caching errors then I care to. Alot of caching comes down to YAGNI.

To his point: It's very hard to beat decades of RDBMS research and improvements

Your RDBMS internal caching will likely get you extremely far and speed difference of Redis vs RDBMS call is very unlikely to matter in your standard CRUD App.

replies(1): >>42953875 #

12. ivan_gammel ◴[05 Feb 25 17:43 UTC] No.42952224{4}[source]▶

>>42949394 #

That would be a result set layer, not a connection pool. Could make sense if you worked with rows, but if you use ORM, why mapping cached row again and again? ORMs cache hydrated objects, which seems to be more efficient.

replies(1): >>42953337 #

13. capitainenemo ◴[05 Feb 25 18:54 UTC] No.42953337{5}[source]▶

>>42952224 #

Yeah, it was more to see if there were any benefits to an ORM that could be used without the, well "ORM" part :)

14. gilbetron ◴[05 Feb 25 19:00 UTC] No.42953407[source]▶

>>42948245 (TP) #

There are plenty of libraries/packages for SQL that do all of that for you, too. The choice isn't between a sophisticated ORM and just throwing SQL text at a socket. The fundamental assumption of ORMs is broken, but much of the tooling works well and exists in non-ORM places.

15. lcnPylGDnU4H9OF ◴[05 Feb 25 19:34 UTC] No.42953875[source]▶

>>42951262 #

> Redis vs RDBMS call is very unlikely to matter in your standard CRUD App

To any juniors reading: cache the response payload (or parts of it), not the results of database queries.

↑