The Practical Guide to Scaling Django

(slimsaas.com)

1. feydaykyn ◴[15 Nov 24 21:08 UTC] No.42151073[source]▶

The snippets are not false, but there's so much context missing it's easy to worsen the situation, especially for beginners which seem to be the target audience.

First, this guide should emphasize the need to measure before doing anything : django silk, django debug toolbarsm, etc. Of course, measure after the optimizations too, and measure in production with an apm.

Second, some only work sometimes : select_related / prefetch_related / iterator will lead to giga SQL queries with nested joins all over the place, and ends by exploding ram usage. It will help at first, but soon enough one will pay any missing sql knowledge or naive relationships.

Third, caching without taking the context into account will probably lead to data corruption one way or another. Debugging stale cache issues is not fun, since you cannot reproduce them easily.

Fourth, celery is a whole new world, which requires workers, retry and idempotent logic, etc.

Finally, scaling is also about code: architecture, good practices, basic algorithm, etc

I'll end by linking to more complete resources : - https://docs.djangoproject.com/en/5.1/topics/performance/ - https://loadforge.com/guides/the-ultimate-guide-to-django-pe... - https://medium.com/django-unleashed/django-application-perfo...

replies(3): >>42155088 #>>42155739 #>>42156580 #

2. vldmrs ◴[15 Nov 24 21:33 UTC] No.42151364[source]▶

>>42149694 (OP) #

At first I wanted to criticize the post, buuut after finishing reading it I actually liked it. Very concise and practical

ps - I didn’t know about template “cache” directive

replies(1): >>42154476 #

3. vundercind ◴[15 Nov 24 21:42 UTC] No.42151467[source]▶

>>42149694 (OP) #

Probably 80% of notable performance problems I’ve seen in the kinds of systems that things like Django and Ruby get used for have been terrible queries or patterns of use for databases (I’ve seen 1,000x or worse costs for this versus something more-correct) and nearly all of the other 20% has been areas that plainly just needed some pretty straightforward caching.

The nice thing about that is that spotting those, and the basic approach to fixing them, if not the exact implementation details, are cross-platform skills that apply basically anywhere.

I actually can’t recall any other notable performance problems in those sorts of systems, over the years. Those are so common and the fixes so effective I guess the rest has just never rated attention. I’ve seen different problems in long-lived worker processes though (“make it streaming—everything becomes streaming when scale gets big enough” is the usual platform-agnostic magic bullet in those cases)

A bunch of TFA is basically about those things, so I’m not correcting it, more like nodding along.

Oh wait I just thought of another I’ve seen: serving large files through a scripting language, as in, reading it in and writing it back out with a scripting language. You run into trouble at even modest scale. There’s a magic response header for that, make Nginx or Apache or whatever serve it for you, it’s a fix that’s typically deleting a bunch of code and replacing it with one or two lines. Or else just use s3 and maybe signed URLs like the rest of the world. Problem solved.

replies(4): >>42151984 #>>42152017 #>>42154568 #>>42155730 #

4. megaman821 ◴[15 Nov 24 22:32 UTC] No.42151984[source]▶

>>42151467 #

I have had to combine files into a zipped file on demand before. It is hard to avoid the inherent slowness of that.

replies(3): >>42152323 #>>42153663 #>>42155601 #

5. jonatron ◴[15 Nov 24 22:35 UTC] No.42152017[source]▶

>>42151467 #

The magic header is probably X-Accel-Redirect

replies(2): >>42153650 #>>42156013 #

6. ch4s3 ◴[15 Nov 24 23:08 UTC] No.42152323{3}[source]▶

>>42151984 #

Interesting, was there a business reason to not do that in the background somewhere?

replies(1): >>42153403 #

7. almost ◴[15 Nov 24 23:19 UTC] No.42152416[source]▶

>>42149694 (OP) #

This sort of article seems perfectly poised to be useless to beginners (no context, doesn't tell you how to use the things) and experts (no nuance, just listing basic features) alike. Who is it for? Why does it exist? Why is it posted here?

replies(1): >>42154306 #

8. 8organicbits ◴[15 Nov 24 23:56 UTC] No.42152747[source]▶

>>42149694 (OP) #

Don't store secrets in settings.py. Typically you'd inject those from secrets management as environment variables.

replies(1): >>42156663 #

9. slashnode ◴[16 Nov 24 00:19 UTC] No.42152914[source]▶

>>42149694 (OP) #

The basic outline of this post isn’t bad, the problem is that’s all there is - a basic outline. If you haven’t dealt with these problems before the checklists are meaningless. If you HAVE dealt with these problems before the checklists are redundant

10. megaman821 ◴[16 Nov 24 01:41 UTC] No.42153403{4}[source]▶

>>42152323 #

Yeah, very non-technical users that won't check their email or click on a notification when the zip file is ready for them.

replies(1): >>42250072 #

11. kehrazy ◴[16 Nov 24 01:55 UTC] No.42153454[source]▶

>>42149694 (OP) #

An amazing movie.

12. vundercind ◴[16 Nov 24 02:37 UTC] No.42153650{3}[source]▶

>>42152017 #

Yeah, or the kinda-better-named “x-sendfile” on apache2. Same effect.

13. vundercind ◴[16 Nov 24 02:40 UTC] No.42153663{3}[source]▶

>>42151984 #

Mmm. If you had the right library, might be able to stream it as it’s being created which might help at least with perceived performance, but yeah, that’s a fun one.

replies(1): >>42155847 #

14. jerrygenser ◴[16 Nov 24 04:40 UTC] No.42154306[source]▶

>>42152416 #

Seo and marketing to sell their product is the reason it exists.

15. danpalmer ◴[16 Nov 24 05:12 UTC] No.42154476[source]▶

>>42151364 #

FWIW, I'd advise against template caching. It's awkward to cache bust, and a network round trip to your cache will almost certainly be more expensive than the Python operations to render the template, even with stock Django templating which is slow.

The only place it's possible worth it is if you do a lot of database queries from your template rendering, and you're therefore caching database results (as rendered text). In that case, it's an easy patch. However a much better solution is to fetch all database results up front.

In my previous company we had a very significant Django codebase with plenty of templating, and found that using the templating system for (lazy loaded) database queries or caching was more hassle than it was worth and avoided it as much as possible. Treating template rendering as a pure CPU bound function was always better.

replies(2): >>42155864 #>>42156593 #

16. golergka ◴[16 Nov 24 05:32 UTC] No.42154568[source]▶

>>42151467 #

Knowing SQL and how relational databases actually work is one of the best superpowers a backend developer can have. If you want to go deeper than your database manual, the best place is Andy Pavlo's db course, freely available at youtube. I don't write databases, but after watching it I understand trade-offs and performance considerations much better, and feel much more comfortable reading Postgresql manual.

17. tmarice ◴[16 Nov 24 06:55 UTC] No.42154924[source]▶

>>42149694 (OP) #

If you're on Postgres, StringAgg and ArrayAgg are nice alternatives to prefetch_related to avoid building Python model instances and waste memory.

I wrote a short blog post on recent optimizations we did on our Django codebase: https://tmarice.dev/blog/better-living-through-optimized-dja...

replies(1): >>42156257 #

18. pastage ◴[16 Nov 24 07:37 UTC] No.42155088[source]▶

>>42151073 #

> scaling is also about code

Which is darn hard if you are a beginner in a framework, loops in loops still bites me after reality does the integration test for me. This is especially true when you try to do a simple thing as a beginner. By scaling I am just talking about normal production, going from 2 developers to a couple of thousand customers.

replies(1): >>42155182 #

19. feydaykyn ◴[16 Nov 24 08:07 UTC] No.42155182{3}[source]▶

>>42155088 #

To mind it's a part where the Django guide could be expanded a bit, in order to help scaffold a simple but "open to the future" code architecture. For instance I would warn against fat models and propose a very light "service pattern" architecture

20. The_Amp_Walrus ◴[16 Nov 24 08:38 UTC] No.42155289[source]▶

>>42149694 (OP) #

I did a more detailed yt video on django query optimisation (mostly for ppl new to the framework) for those interested

https://www.youtube.com/watch?v=9uoI6pvuvYs

21. xioxox ◴[16 Nov 24 10:04 UTC] No.42155601{3}[source]▶

>>42151984 #

I have Django code which creates a tar file on the fly from a list of requested files and works well. It doesn't use intermediate storage. The tar format can be pretty simple. I got most of the way into implementing a uncompressed zip version, but then I realised that tar was good enough for my site.

22. robertlagrant ◴[16 Nov 24 10:44 UTC] No.42155730[source]▶

>>42151467 #

> Probably 80% of notable performance problems I’ve seen in the kinds of systems that things like Django and Ruby get used for have been terrible queries or patterns of use for databases (I’ve seen 1,000x or worse costs for this versus something more-correct)

ActiveRecord pattern saves you a few lines of code now, and explodes your foot off later.

23. sjducb ◴[16 Nov 24 10:47 UTC] No.42155739[source]▶

>>42151073 #

100% Always measure before you performance optimise. Lots of times the “fast” solution is slower.

If you need a fast solution then add an integration test so that the system stays fast.

24. jonatron ◴[16 Nov 24 11:18 UTC] No.42155847{4}[source]▶

>>42153663 #

I had to create streaming zips of files from S3 on the fly about 10 years ago, https://github.com/jonatron/django-s3-stream . I didn't find it fun.

25. nprateem ◴[16 Nov 24 11:25 UTC] No.42155864{3}[source]▶

>>42154476 #

It'd be faster to retrieve from a cache than to make a round-trip to a DB to get the data needed for templating.

replies(2): >>42156365 #>>42161011 #

26. tecleandor ◴[16 Nov 24 12:11 UTC] No.42156013{3}[source]▶

>>42152017 #

Ah thanks, I thought it was a figure of speech or something :')

27. medo-bear ◴[16 Nov 24 12:38 UTC] No.42156114[source]▶

>>42149694 (OP) #

So I shouldnt have my business logic done in django templates?

28. rudasn ◴[16 Nov 24 13:18 UTC] No.42156257[source]▶

>>42154924 #

Nice post,thanks for sharing!

It would be nice to include the generated sql queries along with the code samples though. I've been on a similar path recently and being able to see the queries was really helpful (even the ones that failed!).

29. telgareith ◴[16 Nov 24 13:44 UTC] No.42156365{4}[source]▶

>>42155864 #

[assumption]

30. ◴[16 Nov 24 14:08 UTC] No.42156522[source]▶

>>42149694 (OP) #

31. bdzr ◴[16 Nov 24 14:22 UTC] No.42156580[source]▶

>>42151073 #

> Third, caching without taking the context into account will probably lead to data corruption one way or another.

One can only hope it's data corruption and not a sensitive data leak.

32. bdzr ◴[16 Nov 24 14:25 UTC] No.42156593{3}[source]▶

>>42154476 #

I haven't used it, but I think this is well targeted by https://github.com/dabapps/django-zen-queries.

33. halfcat ◴[16 Nov 24 14:38 UTC] No.42156663[source]▶

>>42152747 #

And also, when possible, try to use a key manager over environment variables.

Using a library like keyring [1] is a significant step up from a .env file sitting in your dev environment.

In other words:

- Store secrets in settings.py (bad)

- Store secrets in .env file (better)

- Store secrets in OS-level key vault (even better)

When the secrets are in a plaintext .env file, that file can get leaked in many non-obvious ways. Your antivirus uploads a copy, your IT department runs backups, someone on the team clones your git repo to a OneDrive/Dropbox folder and puts the .env file there. Then any of those services that has a leak, or any of the services those services use has a leak (improperly configured S3 bucket, etc), your secrets are leaked.

[1] https://github.com/jaraco/keyring

34. danpalmer ◴[17 Nov 24 00:55 UTC] No.42161011{4}[source]▶

>>42155864 #

My point was that you shouldn't be doing DB queries in the template. If you're doing the DB queries before templating then you should also be doing the cache queries before templating too.

35. ch4s3 ◴[26 Nov 24 21:07 UTC] No.42250072{5}[source]▶

>>42153403 #

I can feel that deep in my bones.