Btw, too bad the author talks about microsecond-guarantee usage but does not provide a link; that would be interesting reading.
Why would there be large memory allocations because of immutable data structures? Btw, you can also use immutable data structures in, e.g., Rust fairly easily. And Haskell also supports mutation and mutable data structures.
That said, Haskell can use a lot of memory, but that's more to do with pervasive boxing by default, and perhaps with laziness.
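That mutation support is easy to demonstrate; here's a minimal sketch using IORef from base (STRef and the mutable vectors from the vector package work similarly):

import Data.IORef

main :: IO ()
main = do
  -- A plain mutable cell: opt-in, in-place mutation.
  counter <- newIORef (0 :: Int)
  modifyIORef' counter (+ 1)   -- strict in-place update
  readIORef counter >>= print  -- prints 1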
When data is immutable, it can be freely shared. Changes to the data essentially use copy-on-write, and only the delta gets written, since immutability means you never need a deep copy. Add that the garbage collectors of Haskell and Erlang are designed for a high allocation rate and have zero cost for dead data, and this is much faster than people tend to assume.
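A tiny Haskell illustration of the delta-only point: prepending to a list allocates one new cell and shares the entire tail:

xs :: [Int]
xs = [1, 2, 3]

-- O(1): allocates a single cons cell; the tail is shared with xs, not copied.
ys :: [Int]
ys = 0 : xs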
The way you implement a webserver in either Haskell or Erlang is rather trivial: whenever there's an incoming request, you spawn a thread to handle it. So you don't have 1 webserver serving 10k requests; you have 10k webservers serving 1 request each. And since they are started from the same core data, they'll share it thanks to immutability. See also old-style Apache or PHP and fork().
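A minimal sketch of that shape in Haskell, assuming the network package; handleConn is a hypothetical placeholder for real request handling:

import Control.Concurrent (forkIO)
import Control.Monad (forever)
import Network.Socket

main :: IO ()
main = do
  sock <- socket AF_INET Stream defaultProtocol
  setSocketOption sock ReuseAddr 1
  bind sock (SockAddrInet 8080 0)  -- 0 = INADDR_ANY
  listen sock 1024
  forever $ do
    (conn, _peer) <- accept sock
    -- One lightweight thread per connection; any shared config is
    -- immutable, so every thread can read it without copies or locks.
    forkIO (handleConn conn)

handleConn :: Socket -> IO ()
handleConn conn = do
  -- hypothetical request handling would go here
  close conn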
Tries (like Scala's Vector) or trie maps (the core map types of Scala, Clojure, and probably Haskell?) aren't copied wholesale on updates.
In fact, whether a data structure counts as an immutable or persistent data structure, or merely an unmodifiable one (like Kotlin uses), comes down to whether it requires full copies on most updates. In FP languages, immutable data structures aren't "specialized" at all.
In practice, it is not. The canonical Haskell compiler, GHC, is excellent at transforming the operations on immutable data that Haskell programs are written in terms of into efficient mutations at the runtime level. Also, since web development is quite popular in the Haskell community, lots of people have spent many hours optimizing this precise use case.
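A hedged illustration: compiled with -O2, GHC's list fusion will usually rewrite something like the function below into a tight loop over machine integers, never allocating the intermediate list:

doubleSum :: Int -> Int
doubleSum n = sum (map (* 2) [1 .. n])  -- typically fuses: no list at runtime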
In my experience, the real downside is that compilation times are a bit long -- the compiler is doing a LOT of work after all.
Yes, at the level of native machine code and memory cells, there's not that much of a difference between immutability plus garbage collection and higher-level source code that mutates. Thanks to GC, you end up overwriting the same memory locations over and over again anyway.
Either you have a specialised GC that works like this, or a good general-purpose generational GC can probably pick up on this pattern on its own.
This hurt my brain. It seems that in some places (e.g. Java land) "unmodifiable" refers to something you can't modify through that reference, but which could just be a wrapper around a structure that someone else can still modify. In that case they use "immutable" to mean something that is modifiable nowhere.
I may be misrepresenting this idea, but I think the terminology is so poor that it deserves to be misunderstood.
// Using mutability.
// `increment` is void, and makes 2 bigger for everyone.
increment(2);
// Typical Java "safety".
// It's still void, but now it throws a RuntimeException
// because the developers are saving you from making everyone's 2 bigger.
increment(2);
// Immutable
// Returns 3
increment(2);
Haskell's GC is also fast when you are mostly generating garbage, which is inherently true for web server handlers.
containers and unordered-containers handle most of your needs, and they only copy their trees' spines (O(log n)) on update.
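A small sketch of that spine copying with Data.Map from containers; the old map stays valid and shares every untouched subtree with the new one:

import qualified Data.Map.Strict as Map

m1 :: Map.Map Int String
m1 = Map.fromList [(1, "a"), (2, "b"), (3, "c")]

-- O(log n): only the path down to the new key is rebuilt.
m2 :: Map.Map Int String
m2 = Map.insert 4 "d" m1

main :: IO ()
main = print (m1, m2)  -- both versions remain fully usable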
A composition of catamorphic and anamorphic functions can eliminate a lot of the in-between allocations (a hylomorphism).
Basically, it looks like you're building a ton of intermediate structure and then consuming it, meaning much of the in-between stuff can be eliminated.
Interesting optimizations, and a little mind-blowing when you see it.
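A hand-rolled sketch of the idea, specialised to lists rather than using the recursion-schemes library: the coalgebra describes an intermediate list and the algebra consumes it, but their composition never materialises that list:

-- hylo coalg alg z: unfold a seed into "elements" and fold them up,
-- without ever building the intermediate list.
hylo :: (b -> Maybe (a, b)) -> (a -> c -> c) -> c -> b -> c
hylo coalg alg z = go
  where
    go seed = case coalg seed of
      Nothing         -> z
      Just (a, seed') -> alg a (go seed')

-- Sum 1..n: the list [n, n-1 .. 1] exists only conceptually.
sumTo :: Int -> Int
sumTo = hylo coalg (+) 0
  where
    coalg 0 = Nothing
    coalg n = Just (n, n - 1)

main :: IO ()
main = print (sumTo 1000000)  -- 500000500000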
Of course, even a moving GC has limits; it won't turn a hashtable into something that has local accesses.