←back to thread

243 points Towaway69 | 8 comments | | HN request time: 1.298s | source | bottom

Hi There,

Erlang-RED has been my project for the last couple of months and I would love to get some feedback from the HN community.

The idea is to take advantage of Erlangs message passing and low overhead processes to have true concurrency in Node-RED flows. Plus also to bring low-code visual flow-based programming to Erlang.

Show context
oersted ◴[] No.44007812[source]
> Node-RED is a amazing[*] tool for creating flows that describe concurrent processing, it is just a shame the NodeJS is single threaded. So why not use something that is multi-process from the ground up?

That's great! But now that we are doing this, it kind of makes me wish that it was not multi-processing and Erlang, but a more mainstream language with a better library ecosystem and focused on multi-threading instead, Rust comes to mind but there could be a better one.

Is there a visual flow programming language based on Rust that is relatively mature?

replies(6): >>44007924 #>>44007929 #>>44008197 #>>44008520 #>>44008981 #>>44011466 #
rdtsc ◴[] No.44008981[source]
> makes me wish that it was not multi-processing and Erlang, but a more mainstream language with a better library ecosystem and focused on multi-threading instead, Rust comes to mind but there could be a better one.

I am not quite following, why would we drop multi-processing with isolated tiny heaps with a few KBs each and move to multi-threading with a large multi-megabyte stacks per thread and all sharing and writing to the same heap? It seems like a step backward.

> Rust comes to mind but there could be a better one.

I can see the excitement, but at the same time this is becoming a meme. I mean the drive-by comments of "why don't your rewrite it in Rust" on every repo or "show HN" post. It's kind of like nodejs from years past: "why don't your write in nodejs, it's webscale!". The problem is, after a while it starts to have an inverse dose response: it makes people run away from Rust just based on that kind of stuff.

replies(1): >>44009660 #
SigmundA ◴[] No.44009660[source]
>I am not quite following, why would we drop multi-processing with isolated tiny heaps with a few KBs each and move to multi-threading with a large multi-megabyte stacks per thread and all sharing and writing to the same heap? It seems like a step backward.

Performance typically, hard to work in parallel on large amount of data performantly without multiple thread sharing a heap and you typically don't need large amount of threads because you don't actually have that many real cores to run them on.

Lots of little share nothing process are great conceptually but that does create significant overhead.

replies(1): >>44009867 #
1. rdtsc ◴[] No.44009867[source]
> Lots of little share nothing process are great conceptually but that does create significant overhead.

It doesn't, really? I have clusters running 1M+ Erlang processes comfortably per node.

> you typically don't need large amount of threads

Exactly, that's why Erlang only spawns just the right amount of threads. One scheduler thread per CPU, then a bunch of long running CPU task threads (same number as CPUs as well), plus some to do IO (10-20) and that's it.

replies(1): >>44011371 #
2. SigmundA ◴[] No.44011371[source]
Erlang is not as performant for heavy computational loads, this is bared out in many benchmarks, thats not what it's good at.

Message passing share nothing adds overhead when trying to reference data between processes because it must be copied, how would you do multithreaded processing of a large amount of data without a shared heap in a performant way? Only one thread can work on a heap at a time, so what do you do? Chop up the data and copy it around then piece it back together afterward? Thats overhead vs just working on a single heap in a lockless way. Far as I can tell the main Erlang image processing libraries just call out to C libraries that says something of that kind of work.

Yes Erlang indirects computation to a OS thread pool, multiplexing all those little Erlang process on real threads creates scheduling overhead. Those threads cannot work on the same data at the same time unless they call out to a libraries written in another language like C to do the heavy lifting.

.Net does similar things for say web server implementations, it uses a thread pool to execute many concurrent requests and if you use async it can yield those threads back to the pool while say waiting on a DB call to complete, you would not create a thread per http connection so the 4mb stack size is not an issue just like its not with Erlangs thread pool.

replies(2): >>44011910 #>>44012105 #
3. rdtsc ◴[] No.44011910[source]
> Erlang is not as performant for heavy computational loads

Well, sure don't use it for heavy computational loads if it doesn't work for you. It works great for our use case and I think Node-RED is also not for heavy computational workload, but rather for event programming, where lots of events happen concurrently.

> how would you do multithreaded processing of a large amount of data without a shared heap in a performant way?

That's a specific workload, it's not universal, for sure. I haven't worked on cases there is a single large data structure and all million clients have to manipulate it concurrently. Not saying it's implausible, I can see a game server with an arena maybe being like that, but there are other systems that work like that.

replies(2): >>44012371 #>>44013707 #
4. marci ◴[] No.44012105[source]
Nobody choses erlang for heavy computational loads, but for ease of use and a simple mental model. You do heavy compute on it mostly if you don't want to be bothered. Erlang's multi-process thing, it's not to be efficient at compute, it's to be efficient at crash recovery and distributed compute. It was there before multi-core multithread were a thing.
5. Towaway69 ◴[] No.44012371{3}[source]
> I think Node-RED is also not for heavy computational workload

That is correct, Node-RED is designed for routing many thoughts of smallish data packets through some computational units. Each computational unit is basically a node with a message queue and each message is handled and then passed on.

This means that if a node takes too long, its queue will potentially overflow causing failure. However if there are fewer messages than nodes can take longer - there is nothing ingrained in Node-RED that prevents having long running processes and fewer messages.

As long as message queues don't overflow or messages get too large causing massive memory usage, then Node-RED will happily just work.

NodeREDs many use cases are controlling many small devices (home automation) and (I)IoT - collecting datapoints from 100s or 1000s of devices and routing that around to whereever the data is needed.

6. SigmundA ◴[] No.44013707{3}[source]
>Well, sure don't use it for heavy computational loads if it doesn't work for you.

>why would we drop multi-processing with isolated tiny heaps with a few KBs each and move to multi-threading with a large multi-megabyte stacks per thread and all sharing and writing to the same heap?

This is the question I was answering and I gave you an example then you proceeded to say there is no overhead Erlangs process model, which now you admit exists.

>That's a specific workload, it's not universal, for sure. I haven't worked on cases there is a single large data structure and all million clients have to manipulate it concurrently.

Thats not the workload, simple as a single client doing heavy processing of large image or video, multithreaded shared heap wins due the the overhead of Erlang's processing model, thats why you give it up which is answering your original question again.

Geez this is tough crowd, first Erlang has no overhead, then its well that nots whats its good at, which I said. Then its well those are specific workloads, yeah serving little messages to a million clients is a specific workload too. I like Erlang, it's very good at specific things but it sacrifices quite a bit of performance to do that like many good abstractions. That can help overall reliability and performance on certain workloads but other modern languages following the shared heap multi-threading model will probably outperform it if care is taken in the design because they more closely match the underlying OS and hardware execution model of today with less indirection and overhead and they can also do other workloads that Erlang will never be good at with its share nothing green thread/process design.

replies(1): >>44018637 #
7. rdtsc ◴[] No.44018637{4}[source]
> This is the question I was answering and I gave you an example then you proceeded to say there is no overhead Erlangs process model, which now you admit exists

But that's not what NodeRed-Erlang does. The author is confirming it one one of the sibling comments. You invented some new type of workload, were all clients have to access the same heap concurrently and telling the author to rewrite everything in Rust. But that's your use-case, not the the one we're discussion. My use case is not that either.

> Geez this is tough crowd, first Erlang has no overhead, then its well that nots whats its good at, which I said. Then its well those are specific workloads, yeah serving little messages to a million clients is a specific workload too.

The application in this case fits Erlang perfectly. You're the one who introduced a new type of a use case - when you need to concurrently process the same video or audio and arguing that Erlang is not good for it. Yeah, we agree, it's not good for that, use something else, Rust, C etc. It's like we're talking about trains and you're saying, don't use trains, they can't fly, what if you have to cross the ocean or something like.

replies(1): >>44021271 #
8. SigmundA ◴[] No.44021271{5}[source]
> You invented some new type of workload, were all clients have to access the same heap concurrently and telling the author to rewrite everything in Rust. But that's your use-case, not the the one we're discussion. My use case is not that either.

Image processing or video or audio or parsing or working with large amount of data in a multithreaded way is not a new kind of workload its a very common one.

It was stated by the OP that they wished it would have been done in a more standard multithreaded language like rust. You asked why, I gave you an example, you said it wasn't true that there is no overhead to Erlang, I gave further details, you then pivoted to thats not the use case. How do you know what oersted's use case is?

>It's like we're talking about trains and you're saying, don't use trains, they can't fly, what if you have to cross the ocean or something like.

It's more like you asked when would you not use a train? I answered when you need to go across an ocean and planes can do both what trains and boats do just at higher cost but at greater speed.

Erlang is great at specific things, standard multithreaded shared heap runtimes are good at everything they just might take more effort and care to do the things Erlang can do, but they will perform better than Erlang. Erlang can technically do anything since its Turing complete but it will perform unacceptably at many standard workloads just like a train could go across an ocean if put on a barge and towed.