Popular/hot comments

(labs.quansight.org)

Show context

sgarland ◴[16 May 25 12:56 UTC] No.44004897[source]▶

> Instead, many reach for multiprocessing, but spawning processes is expensive

Agreed.

> and communicating across processes often requires making expensive copies of data

SharedMemory [0] exists. Never understood why this isn’t used more frequently. There’s even a ShareableList which does exactly what it sounds like, and is awesome.

[0]: https://docs.python.org/3/library/multiprocessing.shared_mem...

replies(8): >>44004956 #>>44005006 #>>44006103 #>>44006145 #>>44006664 #>>44006670 #>>44007267 #>>44013159 #

chubot ◴[16 May 25 14:46 UTC] No.44006145[source]▶

>>44004897 #

Spawning processes generally takes much less than 1 ms on Unix

Spawning a PYTHON interpreter process might take 30 ms to 300 ms before you get to main(), depending on the number of imports

It's 1 to 2 orders of magnitude difference, so it's worth being precise

This is a fallacy with say CGI. A CGI in C, Rust, or Go works perfectly well.

e.g. sqlite.org runs with a process PER REQUEST - https://news.ycombinator.com/item?id=3036124

replies(9): >>44006287 #>>44007950 #>>44008877 #>>44009754 #>>44009755 #>>44009805 #>>44010011 #>>44012318 #>>44013651 #

1. Sharlin ◴[16 May 25 14:57 UTC] No.44006287[source]▶

>>44006145 #

Unix is not the only platform though (and is process creation fast on all Unices or just Linux?) The point about interpreter init overhead is, of course, apt.

replies(1): >>44007828 #

2. btilly ◴[16 May 25 17:19 UTC] No.44007828[source]▶

>>44006287 (TP) #

Process creation should be fast on all Unices. If it isn't, then the lowly shell script (heavily used in Unix) is going to perform very poorly.

replies(4): >>44009682 #>>44010066 #>>44012374 #>>44013686 #

3. kragen ◴[16 May 25 20:45 UTC] No.44009682[source]▶

>>44007828 #

While I think you've been using Unix longer than I have, shell scripts are known for performing very poorly, and on PDP-11 Unix (where perhaps shell scripts were most heavily used, since Perl didn't exist yet) fork() couldn't even do copy-on-write; it had to literally copy the process's entire data segment, which in most cases also contained a copy of its code. Moving to paged machines like the VAX and especially the 68000 family made it possible to use copy-on-write, but historically speaking, Linux has often been an order of magnitude faster than most other Unices at fork(). However, I think people mostly don't use those Unices anymore. I imagine the BSDs have pretty much caught up by now.

https://news.ycombinator.com/item?id=44009754 gives some concrete details on fork() speed on current Linux: 50μs for a small process, 700μs for a regular process, 1300μs for a venti Python interpreter process, 30000–50000μs for Python interpreter creation. This is on a CPU of about 10 billion instructions per second per core, so forking costs on the order of ½–10 million instructions.

4. fredoralive ◴[16 May 25 21:48 UTC] No.44010066[source]▶

>>44007828 #

Python runs on other operating systems, like NT, where AIUI processes are rather more heavyweight.

Not all use cases of Python and Windows intersect (how much web server stuff is a Windows / IIS / SQL Server / Python stack? Probably not many, although WISP is a nice acronym), but you’ve still got to bear it in mind for people doing heavy numpy stuff on their work laptop or whatever.

5. saagarjha ◴[17 May 25 06:17 UTC] No.44012374[source]▶

>>44007828 #

Yes, this is why using shell scripts on macOS is miserable

6. bobmcnamara ◴[17 May 25 12:05 UTC] No.44013686[source]▶

>>44007828 #

Before we had MMUs and CoW, it was awful slow

↑

The first year of free-threaded Python