Ok, I answered my own question.
It’s several things:
* Cutting-edge code, not overly concerned with optimization
* Code written by scientists, who aren’t known for being the world’s greatest programmers
* The obsession the research world has with using Python
Not surprising that there’s a lot of low-hanging fruit that can be optimized.
The interface is designed to be easy to use (Python), while the part actually doing the work is designed for performance (C and CUDA, and may even be running on a TPU).
You're completely correct that the speed-sensitive parts are written in lower-level libraries, but another way to phrase that is "Python can go really fast, as long as you don't use Python." This also means ML is effectively hamstrung into using only methods that already exist and have been coded in C++, since anything written in pure Python would be too slow to compete.
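To make that concrete, here's a minimal sketch of the point (assuming NumPy is installed): the same sum-of-squares computed with a pure-Python loop versus delegated to NumPy, whose inner loop runs in compiled C. The results are identical; only where the loop executes differs.

```python
import time
import numpy as np

n = 1_000_000
data = list(range(n))
arr = np.arange(n)

# Pure Python: every iteration goes through the interpreter.
start = time.perf_counter()
total_py = sum(x * x for x in data)
py_time = time.perf_counter() - start

# NumPy: the same reduction, but the loop runs in C.
start = time.perf_counter()
total_np = int(np.dot(arr, arr))
np_time = time.perf_counter() - start

assert total_py == total_np  # same answer either way
print(f"pure Python: {py_time:.4f}s, NumPy: {np_time:.4f}s")
# The C-backed version is typically one to two orders of magnitude faster.
```

The moment your algorithm doesn't map onto an existing vectorized primitive, you're back in the interpreted loop, which is exactly the constraint described above.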
There are plenty of languages that make good tradeoffs between performance and usability. Python is not one of them. It is, at best, only slightly easier to use than Julia, yet orders of magnitude slower.