←back to thread

281 points GabrielBianconi | 1 comments | | HN request time: 0s | source
Show context
34679 ◴[] No.45064819[source]
"By deploying this implementation locally, it translates to a cost of $0.20/1M output tokens"

Is that just the cost of electricity, or does it include the cost of the GPUs spread out over their predicted lifetime?

replies(3): >>45064954 #>>45066023 #>>45071720 #
dragonslayer56 ◴[] No.45064954[source]
” Our implementation, shown in the figure above, runs on 12 nodes in the Atlas Cloud, each equipped with 8 H100 GPUs.”

Maybe the cost of renting?

replies(2): >>45065147 #>>45065503 #
34679 ◴[] No.45065147[source]
I'm confused because I wouldn't consider a cloud implementation to be local.
replies(3): >>45065380 #>>45065518 #>>45065549 #
randomjoe2 ◴[] No.45065518[source]
Local doesn't refer to "on metal" anymore to many people
replies(3): >>45065653 #>>45065663 #>>45067202 #
mwcz ◴[] No.45065663[source]
"On metal" is muddied too. I've heard people refer to web apps running in an OCI container as being "bare metal" deployment, as opposed to AWS or whatever hosting platform.

That's silly, but the idea that "local" is not the opposite of remote is even sillier.

replies(2): >>45065742 #>>45065883 #
ffsm8 ◴[] No.45065742[source]
You can run an OCI container on bare metal though. It doesn't stop being run on bare metal just because you're running in kernel namespaces, aka docker container

Lots of people were advocating for running their k8s on bare metal servers to maximize the performance of their containers

Now wherever that's applied to your conversation... I've no clue, too little context ( 。 ŏ ﹏ ŏ )

replies(1): >>45066076 #
okasaki ◴[] No.45066076[source]
In my opinion, if you're running k8s on bare metal, that's "k8s on bare metal" but still "<your app> on kubernetes", not "<your app> on bare metal".
replies(1): >>45067194 #
ffsm8 ◴[] No.45067194[source]
Sorry, but then your opinion is just plain wrong

Bare metal in the context of running software is a technical term with a clear meaning that hasn't become contested like "AI" or "Crypto" - and that meaning is that the software is running directly on the hardware.

As k8s isn't virtualization, processes spawned by its orchestrator are still running on bare metal. It's the whole reason why containers are more efficient compared to virtual machines

replies(3): >>45067264 #>>45067809 #>>45083341 #
bee_rider ◴[] No.45067264[source]
Bare metal as in, no operating system? Does Linux really get in the way of these LLM inference engines?
replies(1): >>45067336 #
ffsm8 ◴[] No.45067336[source]
No, as I said in my previous comment: bare metal as in not a virtual machine

https://en.m.wikipedia.org/wiki/Bare-metal_server

replies(1): >>45068930 #
1. pessimizer ◴[] No.45068930[source]
Note that this is a term whose meaning has been expanded to refer to non-VPS servers very recently. Bare-metal has traditionally meant "without an operating system." It did not mean "a server that is an actual server," because that was the default.

It also does not always "clearly" have this new meaning. Somebody who is used to running programs directly (with no intermediate OS) on hardware might not understand what you're saying, or might ask you to clarify, and you probably shouldn't feel put upon by a totally understandable misinterpretation.

edit: Especially when you keep repeating "directly on hardware" when you mean "not on a VM." VMs also run on hardware. You're saying that you're only running on one OS instead an OS in your OS.