(lmsys.org)

281 points GabrielBianconi | 1 comments | 29 Aug 25 14:07 UTC | HN request time: 0s | source

Show context

34679 ◴[29 Aug 25 14:44 UTC] No.45064819[source]▶

"By deploying this implementation locally, it translates to a cost of $0.20/1M output tokens"

Is that just the cost of electricity, or does it include the cost of the GPUs spread out over their predicted lifetime?

” Our implementation, shown in the figure above, runs on 12 nodes in the Atlas Cloud, each equipped with 8 H100 GPUs.”

Maybe the cost of renting?

34679 ◴[29 Aug 25 15:09 UTC] No.45065147[source]▶

I'm confused because I wouldn't consider a cloud implementation to be local.

1. DSingularity ◴[29 Aug 25 15:42 UTC] No.45065549[source]▶

I guess local for him is independent/private.

Deploying DeepSeek on 96 H100 GPUs