(lmsys.org)

281 points GabrielBianconi | 1 comment | 29 Aug 25 14:07 UTC

34679 [29 Aug 25 14:44 UTC] No.45064819

"By deploying this implementation locally, it translates to a cost of $0.20/1M output tokens"

Is that just the cost of electricity, or does it include the cost of the GPUs spread out over their predicted lifetime?

1. adam_arthur [30 Aug 25 03:39 UTC] No.45071720

I'm curious as well.

Depreciation and GPU failure rate over time must be considered, which I don't see mentioned in the article.
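
To make the distinction in this thread concrete, here is a rough sketch of how a per-token cost figure could be broken down into electricity and amortized hardware. The function names and every input value are hypothetical placeholders, not numbers from the article, and the model ignores failures, cooling, networking, and staffing.

```python
def cost_per_million_tokens(hourly_cost_per_gpu_usd, num_gpus, tokens_per_second_total):
    """USD per 1M output tokens for a cluster with the given aggregate throughput."""
    tokens_per_hour = tokens_per_second_total * 3600
    cluster_cost_per_hour = hourly_cost_per_gpu_usd * num_gpus
    return cluster_cost_per_hour / tokens_per_hour * 1_000_000


def amortized_gpu_hourly_cost(purchase_price_usd, lifetime_years, utilization=1.0):
    """Straight-line depreciation of one GPU, spread over the hours it is actually busy."""
    busy_hours = lifetime_years * 365 * 24 * utilization
    return purchase_price_usd / busy_hours


def electricity_hourly_cost(power_kw, price_per_kwh_usd):
    """Electricity cost of running one GPU for one hour."""
    return power_kw * price_per_kwh_usd


if __name__ == "__main__":
    # All values below are made-up placeholders; substitute your own.
    num_gpus = 96                    # cluster size named in the post title
    throughput = 100_000.0           # hypothetical aggregate output tokens/s
    power = electricity_hourly_cost(power_kw=1.0, price_per_kwh_usd=0.10)
    hardware = amortized_gpu_hourly_cost(purchase_price_usd=30_000.0,
                                         lifetime_years=4, utilization=0.8)

    print("electricity only:", cost_per_million_tokens(power, num_gpus, throughput))
    print("with depreciation:", cost_per_million_tokens(power + hardware, num_gpus, throughput))
```

Comparing the two printed figures shows how much of a quoted price depends on whether hardware depreciation is counted. A GPU rental rate, if that is what the article used, would already bundle both.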

Deploying DeepSeek on 96 H100 GPUs