Super helpful to see concrete examples of what it can (roughly) look like to deploy production inference workloads, along with the latest optimization efforts.
I consult in this space, and clients still don't fully understand how complex it can get just to "run your own LLM".