Ask HN: Who uses open LLMs and coding assistants locally? Share setup and laptop

1. kabes ◴[31 Oct 25 21:10 UTC] No.45776731[source]▶

Let's say I have a server with an h200 gpu at home. What's the best open model for coding I can run on it today? And is it somewhat competitive with commercial models like sonnet 4.5?

replies(3): >>45776946 #>>45777002 #>>45777030 #

2. skhameneh ◴[31 Oct 25 21:33 UTC] No.45776946[source]▶

>>45776731 (TP) #

That's still very limiting when comparing to commercial models. To be truly competitive with commercial offerings the bar is closer to 4-8x that for one node .

That said, maybe a quantized version of GLM 4.5 Air, but if we're talking no hardware constraints I find some of the responses from LongCat-Chat-Flash to be favorable over Sonnet when playing around with LMArena.

3. hamdingers ◴[31 Oct 25 21:40 UTC] No.45777002[source]▶

>>45776731 (TP) #

If you do, damn bro

I played around with renting H200s and coding with aider and gpt-oss 120b. It was impressive but not at the level of claude. I decided buying $30k worth of tokens made far more sense than buying 30k worth of one GPU.

4. suprjami ◴[31 Oct 25 21:43 UTC] No.45777030[source]▶

>>45776731 (TP) #

If you have ~$25k to buy a H200 then don't buy one. Rent them out much cheaper and keep renting newer models when your H200 becomes an outdated paperweight.

Assuming you ran inference for the full working day, you'd need to run your H200 for almost 2 years to break even. Realistically you don't run inference full time so you'll never realise the value of the card before it's obsolete.

replies(1): >>45779674 #

5. kabes ◴[01 Nov 25 06:37 UTC] No.45779674[source]▶

>>45777030 #

The company I work for is in the defense industry and by contract can't send any code outside their own datacenter. So cloud-rented H200's are a no-go and obviously commercial LLM's as well. so breaking even is not the goal here.

replies(1): >>45785594 #

6. suprjami ◴[01 Nov 25 21:31 UTC] No.45785594{3}[source]▶

>>45779674 #

In that case I suggest you buy cheaper desktop cards instead of a H200. Two or three 5090s will let you run decent models at very good speed.