I’m curious about how good the performance with local LLMs is on ‘outdated’ hardware like the author’s 2060. I have a desktop with a 2070 super that it could be fun to turn into an “AI server” if I had the time…
I am using an old laptop with a GTX 1060 (6 GB VRAM) to run a home server with Ubuntu and Ollama. Thanks to quantization, Ollama can run 7B/8B models even on an 8-year-old laptop GPU with 6 GB of VRAM.
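For anyone wanting to try this, a rough sketch of the setup (model tags and sizes are examples; Ollama's default 8B tags ship as ~4-bit quants, which is what makes them fit in 6 GB):

```shell
# Install Ollama on Ubuntu (official install script)
curl -fsSL https://ollama.com/install.sh | sh

# Pull an 8B model; the default tag is a ~4-bit quantization,
# roughly 4-5 GB of weights, so it fits in 6 GB of VRAM
ollama pull llama3.1:8b

# Chat with it interactively
ollama run llama3.1:8b

# Check what's loaded and how much of it landed on the GPU
ollama ps
```

The back-of-envelope math: an 8B-parameter model at 16-bit would need ~16 GB just for weights, but at 4 bits it's ~4 GB, leaving headroom for the KV cache on a 6 GB card. If a model only partially fits, Ollama splits layers between GPU and CPU, which still works but is noticeably slower.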