←back to thread

602 points emrah | 2 comments | | HN request time: 0.422s | source
Show context
holografix ◴[] No.43743631[source]
Could 16gb vram be enough for the 27b QAT version?
replies(5): >>43743634 #>>43743704 #>>43743825 #>>43744249 #>>43756253 #
1. jffry ◴[] No.43743704[source]
With `ollama run gemma3:27b-it-qat "What is blue"`, GPU memory usage is just a hair over 20GB, so no, probably not without a nerfed context window
replies(1): >>43743804 #
2. woadwarrior01 ◴[] No.43743804[source]
Indeed, the default context length in ollama is a mere 2048 tokens.