←back to thread

602 points emrah | 1 comments | | HN request time: 0.202s | source
1. gigel82 ◴[] No.43748263[source]
FWIW, the 27b Q4_K_M takes about 23Gb of VRAM with 4k context and 29Gb with 16k context and runs at ~61t/s on my 5090.