164 points ksec | 1 comment
1. frankfrank13 No.44500703
Would love to try this in ollama/llama.cpp. Using llama.cpp with VS Code is painful since, realistically, I can only generate fewer than 100 tokens at a time.