I fixed some chat template issues for llama.cpp and other inference engines! To run the model with the fixes, do:
./llama.cpp/llama-cli -hf unsloth/SmolLM3-3B-GGUF:Q4_K_XL --jinja -ngl 99
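For anyone new to the flags, a quick breakdown (these are standard llama-cli options):

  -hf unsloth/SmolLM3-3B-GGUF:Q4_K_XL  # download that GGUF quant from Hugging Face
  --jinja                              # apply the Jinja chat template embedded in the GGUF (the part the fix touches)
  -ngl 99                              # offload up to 99 model layers to the GPU

Without --jinja, llama.cpp falls back to its built-in template handling, so you need it to actually pick up the fixed chat template.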