684 points by prettyblocks | 1 comment

I mean anything in the 0.5B-3B range that's available on Ollama (for example). Have you built any cool tooling that uses these models as part of your workflow?
1. kianN (No.42787162)
I don’t know if this counts as tiny, but I use Llama 3B in prod for summarization (kinda).

Its effective context window is pretty small, but I have a much more robust statistical model that handles thematic extraction. The LLM is essentially just rewriting ~5-10 sentences into a single paragraph.
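A minimal sketch of that kind of narrow rewriting task, assuming a local Ollama server and its `/api/generate` endpoint; the model tag (`llama3.2:3b`) and prompt wording are illustrative, not the commenter's actual setup:

```python
import json
import urllib.request

def build_prompt(sentences):
    """Keep the model's job minimal: rewrite the given sentences, nothing more."""
    joined = "\n".join(f"- {s}" for s in sentences)
    return (
        "Rewrite the following sentences as a single cohesive paragraph. "
        "Do not add any new information.\n" + joined
    )

def rewrite_paragraph(sentences,
                      model="llama3.2:3b",  # hypothetical small-model tag
                      url="http://localhost:11434/api/generate"):
    # Non-streaming request so the full response arrives as one JSON object.
    payload = json.dumps({
        "model": model,
        "prompt": build_prompt(sentences),
        "stream": False,
    }).encode()
    req = urllib.request.Request(
        url, data=payload, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["response"]
```

Because the statistical model has already picked the sentences, the prompt constrains the LLM to pure rewriting, which is why a small model suffices.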

I’ve found that the less you need the language model to actually do, the less the model's size and quality matter.