684 points prettyblocks | 1 comments | 21 Jan 25 19:39 UTC

I mean anything in the 0.5B-3B range that's available on Ollama (for example). Have you built any cool tooling that uses these models as part of your workflow?

jothflee ◴[22 Jan 25 00:31 UTC] No.42787155[source]▶

>>42784365 (OP) #

when i feel like casually listening to something, instead of netflix/hulu/whatever, i'll run a ~3b model (qwen 2.5 or llama 3.2) and generate an audio stream of water cooler office gossip. (when it is up, it runs here: https://water-cooler.jothflee.com).

some of the situations get pretty wild, for the office :)

replies(2): >>42788384 #>>42796474 #

1. jftuga ◴[22 Jan 25 02:54 UTC] No.42788384[source]▶

>>42787155 #

What prompt are you using for this?

Ask HN: Is anyone doing anything cool with tiny language models?