←back to thread

684 points prettyblocks | 1 comments | | HN request time: 0.266s | source

I mean anything in the 0.5B-3B range that's available on Ollama (for example). Have you built any cool tooling that uses these models as part of your work flow?
Show context
jothflee ◴[] No.42787155[source]
when i feel like casually listening to something, instead of netflix/hulu/whatever, i'll run a ~3b model (qwen 2.5 or llama 3.2) and generate and audio stream of water cooler office gossip. (when it is up, it runs here: https://water-cooler.jothflee.com).

some of the situations get pretty wild, for the office :)

replies(2): >>42788384 #>>42796474 #
1. jftuga ◴[] No.42788384[source]
What prompt are you using for this?