←back to thread

111 points galeos | 1 comments | | HN request time: 0.001s | source
Show context
balazstorok ◴[] No.43714642[source]
Does someone have a good understanding how 2B models can be useful in production? What tasks are you using them for? I wonder what tasks you can fine-tune them on to produce 95-99% results (if anything).
replies(7): >>43714663 #>>43714744 #>>43714864 #>>43714922 #>>43714969 #>>43715153 #>>43715192 #
meltyness ◴[] No.43715192[source]
I'm more interested in how users are taking 95-99% to 99.99% for generation-assisted tasks. I haven't seen a review or study of techniques, even though on the ground it's pretty trivial to think of some candidates.
replies(1): >>43716572 #
1. oezi ◴[] No.43716572{3}[source]
Three strategies seem to be:

- Use LLM to evaluate result and retry if it doesn't match.

- let users trigger a retry

- let users edit