(arxiv.org)

111 points galeos | 1 comments | 17 Apr 25 07:27 UTC | HN request time: 0.207s | source

Show context

balazstorok ◴[17 Apr 25 09:35 UTC] No.43714642[source]▶

Does someone have a good understanding how 2B models can be useful in production? What tasks are you using them for? I wonder what tasks you can fine-tune them on to produce 95-99% results (if anything).

replies(7): >>43714663 #>>43714744 #>>43714864 #>>43714922 #>>43714969 #>>43715153 #>>43715192 #

meltyness ◴[17 Apr 25 11:14 UTC] No.43715192[source]▶

>>43714642 #

I'm more interested in how users are taking 95-99% to 99.99% for generation-assisted tasks. I haven't seen a review or study of techniques, even though on the ground it's pretty trivial to think of some candidates.

replies(1): >>43716572 #

1. oezi ◴[17 Apr 25 13:32 UTC] No.43716572[source]▶

>>43715192 #

Three strategies seem to be:

- Use LLM to evaluate result and retry if it doesn't match.

- let users trigger a retry

- let users edit

↑

BitNet b1.58 2B4T Technical Report