BitNet b1.58 2B4T Technical Report (arxiv.org)
111 points | galeos | 1 comment | 17 Apr 25 07:27 UTC
balazstorok [17 Apr 25 09:35 UTC] No. 43714642
>>43714004 (OP)
Does anyone have a good understanding of how 2B models can be useful in production? What tasks are you using them for? I wonder which tasks you can fine-tune them on to get 95-99% results (if any).
replies(7): >>43714663 >>43714744 >>43714864 >>43714922 >>43714969 >>43715153 >>43715192
1. snovv_crash [17 Apr 25 11:06 UTC] No. 43715153
>>43714642
Anything you'd normally train a smaller custom model for, but with an LLM you can use a prompt instead of training.
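
As a rough illustration of the "prompt instead of training" idea, here is a minimal sketch of a zero-shot text classifier. Nothing in it comes from the thread or the BitNet report: the label set, the prompt wording, and the `generate` callable (a stand-in for whatever small model you actually run) are all assumptions made for the example.

```python
from typing import Callable, Sequence

# Hypothetical label set, chosen only for illustration.
LABELS = ["billing", "bug report", "feature request"]


def build_prompt(text: str, labels: Sequence[str]) -> str:
    """Build a zero-shot classification prompt that lists the allowed labels."""
    options = ", ".join(labels)
    return (
        f"Classify the message into exactly one of: {options}.\n"
        f"Message: {text}\n"
        "Answer with the label only."
    )


def parse_label(reply: str, labels: Sequence[str], default: str) -> str:
    """Map a free-form model reply onto an allowed label (first match wins)."""
    cleaned = reply.strip().lower()
    for label in labels:
        if label.lower() in cleaned:
            return label
    return default


def classify(
    text: str,
    labels: Sequence[str],
    generate: Callable[[str], str],
    default: str = "unknown",
) -> str:
    """Classify text by prompting a model; `generate` is any prompt -> reply function."""
    return parse_label(generate(build_prompt(text, labels)), labels, default)


def fake_generate(prompt: str) -> str:
    """Stand-in for a real model call (assumption): always answers 'Billing'."""
    return " Billing\n"


result = classify("I was charged twice this month.", LABELS, fake_generate)
```

Swapping `fake_generate` for a real model call is the only change needed to try this against an actual 2B model; whether the accuracy is good enough for production is something you would have to measure on your own data.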