/top/
/new/
/best/
/ask/
/show/
/job/
^
slacker news
login
about
←back to thread
BitNet b1.58 2B4T Technical Report
(arxiv.org)
111 points
galeos
| 2 comments |
17 Apr 25 07:27 UTC
|
HN request time: 0.291s
|
source
Show context
balazstorok
◴[
17 Apr 25 09:35 UTC
]
No.
43714642
[source]
▶
>>43714004 (OP)
#
Does someone have a good understanding how 2B models can be useful in production? What tasks are you using them for? I wonder what tasks you can fine-tune them on to produce 95-99% results (if anything).
replies(7):
>>43714663
#
>>43714744
#
>>43714864
#
>>43714922
#
>>43714969
#
>>43715153
#
>>43715192
#
1.
throwaway314155
◴[
17 Apr 25 09:39 UTC
]
No.
43714663
[source]
▶
>>43714642
#
Summarization on mobile/embedded might be a good usecase?
replies(1):
>>43716601
#
ID:
GO
2.
◴[
17 Apr 25 13:35 UTC
]
No.
43716601
[source]
▶
>>43714663 (TP)
#
↑