1-Bit AI Infrastructure
(arxiv.org)
157 points
galeos
| 4 comments |
15 Nov 24 14:28 UTC
1.
ttyprintk
◴[
15 Nov 24 14:32 UTC
]
No.
42147278
>>42147252 (OP)
#
Later work on a4.8 quantization by some of the same team:
https://news.ycombinator.com/item?id=42092724
https://arxiv.org/abs/2411.04965
replies(1):
>>42191454
#
2.
skavi
◴[
20 Nov 24 07:12 UTC
]
No.
42191454
>>42147278 (TP)
#
and the repo for this project:
https://github.com/microsoft/BitNet
replies(1):
>>42192368
#
3.
sinuhe69
◴[
20 Nov 24 10:02 UTC
]
No.
42192368
>>42191454
#
The demo they showed was full of repeated sentences. The 3B model looks quite dense, TBH. Did they just want to show the speed?
replies(1):
>>42193481
#
4.
newswasboring
◴[
20 Nov 24 12:50 UTC
]
No.
42193481
>>42192368
#
3B models, especially in a quantized state, almost always behave like this.