←back to thread

157 points galeos | 1 comments | | HN request time: 0s | source
Show context
ttyprintk ◴[] No.42147278[source]
Later a4.8 quantization by some of the same team:

https://news.ycombinator.com/item?id=42092724

https://arxiv.org/abs/2411.04965

replies(1): >>42191454 #
skavi ◴[] No.42191454[source]
and the repo for this project: https://github.com/microsoft/BitNet
replies(1): >>42192368 #
sinuhe69 ◴[] No.42192368[source]
The demo they showed was full of repeated sentences. The 3B model looks quite dense, TBH. Did they just want to show the speed?
replies(1): >>42193481 #
1. newswasboring ◴[] No.42193481[source]
3B models, especially in quantized state, almost always behave like this.