Lots of model releases are like this. We can only upvote.
We can't run the model on our personal computers.
Nor can we test their 'Efficient Attention' concept ourselves.
Honestly, it would take about 24 hours just to download the 98 GB model if I wanted to try it out (assuming I even had a card with 98 GB of VRAM).
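For what it's worth, that 24-hour figure works out to roughly a 10 Mbit/s connection. A quick back-of-envelope sketch (the speeds are just illustrative, not anyone's actual bandwidth):

```python
# Back-of-envelope: time to download a 98 GB model at various link speeds.
# 98 GB taken as decimal gigabytes (98e9 bytes); speeds in megabits per second.
MODEL_BYTES = 98e9

for mbps in (10, 100, 1000):
    bytes_per_sec = mbps * 1e6 / 8   # megabits/s -> bytes/s
    hours = MODEL_BYTES / bytes_per_sec / 3600
    print(f"{mbps:>5} Mbit/s -> {hours:.1f} hours")
```

At 10 Mbit/s it comes out to roughly 22 hours, so the 24-hour estimate is about right for a slow home connection; on gigabit fiber it would be well under an hour.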
People here can absolutely afford the ~$2/hour cloud rental cost for an H100, or even eight of them (OCI has cheap H100 nodes). Most people are too lazy to even try, and frankly I'm glad: my very high salary depends on being someone who isn't too lazy to spin up a cloud instance.
Not to mention that some of us have enough disposable income to buy an RTX Pro 6000, run our stuff locally, and finally scale up our model training a little.