
S1: A $6 R1 competitor?

(timkellogg.me)
851 points | tkellogg | 2 comments
bloomingkales ◴[] No.42949616[source]
If an LLM output is like a sculpture, then we have to sculpt it. I've never done sculpting, but I do know they first get the clay spinning on a wheel.

Whatever you want to call this “reasoning” step, ultimately it really is just throwing the model into a game loop. We want to interact with it on each tick (spin the clay), and sculpt every second until it looks right.

You will need to loop against an LLM to do just about anything and everything, forever - this is the default workflow.

Those who think we will quell our thirst for compute have another thing coming; we're going to be insatiable with how much LLM brute-force looping we will do.

replies(3): >>42955281 #>>42955806 #>>42956482 #
zoogeny ◴[] No.42955806[source]
I can't believe this hasn't been done yet; perhaps it's a cost issue.

My literal first thought about AI was wondering why we couldn't just put it in a loop. Heck, one update per day, or even one update per hour, would be a start. You have a running "context"; the output becomes the next context (or a set of transformations on a context that is a bit larger than the output window). Then ramp that up ... one loop per minute, one per second, millisecond, microsecond.
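
A minimal sketch of that loop in Python. The call_llm stub is a placeholder for whatever chat-completion client you actually use, and the prompt wording, tick interval, and max_ticks are only illustrative assumptions:

    import time

    def call_llm(prompt: str) -> str:
        # Placeholder: wire this up to your model/provider of choice.
        raise NotImplementedError

    def run_loop(seed_context: str, tick_seconds: float = 60.0, max_ticks: int = 1440) -> str:
        # Each tick, the model sees the running context and emits the next one.
        context = seed_context
        for _ in range(max_ticks):
            prompt = (
                "Here is your running context:\n\n"
                f"{context}\n\n"
                "Update it: fold in anything new you can infer, drop what no "
                "longer matters, and keep it within the output window."
            )
            context = call_llm(prompt)
            time.sleep(tick_seconds)  # one update per minute; shrink this to ramp up
        return context

Ramping up is then just lowering tick_seconds and paying for the extra calls, which is where the cost issue comes in.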

replies(2): >>42955958 #>>42956117 #
layer8 ◴[] No.42955958[source]
Same. And the next step is that it must feed back into training, to form long-term memory and to continually learn.
replies(1): >>42955988 #
1. zoogeny ◴[] No.42955988{3}[source]
I analogize this with sleep. Perhaps that is what is needed: 6 hours offline per day to LoRA the base model on some accumulated context from the day.
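
A rough sketch of that nightly "sleep" step using Hugging Face peft, assuming the day's interactions have already been dumped to a list of text snippets; the base model name, hyperparameters, and the sleep_cycle/days_context names are placeholder assumptions, not anything the thread prescribes:

    from datasets import Dataset
    from peft import LoraConfig, get_peft_model
    from transformers import (
        AutoModelForCausalLM,
        AutoTokenizer,
        DataCollatorForLanguageModeling,
        Trainer,
        TrainingArguments,
    )

    BASE_MODEL = "meta-llama/Llama-3.1-8B"  # placeholder; any causal LM works

    def sleep_cycle(days_context: list[str], output_dir: str = "adapters/today") -> None:
        # Nightly "sleep": train a small LoRA adapter on the day's accumulated
        # context while the base weights stay frozen.
        tokenizer = AutoTokenizer.from_pretrained(BASE_MODEL)
        tokenizer.pad_token = tokenizer.eos_token
        model = AutoModelForCausalLM.from_pretrained(BASE_MODEL)
        model = get_peft_model(
            model,
            LoraConfig(r=8, lora_alpha=16, lora_dropout=0.05,
                       target_modules=["q_proj", "v_proj"], task_type="CAUSAL_LM"),
        )

        dataset = Dataset.from_dict({"text": days_context}).map(
            lambda batch: tokenizer(batch["text"], truncation=True, max_length=1024),
            batched=True,
            remove_columns=["text"],
        )

        Trainer(
            model=model,
            args=TrainingArguments(output_dir=output_dir, num_train_epochs=1,
                                   per_device_train_batch_size=1, learning_rate=1e-4),
            train_dataset=dataset,
            data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
        ).train()

        model.save_pretrained(output_dir)  # load this adapter on top of the base model tomorrow

The appeal of LoRA here is that each day's "memory" is a few megabytes of adapter weights rather than a full fine-tune, so days can be kept, stacked, or discarded cheaply.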
replies(1): >>42965779 #
2. dev0p ◴[] No.42965779[source]
LLMs need to sleep too. Do they dream of electric sheep?