(substack.com)

129 points jxmorris12 | 4 comments | 21 Feb 25 18:23 UTC | HN request time: 0.919s | source

1. munchler ◴[21 Feb 25 22:19 UTC] No.43133737[source]▶

> Yann Lecun ... argued that because language models generate outputs token-by-token, and each token introduces a new probability of error, if we generate outputs that are too long, this per-token error will compound to inevitable failure.

That seems like a poor argument. Each word a human utters also has a chance of being wrong, yet somehow we have been successful overall.

replies(1): >>43133972 #

2. baobabKoodaa ◴[21 Feb 25 22:42 UTC] No.43133972[source]▶

>>43133737 (TP) #

Human errors don't compound the same way. Arguably.

replies(1): >>43134066 #

3. munchler ◴[21 Feb 25 22:51 UTC] No.43134066[source]▶

>>43133972 #

Right, and neither do LLM errors. They can go off course, and then get back on course, just like humans do.

replies(1): >>43136648 #

4. baobabKoodaa ◴[22 Feb 25 06:37 UTC] No.43136648{3}[source]▶

>>43134066 #

I mostly agree with you. Just pointing out what the argument was.

I think our agreement ends if we consider long-running tasks. A human can work a long time on a task like "find a way to make money". An AI, on the other hand, as it gets further and further from human input into autoregressive territory, is more and more likely to become "stuck" on a road to nowhere and needs human intervention to get unstuck.

↑

I think Yann Lecun was right about LLMs (but perhaps only by accident)