
129 points | jxmorris12 | 1 comment
1. emmanueloga_
LeCun's thesis: "if we generate outputs that are too long, the per-token error will compound to inevitable failure".

> The finding that language models can get better by generating longer outputs directly contradicts Yann’s hypothesis.

The author's examples only show that the error stays low for a handful of outputs of a particular length. That doesn't contradict LeCun's claim about sufficiently long outputs, afaict.
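
For concreteness, the compounding argument is just exponential decay: if each token independently goes wrong with some probability eps, then an n-token output is fully correct with probability (1 - eps)^n. A minimal sketch (the error rate and lengths below are made-up illustrative numbers, not figures from the article or from LeCun):

    # Illustrative sketch of the error-compounding argument.
    # Assumption: each token independently goes wrong with probability eps,
    # so an n-token output is fully correct with probability (1 - eps) ** n.

    def p_success(eps: float, n: int) -> float:
        """Probability that all n tokens are generated without error."""
        return (1.0 - eps) ** n

    eps = 1e-3  # hypothetical per-token error rate
    for n in (100, 1_000, 10_000, 100_000):
        print(f"n={n:>7,}: P(all tokens correct) = {p_success(eps, n):.4f}")

    # n=    100: P(all tokens correct) = 0.9048
    # n=  1,000: P(all tokens correct) = 0.3677
    # n= 10,000: P(all tokens correct) = 0.0000
    # n=100,000: P(all tokens correct) = 0.0000

With eps = 0.001, success is still likely at a few hundred tokens but essentially impossible at tens of thousands, so a few good outputs at one moderate length don't by themselves settle the question.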