Ask HN: Any insider takes on Yann LeCun's push against current architectures?

1. killthebuddha ◴[14 Mar 25 18:12 UTC] No.43365456[source]▶

I've always felt like the argument is super flimsy because "of course we can _in theory_ do error correction". I've never seen even a semi-rigorous argument that error correction is _theoretically_ impossible. Do you have a link to somewhere where such an argument is made?

replies(3): >>43366044 #>>43367051 #>>43370111 #

2. aithrowawaycomm ◴[14 Mar 25 19:03 UTC] No.43366044[source]▶

>>43365456 (TP) #

In theory transformers are Turing-complete and LLMs can do anything computable. The more down-to-earth argument is that transformer LLMs aren't able to correct errors in a systematic way like Lecun is describing: it's task-specific "whack-a-mole," involving either tailored synthetic data or expensive RLHF.

In particular, if you train an LLM to do Task A and Task B with acceptable accuracy, that does not guarantee it can combine the tasks in a common-sense way. "For each step of A, do B on the intermediate results" is a whole new Task C that likely needs to be fine-tuned. (This one actually does have some theoretical evidence coming from computational complexity, and it was the first thing I noticed in 2023 when testing chain-of-thought prompting. It's not that the LLM can't do Task C, it just takes extra training.)

3. vhantz ◴[14 Mar 25 20:40 UTC] No.43367051[source]▶

>>43365456 (TP) #

> of course we can _in theory_ do error correction

Oh yeah? This is begging the question.

4. tyronehed ◴[15 Mar 25 05:02 UTC] No.43370111[source]▶

>>43365456 (TP) #

As soon as you need to start leaning heavily on error correction, that is an indication that your architecture and solution is not correct. The final solution will need to be elegant and very close to a perfect solution immediately.

You must always keep close to the only known example we have of an intelligence which is the human brain. As soon as you start to wander away from the way the human brain does it, you are on your own and you are not relying on known examples of intelligence. Certainly that might be possible, but since there's only one known example in this universe of intelligence, it seems ridiculous to do anything but stick close to that example, which is the human brain.