385 points vessenes | 1 comments | 10 Mar 25 19:41 UTC | HN request time: 0.194s | source

So, Lecun has been quite public saying that he believes LLMs will never fix hallucinations because, essentially, the token choice method at each step leads to runaway errors -- these can't be damped mathematically.

In exchange, he offers the idea that we should have something that is an 'energy minimization' architecture; as I understand it, this would have a concept of the 'energy' of an entire response, and training would try and minimize that.

Which is to say, I don't fully understand this. That said, I'm curious to hear what ML researchers think about Lecun's take, and if there's any engineering done around it. I can't find much after the release of ijepa from his group.

Show context

janalsncm ◴[14 Mar 25 19:14 UTC] No.43366161[source]▶

>>43325049 (OP) #

I am an MLE not an expert. However, it is a fundamental problem that our current paradigm of training larger and larger LLMs cannot ever scale to the precision people require for many tasks. Even in the highly constrained realm of chess, an enormous neural net will be outclassed by a small program that can run on your phone.

https://arxiv.org/pdf/2402.04494

replies(3): >>43366173 #>>43366198 #>>43368618 #

throw310822 ◴[14 Mar 25 19:16 UTC] No.43366173[source]▶

>>43366161 #

> Even in the highly constrained realm of chess, an enormous neural net will be outclassed by a small program that can run on your phone.

This is true also for the much bigger neural net that works in your brain, and even if you're the world champion of chess. Clearly your argument doesn't hold water.

replies(1): >>43366869 #

janalsncm ◴[14 Mar 25 20:22 UTC] No.43366869[source]▶

>>43366173 #

For the sake of argument let’s say an artificial neural net is approximately the same as the brain. It sounds like you agree with me that smaller programs are both more efficient and more effective than a larger neural net. So you should also agree with me that those who say the only path to AGI is LLM maximalism are misguided.

replies(2): >>43366985 #>>43367106 #

1. jpadkins ◴[14 Mar 25 20:33 UTC] No.43366985[source]▶

>>43366869 #

smaller programs are better than artificial or organic neural net for constrained problems like chess. But chess programs don't generalize to any other intelligence applications, like how organic neural nets do today.

↑

Ask HN: Any insider takes on Yann LeCun's push against current architectures?