Yann LeCun to depart Meta and launch AI startup focused on 'world models'

(www.nasdaq.com)

760 points MindBreaker2605 | 1 comments | 12 Nov 25 07:25 UTC

sebmellen ◴[12 Nov 25 07:57 UTC] No.45897467[source]▶

Making LeCun report to Wang was the most boneheaded move imaginable. But… I suppose Zuckerberg knows what he wants, which is AI slopware and not truly groundbreaking foundation models.

replies(20): >>45897481 #>>45897498 #>>45897518 #>>45897885 #>>45897970 #>>45897978 #>>45898040 #>>45898053 #>>45898092 #>>45898108 #>>45898186 #>>45898539 #>>45898651 #>>45898727 #>>45899160 #>>45899375 #>>45900884 #>>45900885 #>>45901421 #>>45903451 #

gnaman ◴[12 Nov 25 08:02 UTC] No.45897498[source]▶

>>45897467 #

He is also not very interested in LLMs, and that seems to be Zuck's top priority.

replies(2): >>45897523 #>>45898412 #

tinco ◴[12 Nov 25 08:05 UTC] No.45897523[source]▶

>>45897498 #

Yeah, I think LeCun is underestimating the impact that LLMs and Diffusion models are going to have, even considering the huge impact they're already having. That's no problem, as I'm sure whatever LeCun is working on is going to be amazing as well, but an enterprise like Facebook can't have its top researcher work on risky things when there are surefire paths to success still available.

replies(12): >>45897552 #>>45897567 #>>45897579 #>>45897666 #>>45897673 #>>45898027 #>>45898041 #>>45898615 #>>45898873 #>>45899785 #>>45900106 #>>45900288 #

fxtentacle ◴[12 Nov 25 08:25 UTC] No.45897666{3}[source]▶

>>45897523 #

LLMs and Diffusion solve a completely different problem than world models.

If you want to predict future text, you use an LLM. If you want to predict future frames in a video, you go with Diffusion. But what both of them lack is object permanence. If a car isn't visible in the input frame, it won't be visible in the output. But in the real world, there are A LOT of things that are invisible (image) or not mentioned but only implied (text) that still strongly affect the future. Every kid knows that when you roll a marble behind your hand, it'll come out on the other side. But LLMs and Diffusion models routinely fail to predict that, as for them the object disappears when it stops being visible.

Based on what I heard from others, world models are considered the missing ingredient for useful robots and self-driving cars. If that's halfway accurate, it would make sense to pour A LOT of money into world models, because they will unlock high-value products.

replies(5): >>45897717 #>>45897731 #>>45897916 #>>45898447 #>>45900906 #

1. Workaccount2 ◴[12 Nov 25 14:48 UTC] No.45900906{4}[source]▶

>>45897666 #

>But what both of them lack is object permanence.

This was true last year, but it's hanging on by a thread this year. Genie shows this off really well, and it's in the video models too.[1]

[1]https://storage.googleapis.com/gdm-deepmind-com-prod-public/...
