Yann LeCun to depart Meta and launch AI startup focused on 'world models'

(www.nasdaq.com)

765 points MindBreaker2605 | 5 comments | 12 Nov 25 07:25 UTC | HN request time: 0.2s | source

Show context

sebmellen ◴[12 Nov 25 07:57 UTC] No.45897467[source]▶

Making LeCun report to Wang was the most boneheaded move imaginable. But… I suppose Zuckerberg knows what he wants, which is AI slopware and not truly groundbreaking foundation models.

replies(20): >>45897481 #>>45897498 #>>45897518 #>>45897885 #>>45897970 #>>45897978 #>>45898040 #>>45898053 #>>45898092 #>>45898108 #>>45898186 #>>45898539 #>>45898651 #>>45898727 #>>45899160 #>>45899375 #>>45900884 #>>45900885 #>>45901421 #>>45903451 #

ACCount37 ◴[12 Nov 25 09:06 UTC] No.45897970[source]▶

>>45897467 #

That was obviously him getting sidelined. And it's easy to see why.

LLMs get results. None of the Yann LeCun's pet projects do. He had ample time to prove that his approach is promising, and he didn't.

replies(3): >>45898088 #>>45898122 #>>45898749 #

1. chaoz_ ◴[12 Nov 25 11:10 UTC] No.45898749[source]▶

>>45897970 #

I agree. I never understood LeCun's statement that we need to pivot toward the visual aspects of things because the bitrate of text is low while visual input through the eye is high.

Text and languages contain structured information and encode a lot of real-world complexity (or it's "modelling" that).

Not saying we won't pivot to visual data or world simulations, but he was clearly not the type of person to compete with other LLM research labs, nor did he propose any alternative that could be used to create something interesting for end-users.

replies(3): >>45898776 #>>45900490 #>>45901977 #

2. ACCount37 ◴[12 Nov 25 11:15 UTC] No.45898776[source]▶

>>45898749 (TP) #

If LeCun's research has made Meta a powerhouse of video generation or general purpose robotics - the two promising directions that benefit from working with visual I/O and world modeling as LeCun sees it - it could have been a justified detour.

But that sure didn't happen.

3. tarsinge ◴[12 Nov 25 14:18 UTC] No.45900490[source]▶

>>45898749 (TP) #

Text and language contain only approximate information filtered through humans eyes and brains. Also animals don't have language and can show quite advanced capabilities compared to what we can currently do in robotics. And if you do enough mindfulness you can dissociate cognition/consciousness from language. I think we are lured because how important language is for us humans, but intuitively it's obvious to me language (and LLMs) are only a subcomponent, or even irrelevant for say self driving or robotics.

replies(1): >>45901229 #

4. ◴[12 Nov 25 15:15 UTC] No.45901229[source]▶

>>45900490 #

5. KaiserPro ◴[12 Nov 25 16:16 UTC] No.45901977[source]▶

>>45898749 (TP) #

Thats where the research is leading.

The issue is context. trying to make an AI assistant with just text only inputs is doeable but limiting. You need to know the _context_ of all the data, and without visual input most of it is useful.

For example "Where is the other half of this" is almost impossible to solve unless you have an idea of what "this" is.

but to do that you need to have cameras, to use cameras you need to have position, object, and people tracking. And that is a hard problem thats not solved.

the hypothesis is that "world models" solve that with an implicit understanding of the worl and the objects in context

↑