OpenAI, Google and Anthropic are struggling to build more advanced AI

1. osigurdson ◴[15 Nov 24 06:35 UTC] No.42144420[source]▶

This "running out of data" thing suggests that there is something fundamentally wrong with how things are working. A new driver does not need to experience 8000 different rabbit-on-road situations from all angles to know to slow down when we see one on the road. Similarly we don't need 10,000 addition examples to learn how to add. It is as though there is no generalization in the models - just fundamentally search.

replies(2): >>42144498 #>>42149778 #

2. surrTurr ◴[15 Nov 24 06:53 UTC] No.42144498[source]▶

>>42144420 (TP) #

i think you underestimate the amount of data a driver experiences in a single 5 minute drive

replies(2): >>42144649 #>>42159546 #

3. eslaught ◴[15 Nov 24 07:28 UTC] No.42144649[source]▶

>>42144498 #

I never get this argument.

I've seen a deer on a road maybe once. I've seen a rabbit on a road zero times. But I know what to do if I see one.

Is that because the "video" of my perception has many "frames"? Even if that's true at some level, I think it's massively missing the point. Yeah, so I saw that one deer from a lot of angles. But current AI training is like the equivalent of taking every deer that has ever been on camera in the history of the human species.

Somehow I'm still dramatically better at generalization than the AI. Surely that's an algorithm difference.

replies(1): >>42144889 #

4. visarga ◴[15 Nov 24 08:18 UTC] No.42144889{3}[source]▶

>>42144649 #

You might personally have seen a deer just once, but human evolution, and animal evolution prior to that have practiced this skill a lot. AI doesn't have the advantage of evolutionary priors baked in, so it needs explicit walking through many combinations to infer its structure from data, and is remarkably efficient. GPT-4 'only' trained on the amount of language that 30,000 humans use in their lifetime.

But we have seen from AlphaGo that when training data is extensive, it can rediscover strategy on its own and even surpass us. It's not inherently worse than human learning.

replies(2): >>42145974 #>>42148631 #

5. RivieraKid ◴[15 Nov 24 11:32 UTC] No.42145974{4}[source]▶

>>42144889 #

Human DNA is just 750 and only a fraction of it is something that may be called "brain pre-training".

6. jaculabilis ◴[15 Nov 24 16:55 UTC] No.42148631{4}[source]▶

>>42144889 #

> You might personally have seen a deer just once, but human evolution, and animal evolution prior to that have practiced this skill a lot.

Which pre-human animals evolved instincts for swerving a car to avoid a deer?

replies(1): >>42150433 #

7. slashdave ◴[15 Nov 24 18:59 UTC] No.42149778[source]▶

>>42144420 (TP) #

Deep learning is the very opposite of generalization.

replies(1): >>42170301 #

8. osigurdson ◴[15 Nov 24 20:14 UTC] No.42150433{5}[source]▶

>>42148631 #

I'm pretty sure that evolution would select out anything that could not generalize pretty quickly.

9. qnleigh ◴[16 Nov 24 21:31 UTC] No.42159546[source]▶

>>42144498 #

A charitable interpretation of what you're saying is that humans produce lots of original data from their experiences of the world, like thinking about their experiences, imagining what they would have done differently, and perhaps even dreaming. I agree with the root comment that something is fundamentally missing, and probably it is the ability to iteratively learn from one's own experiences, test understanding, and recursively improve.

There are definitely teams working on applying reinforcement learning to LLMs. Maybe that will unlock new potential from finite training data.

10. pas ◴[18 Nov 24 06:42 UTC] No.42170301[source]▶

>>42149778 #

it's not that simple

"""

Intuitively, an overparameterized model will generalize well if the model’s representations capture the essential information necessary for the best model in the model class to perform well

"""

https://iclr-blogposts.github.io/2024/blog/double-descent-de...