The deep learning boom caught almost everyone by surprise

(www.understandingai.org)

306 points slyall | 1 comments | 06 Nov 24 04:05 UTC | HN request time: 0.202s | source

Show context

aithrowawaycomm ◴[06 Nov 24 12:00 UTC] No.42060762[source]▶

>>42057139 (OP) #

I think there is a slight disconnect here between making AI systems which are smart and AI systems which are useful. It’s a very old fallacy in AI: pretending tools which assist human intelligence by solving human problems must themselves be intelligent.

The utility of big datasets was indeed surprising, but that skepticism came about from recognizing the scaling paradigm must be a dead end: vertebrates across the board require less data to learn new things, by several orders of magnitude. Methods to give ANNs “common sense” are essentially identical to the old LISP expert systems: hard-wiring the answers to specific common-sense questions in either code or training data, even though fish and lizards can rapidly make common-sense deductions about manmade objects they couldn’t have possibly seen in their evolutionary histories. Even spiders have generalization abilities seemingly absent in transformers: they spin webs inside human homes with unnatural geometry.

Again it is surprising that the ImageNet stuff worked as well as it did. Deep learning is undoubtedly a useful way to build applications, just like Lisp was. But I think we are about as close to AGI as we were in the 80s, since we have made zero progress on common sense: in the 80s we knew Big Data can poorly emulate common sense, and that’s where we’re at today.

replies(5): >>42061007 #>>42061232 #>>42068100 #>>42068802 #>>42070712 #

j_bum ◴[06 Nov 24 12:17 UTC] No.42061007[source]▶

>>42060762 #

> vertebrates across the board require less data to learn new things, by several orders of magnitude.

Sometimes I wonder if it’s fair to say this.

Organisms have had billions of years of training. We might come online and succeed in our environments with very little data, but we can’t ignore the information that’s been trained into our DNA, so to speak.

What’s billions of years of sensory information that drove behavior and selection, if not training data?

replies(10): >>42062463 #>>42064030 #>>42064183 #>>42064895 #>>42068159 #>>42070063 #>>42071450 #>>42075819 #>>42078291 #>>42085475 #

marcosdumay ◴[06 Nov 24 21:53 UTC] No.42070063[source]▶

>>42061007 #

> but we can’t ignore the information that’s been trained into our DNA

There's around 600MB in our DNA. Subtract this from the size of any LLM out there and see how much you get.

replies(1): >>42072096 #

myownpetard ◴[07 Nov 24 01:04 UTC] No.42072096[source]▶

>>42070063 #

A more fair comparison would be subtract it from the size the of source code required to represent the LLM.

replies(2): >>42072476 #>>42072730 #

marcosdumay ◴[07 Nov 24 02:34 UTC] No.42072730[source]▶

>>42072096 #

The source code is the weights. That's what they learn.

replies(1): >>42073049 #

1. myownpetard ◴[07 Nov 24 03:26 UTC] No.42073049[source]▶

>>42072730 #

I disagree. A neural network is not learning it's source code. The source code specifies the model structure and hyperparameters. Then it compiled and instantiated into some physical medium, usually a bunch of GPUs, and weights are learned.

Our DNA specifies the model structure and hyperparameters for our brains. Then it is compiled and instantiated into a physical medium, our bodies, and our connectome is trained.

If you want to make a comparison about the quantity of information contained in different components of an artificial and a biological system, then it only makes sense if you compare apples to apples. DNA:Code :: Connectome:Weights

↑