
577 points | simonw | 1 comment
AlexeyBrin:
Most likely its training data included countless Space Invaders implementations in various programming languages.
quantumHazer:
And probably some of the synthetic data is generated copies of games already in the dataset?

I have the same feeling about LLM-generated React frontends: they all look the same.

bayindirh:
Last time, somebody asked for a "premium camera app for iOS," and the model (re)generated Halide.

Models don't emit something they don't know. They remix and rewrite what they know. There's no invention, just recall...

satvikpendem:
This doesn't make sense information-theoretically: models are far smaller than the training data they purportedly hold and recall, so some level of "understanding" must be going on. Whether that's the same as human understanding is a different matter.
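The size argument above can be made concrete with a back-of-envelope calculation. All figures below are illustrative assumptions (a hypothetical 70B-parameter model trained on 15T tokens), not measured values for any particular model:

```python
# Back-of-envelope: can a model literally store its training text verbatim?
# All numbers are illustrative assumptions, not measurements.
train_tokens = 15e12      # assumed training-set size, in tokens
bits_per_token = 16       # rough upper bound for raw text per token
params = 70e9             # assumed parameter count
bits_per_param = 16       # fp16/bf16 weights

train_bits = train_tokens * bits_per_token
model_bits = params * bits_per_param

# The training data is far larger than the model's raw storage capacity,
# so verbatim recall of everything is impossible; something must be
# generalized or compressed away.
print(train_bits / model_bits)  # ~214x under these assumptions
```

Under these (assumed) numbers, the model has roughly two hundred times less capacity than the raw text it was trained on, which is the point being made: pure rote storage cannot explain the behavior.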
Eggpants:
It’s a lossy text compression technique. It’s clever applied statistics: essentially an association-rules algorithm, a technique that has been around for decades, modified to account for token order and relative position.

There is no understanding, regardless of the wants of all the capital investors in this domain.
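The "association rules modified to consider order" description can be sketched as a toy order-aware model: count which token follows which, then "generate" by recalling the most frequent continuation. The corpus here is hypothetical, and a real LLM is vastly more elaborate than this, but the flavor of statistical recall is similar:

```python
from collections import Counter, defaultdict

# Toy corpus (hypothetical, for illustration only).
corpus = "the cat sat on the mat the cat ate the fish".split()

# Order-aware association counts: for each token, tally what follows it.
follows = defaultdict(Counter)
for prev, nxt in zip(corpus, corpus[1:]):
    follows[prev][nxt] += 1

def next_token(word: str) -> str:
    # "Generate" by recalling the most common continuation seen in training.
    return follows[word].most_common(1)[0][0]

print(next_token("the"))  # "cat": it followed "the" twice, others only once
```

This captures the "recall, not invention" framing: the output is always a rearrangement of statistics gathered from the training text.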

CamperBob2:
> It’s a lossy text compression technique.

That is a much, much bigger deal than you make it sound.

Compression may, in fact, be all we need. For that matter, it may be all there is.