Most active commenters

simonw(4)
qingcharles(3)

Popular/hot comments

>>44724682 #
>>44724876 #
>>44724300 #

←back to thread

My 2.5 year old laptop can write Space Invaders in JavaScript now (GLM-4.5 Air)

(simonwillison.net)

1. stpedgwdgfhgdd ◴[29 Jul 25 14:30 UTC] No.44723879[source]▶

>>44723316 (OP) #

Aside that space invaders from scratch is not representative for real engineering, it will be interesting to see what the business model for Anthropic will be if I can run a solid code generation model on my local machine (no usage tier per hour or week), let’s say, one year from now. At $200 per month for 2 years I can buy a decent Mx with 64GB (or perhaps even 128GB taking residual value into account)

replies(5): >>44724300 #>>44724450 #>>44724558 #>>44724731 #>>44724993 #

2. falcor84 ◴[29 Jul 25 15:04 UTC] No.44724300[source]▶

>>44723879 (TP) #

How come it's "not representative for real engineering"? Other than copy-pasting existing code (which is not what an LLM does), I don't see how you can create a space invaders game without applying "engineering".

replies(3): >>44724424 #>>44724823 #>>44725919 #

3. phkahler ◴[29 Jul 25 15:15 UTC] No.44724424[source]▶

>>44724300 #

>> Other than copy-pasting existing code (which is not what an LLM does)

I'd like to see someone try to prove this. How many space invaders projects exist on the internet? I'd be hard to compare model "generated" code to everything out there looking for plagiarism, but I bet there are lots of snippets pulled in. These things are NOT smart, they are huge and articulate information repositories.

replies(2): >>44724436 #>>44724794 #

4. simonw ◴[29 Jul 25 15:16 UTC] No.44724436{3}[source]▶

>>44724424 #

Go for it. https://www.google.com/search?client=firefox-b-1-d&q=github+... has a bunch of results. Here's the source code GLM-4.5 Air spat out for me on my laptop: https://github.com/simonw/tools/blob/main/space-invaders-GLM...

Based on my mental model of how these things work I'll be genuinely surprised if you can find even a few lines of code duplicated from one of those projects into the code that GLM-4.5 wrote for me.

replies(2): >>44724682 #>>44724737 #

5. rafaelmn ◴[29 Jul 25 15:16 UTC] No.44724450[source]▶

>>44723879 (TP) #

What about power used and support hardware ? Also card going down means you are down until you get warranty service.

replies(1): >>44725070 #

6. tptacek ◴[29 Jul 25 15:24 UTC] No.44724558[source]▶

>>44723879 (TP) #

OK, go write Space Invaders by hand.

replies(1): >>44725778 #

7. phkahler ◴[29 Jul 25 15:34 UTC] No.44724682{4}[source]▶

>>44724436 #

So I scanned the beginning of the generated code, picked line 83:

  animation: glow 2s ease-in-out infinite;

stuffed it verbatim into google and found a stack overflow discussion that contained this:

      animation: glow .5s infinite alternate;

in under one minute. Then I found this page of CSS effects:

https://alvarotrigo.com/blog/animated-backgrounds-css/

Another page has examples and contains:

  animation: float 15s infinite ease-in-out;

There is just too much internet to scan for an exact match or a match of larger size.

replies(4): >>44724749 #>>44724806 #>>44724817 #>>44725857 #

8. dmortin ◴[29 Jul 25 15:37 UTC] No.44724731[source]▶

>>44723879 (TP) #

" it will be interesting to see what the business model for Anthropic will be if I can run a solid code generation model on my local machine "

Most people won't bother with buying powerful hardware for this, they will keep using SAAS solutions, so Anthropic can be in trouble if cheaper SAAS solutions come out.

9. ◴[29 Jul 25 15:38 UTC] No.44724737{4}[source]▶

>>44724436 #

10. simonw ◴[29 Jul 25 15:39 UTC] No.44724749{5}[source]▶

>>44724682 #

That's not an example of copying from an existing Space Invaders implementation. That's an LLM using a CSS animation pattern - one that it's seen thousands (probably millions) of times in the training data.

That's what I expect these things to do: they break down Space Invaders into the components they need to build, then mix and match thousands of different coding patterns (like "animation: glow 2s ease-in-out infinite;") to implement different aspects of that game.

You can see that in the "reasoning" trace here: https://gist.github.com/simonw/9f515c8e32fb791549aeb88304550... - "I'll use a modern design with smooth animations, particle effects, and a retro-futuristic aesthetic."

replies(1): >>44725637 #

11. ben_w ◴[29 Jul 25 15:44 UTC] No.44724794{3}[source]▶

>>44724424 #

Sorites paradox. Where's the distinction between "snippet" and "a design pattern"?

Compressing a few petabytes into a few gigabytes requires that they can't be like this about all of the things they're accused of simply copy-pasting, from code to newspaper articles to novels. There's not enough space.

12. falcor84 ◴[29 Jul 25 15:45 UTC] No.44724806{5}[source]▶

>>44724682 #

The parent said

> find even a few lines of code duplicated from one of those projects

I'm pretty sure they meant multiple lines copied verbatim from a single project implementing space invaders, rather than individual lines copied (or likely just accidentally identical) across different unrelated projects.

13. ben_w ◴[29 Jul 25 15:46 UTC] No.44724817{5}[source]▶

>>44724682 #

So, your example of it copying snippets is… using the same API with fairly different parameters in a different order?

14. hbn ◴[29 Jul 25 15:46 UTC] No.44724823[source]▶

>>44724300 #

The prompt was

> Write an HTML and JavaScript page implementing space invaders

It may not be "copy pasting" but it's generating output as best it can be recreated from its training on looking at Space Invaders source code.

The engineers at Taito that originally developed Space Invaders were not told "make Space Invaders" and then did their best to recall all the source code they've looked at in their life to re-type the source code to an existing game. From a logistics standpoint, where the source code already exists and is accessible, you may as well have copy-pasted it and fudged a few things around.

replies(1): >>44724876 #

15. simonw ◴[29 Jul 25 15:51 UTC] No.44724876{3}[source]▶

>>44724823 #

The source code for original Space Invaders from 1978 has never been published. The closest to that is disassembled ROMs.

I used that prompt because it's the shortest possible prompt that tells the model to build a game with a specific set of features. If I wanted to build a custom game I would have had to write a prompt that was many paragraphs longer than that.

The aim of this piece isn't "OMG looks LLMs can build space invaders" - at this point that shouldn't be a surprise to anyone. What's interesting is that my laptop can run a model that is capable of that now.

replies(4): >>44725462 #>>44725985 #>>44728033 #>>44733512 #

16. qingcharles ◴[29 Jul 25 16:00 UTC] No.44724993[source]▶

>>44723879 (TP) #

The frontier models are always going to tempt you with their higher quality and quicker generation, IMO.

replies(2): >>44725056 #>>44726682 #

17. kasey_junk ◴[29 Jul 25 16:05 UTC] No.44725056[source]▶

>>44724993 #

I’ve been mentally mapping tge models to the history of db.

Most db in the early days you had to pay for. There are still for pay db that are just better than ones you don’t pay for. Some teams think that the cost is worth the improvements and there is a (tough) business there. Fortunes were made in the early days.

But eventually open source models became good enough for many use cases and they have their own advantages. So lots of teams use them.

I think coding models might have a similar trajectory.

replies(1): >>44725139 #

18. skeezyboy ◴[29 Jul 25 16:06 UTC] No.44725070[source]▶

>>44724450 #

why are you doing anything locally then?

replies(1): >>44731841 #

19. qingcharles ◴[29 Jul 25 16:12 UTC] No.44725139{3}[source]▶

>>44725056 #

You make a good point -- a majority of applications are now using open source or free versions[1] of DBs.

My only feedback is: are these the same animal? Can we compare an O/S DB vs. paid/closed DB to me running an LLM locally? The biggest issue right now with LLMs is simply the cost of the hardware to run one locally, not the quality of the actual software (the model).

[1] e.g. SQL Server Express is good enough for a lot of tasks, and I guess would be roughly equivalent to the upcoming open versions of GPT vs. the frontier version.

replies(1): >>44725478 #

20. sarchertech ◴[29 Jul 25 16:35 UTC] No.44725462{4}[source]▶

>>44724876 #

> The source code for original Space Invaders from 1978 has never been published. The closest to that is disassembled ROMs.

Sure but that doesn’t impact the OPs point at all because there are numerous copies of reverse engineered source code available.

There are numerous copies of the reverse engineered source code already translated to JavaScript in your models training set.

21. qcnguy ◴[29 Jul 25 16:37 UTC] No.44725478{4}[source]▶

>>44725139 #

A majority of apps nowadays are using proprietary forks of open source DBs running in the cloud, where their feature set is (slightly) rounded out and smoothed off by the cloud vendors.

Not that many projects are doing fully self-hosted RDBMS at this point. So ultimately proprietary databases still win out, they just (ab)use the Postgresql trademark to make people think they're using open source.

LLMs might go the same way. The big clouds offering proprietary fine tunes of models given away by AI labs using investor money?

replies(1): >>44726028 #

22. threeducks ◴[29 Jul 25 16:49 UTC] No.44725637{6}[source]▶

>>44724749 #

I think LLMs are adapting higher level concepts. For example, the following JavaScript code generated by GLM (https://github.com/simonw/tools/blob/9e04fd9895fae1aa9ac78b8...) is clearly inspired by this C++ code (https://github.com/portapack-mayhem/mayhem-firmware/blob/28e...), but it is not an exact copy.

replies(1): >>44725675 #

23. simonw ◴[29 Jul 25 16:53 UTC] No.44725675{7}[source]▶

>>44725637 #

This is a really good spot.

That code certainly looks similar, but I have trouble imagining how else you would implement very basic collision detection between a projectile and a player object in a game of this nature.

replies(1): >>44726128 #

24. LandR ◴[29 Jul 25 17:02 UTC] No.44725778[source]▶

>>44724558 #

I'd hope most professional software engineers could do this in an afternoon or so?

replies(2): >>44725890 #>>44731877 #

25. sejje ◴[29 Jul 25 17:09 UTC] No.44725857{5}[source]▶

>>44724682 #

Is this some kind of joke?

That's how you write css. The examples aren't the same at all, they just use the same css feature.

It feels like you aren't a coder--you've sabotaged your own point.

26. sejje ◴[29 Jul 25 17:12 UTC] No.44725890{3}[source]▶

>>44725778 #

Most professional software engineers have never written a game and don't do web work, so I somehow doubt that.

replies(1): >>44726363 #

27. sharkjacobs ◴[29 Jul 25 17:14 UTC] No.44725919[source]▶

>>44724300 #

Making a space invaders game is not representative of normal engineering because you're reproducing an existing game with well known specs and requirements. There are probably hundreds of thousands of words describing and discussing Space Invaders in GLM-4.5's training data

It's like using an LLM to implement a red black tree. Red black trees are in the training data, so you don't need to explain or describe what you mean beyond naming it.

"Real engineering" with LLMs usually requires a bunch of up front work creating specs and outlines and unit tests. "Context engineering"

replies(1): >>44726416 #

28. nottorp ◴[29 Jul 25 17:19 UTC] No.44725985{4}[source]▶

>>44724876 #

> What's interesting is that my laptop can run a model that is capable of that now.

I'm afraid no one cared much about your point :)

You'll only get "OMG look how good LLMs are they'll get us all fired!" comments and "LLMs suck" comments.

This is how it goes with religion...

29. qingcharles ◴[29 Jul 25 17:23 UTC] No.44726028{5}[source]▶

>>44725478 #

That's definitely true. I could see more of the running open source models on other people's hardware model.

I dislike running local LLMs right now because I find the software kinda janky still, you often have to tweak settings, find the right model files. Basically have a bunch of domain knowledge I don't have space for in my head. On top of maintaining a high-spec piece of hardware and paying for the power costs.

30. threeducks ◴[29 Jul 25 17:32 UTC] No.44726128{8}[source]▶

>>44725675 #

A human would likely have refactored the two collision checks between bullet/enemy and enemyBullet/player in the JavaScript code into its own function, perhaps something like "areRectanglesOverlapping". The C++ code only does one collision check like that, so it has not been refactored there, but as a human, I certainly would not want to write that twice.

More importantly, it is not just the collision check that is similar. Almost the entire sequence of operations is identical on a higher level:

    1. enemyBullet/player collision check
    2. same comment "// Player hit!" (this is how I found the code)
    3. remove enemy bullet from array
    4. decrement lives
    5. update lives UI
    6. (createParticle only exists in JS code)
    7. if lives are <= 0, gameOver

31. anthk ◴[29 Jul 25 17:53 UTC] No.44726363{4}[source]▶

>>44725890 #

With TCL/TK it's a matter of less than 2 hours.

32. jasonvorhe ◴[29 Jul 25 17:57 UTC] No.44726416{3}[source]▶

>>44725919 #

Smells like moving the goal post. What's real engineering to be in 2028? Implementing Google's infra stack in your homelab?

33. zarzavat ◴[29 Jul 25 18:21 UTC] No.44726682[source]▶

>>44724993 #

Closed doesn't always win over open. People said the same thing about Windows vs Linux, but even Microsoft was forced to admit defeat and support Linux.

All it takes is some large companies commoditizing their complements. For Linux it was Google, etc. For AI it's Meta and China.

The only thing keeping Anthropic in business is geopolitics. If China were allowed full access to GPUs, they would probably die.

replies(1): >>44732293 #

34. hbn ◴[29 Jul 25 20:41 UTC] No.44728033{4}[source]▶

>>44724876 #

The discussion I replied to was just regarding whether or not what the LLM did should be considered "engineering"

It doesn't really matter whether or not the original code was published. In fact that original source code on its own probably wouldn't be that useful, since I imagine it wouldn't have tipped the weights enough to be "recallable" from the model, not to mention it was tasked with implementing it in web technologies.

35. rafaelmn ◴[30 Jul 25 07:58 UTC] No.44731841{3}[source]▶

>>44725070 #

Latency and tooling support ? UX of cloud based LLM vs local is much better for the cloud option - not so much for dev tooling.

I tried using remote workstations - I am not a fan of lugging a beefy client machine to do my work - would much rather use something thats super light and power efficient.

36. Mashimo ◴[30 Jul 25 08:04 UTC] No.44731877{3}[source]▶

>>44725778 #

Depends on the rules. Can I look up other space invaders games on github first? Can I use a game framework?

Just JS / HTML docs I probably could not.

replies(1): >>44732544 #

37. airspresso ◴[30 Jul 25 09:20 UTC] No.44732293{3}[source]▶

>>44726682 #

> The only thing keeping Anthropic in business is geopolitics. If China were allowed full access to GPUs, they would probably die.

Disagree. Anthropic have a unique approach to how they post-train their models and tune it to be the way they want it. No other lab has managed to reproduce the style and personality of Claude yet, which is currently a key reason why coders prefer it. And since post-training data is secret, it'll take other providers a lot of focused effort to get close to that.

38. pharrington ◴[30 Jul 25 10:15 UTC] No.44732544{4}[source]▶

>>44731877 #

No preexisting framework.

39. dfedbeef ◴[30 Jul 25 12:46 UTC] No.44733512{4}[source]▶

>>44724876 #

I have good news about how games were programmed in the 70's. What if I told you the disassembled ROM is the code.

↑