Andrej Karpathy: Software in the era of AI [video]

1. blobbers ◴[19 Jun 25 07:27 UTC] No.44316344[source]▶

Software 3.0 is the code generated by the machine, not the prompts that generated it. The prompts don't even yield the same output; there is randomness.

The new software world is the massive amount of code that will be burped out by these agents, and it should quickly dwarf the human output.

replies(4): >>44316389 #>>44318479 #>>44318855 #>>44322374 #

2. pelagicAustral ◴[19 Jun 25 07:39 UTC] No.44316389[source]▶

>>44316344 (TP) #

I think that if you give the same task to three different developers you'll get three different implementations. It's not a random result if you do get the functionality that was expected, and at that, I do think the prompt plays an important role in offering a view of how the result was achieved.

replies(2): >>44317798 #>>44329445 #

3. klabb3 ◴[19 Jun 25 11:56 UTC] No.44317798[source]▶

>>44316389 #

> I think that if you give the same task to three different developers you'll get three different implementations.

Yes, but if you want them to be compatible you need to define a protocol and conformance test suite. This is way more work than writing a single implementation.

The code is the real spec. Every piece of unintentional non-determinism can be a hazard. That’s why you want the code to be the unit of maintenance, not a prompt.

replies(1): >>44318372 #

4. imiric ◴[19 Jun 25 13:08 UTC] No.44318372{3}[source]▶

>>44317798 #

I know! Let's encode the spec into a format that doesn't have the ambiguities of natural language.

replies(1): >>44319386 #

5. tamersalama ◴[19 Jun 25 13:23 UTC] No.44318479[source]▶

>>44316344 (TP) #

How I understood it is that natural language will form relatively large portions of stacks (endpoint descriptions, instructions, prompts, documentations, etc…). In addition to code generated by agents (which would fall under 1.0)

6. poorcedural ◴[19 Jun 25 14:11 UTC] No.44318855[source]▶

>>44316344 (TP) #

It is not the code, which just like prompts is a written language. Software 3.0 will be branches of behaviors, by the software and by the users all documented in a feedback loop. The best behaviors will be merged by users and the best will become the new HEAD. Underneath it all will be machine code for the hardware, but it will be the results that dictate progress.

7. klabb3 ◴[19 Jun 25 15:10 UTC] No.44319386{4}[source]▶

>>44318372 #

Right. Great idea. Maybe call it ”formal execution spec for LLM reference” or something. It could even be versioned in some kind of distributed merkle tree.

8. fritzo ◴[19 Jun 25 20:44 UTC] No.44322374[source]▶

>>44316344 (TP) #

Code is read much more often than it is written. Code generated by the machine today will be prompt read by the machine going forward. It's a closed loop.

Software is a world in motion. Software 1.0 was animated by developers pushing it around. Software 3.0 is additionally animated by AI agents.

9. blobbers ◴[20 Jun 25 16:34 UTC] No.44329445[source]▶

>>44316389 #

Interestingly, I was generating some scraping code today. I prompted it fairly generically and it decided to spit out some Selenium code. I reported the stack trace failure, and it gave me a new script with playwright. That failed also and it gave me some suggestions to fix it. I asked it to update the whole script rather than snippets, and it responded with "Hey let's not use either of these and here we'll use the site's API." and proceeded to do that.

Kind of crazy, it basically found 3 different hammers to hit the nail I wanted. The API unfortunately seems to be timeing out (I had to add the timeout=10 to the post u_u)