(simonwillison.net)

76 points FromTheArchives | 2 comments | 22 Oct 25 12:36 UTC

matthewdgreen ◴[23 Oct 25 01:10 UTC] No.45677089[source]▶

So let me get this straight. You’re writing tens of thousands of lines of code that will presumably go into a public GitHub repository and/or be served from some location. Even if it only runs locally on your own machine, at some point you’ll presumably give that code network access. And that code is being developed (without much review) by an agent that, in our threat model, has been fully subverted by prompt injection?

Sandboxing the agent hardly seems like a sufficient defense here.

replies(2): >>45677537 #>>45684527 #

1. simonw ◴[23 Oct 25 02:31 UTC] No.45677537[source]▶

>>45677089 #

What is your worst case scenario from this?

replies(1): >>45682120 #

2. noitpmeder ◴[23 Oct 25 14:20 UTC] No.45682120[source]▶

>>45677537 (TP) #

Bank accounts drained, ransomware installed, ...

Living Dangerously with Claude