
469 points ghuntley | 1 comment
ofirpress ◴[] No.45001234[source]
We (the Princeton SWE-bench team) built an agent in ~100 lines of code that does pretty well on SWE-bench, you might enjoy it too: https://github.com/SWE-agent/mini-swe-agent
replies(7): >>45001287 #>>45001548 #>>45001716 #>>45001737 #>>45002061 #>>45002110 #>>45009789 #
simonw ◴[] No.45001548[source]
OK that really is pretty simple, thanks for sharing.

The whole thing runs on these prompts: https://github.com/SWE-agent/mini-swe-agent/blob/7e125e5dd49...

  Your task: {{task}}. Please reply
  with a single shell command in
  triple backticks.
  
  To finish, the first line of the
  output of the shell command must be
  'COMPLETE_TASK_AND_SUBMIT_FINAL_OUTPUT'.
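Those two prompts imply the whole control flow: ask the model for one shell command in triple backticks, run it, feed the output back, and stop when the output's first line is the finish marker. A minimal sketch of that loop, assuming a hypothetical `query_llm` callable standing in for a real model call (this is not the mini-swe-agent code itself):

```python
import re
import subprocess

FINISH_MARKER = "COMPLETE_TASK_AND_SUBMIT_FINAL_OUTPUT"

def extract_command(reply: str) -> str:
    """Pull the single shell command out of the model's triple-backtick block."""
    match = re.search(r"```(?:\w+\n)?(.*?)```", reply, re.DOTALL)
    if match is None:
        raise ValueError("model reply contained no triple-backtick block")
    return match.group(1).strip()

def run_agent(task: str, query_llm, max_steps: int = 10) -> str:
    """Loop: request one shell command, execute it, append the output to the
    conversation, and finish when the output's first line is the marker."""
    messages = [
        {"role": "system",
         "content": "You are a helpful assistant that can do anything."},
        {"role": "user",
         "content": f"Your task: {task}. Please reply with a single "
                    f"shell command in triple backticks."},
    ]
    for _ in range(max_steps):
        reply = query_llm(messages)
        command = extract_command(reply)
        result = subprocess.run(command, shell=True,
                                capture_output=True, text=True)
        output = result.stdout + result.stderr
        lines = output.splitlines()
        if lines and lines[0] == FINISH_MARKER:
            return output  # the final answer follows the marker line
        messages.append({"role": "assistant", "content": reply})
        messages.append({"role": "user", "content": f"Output:\n{output}"})
    raise RuntimeError("agent did not finish within max_steps")
```

The appeal is that the environment interface is just a shell: there is no tool schema, only text in and one command out per turn.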
replies(3): >>45002285 #>>45002729 #>>45003054 #
nivertech ◴[] No.45002285[source]

  system_template: str = "You are a helpful assistant that can do anything."
anything? Sounds like an AI Safety issue ;)
replies(1): >>45004257 #
greleic ◴[] No.45004257[source]
You’d be surprised how much time is wasted because LLMs “think” they can’t do something. You’d be less surprised that, when they do attempt it, they often choose some plainly ignorant path that cannot work.

There are theoretically impossible things to do, if you buy into only the basics. If you open your mind, anything is achievable; you just need to break out of the box you’re in.

If enough people keep feeding in that we need a time machine, the revolution will play out in all the timelines. Without it, Sarah Connor is lost.

replies(1): >>45008927 #
curvaturearth ◴[] No.45008927[source]
I'm already surprised by the number of things they think they can do but can't.