How to build a coding agent

(ghuntley.com)

Show context

ofirpress ◴[24 Aug 25 03:55 UTC] No.45001234[source]▶

We (the Princeton SWE-bench team) built an agent in ~100 lines of code that does pretty well on SWE-bench, you might enjoy it too: https://github.com/SWE-agent/mini-swe-agent

replies(7): >>45001287 #>>45001548 #>>45001716 #>>45001737 #>>45002061 #>>45002110 #>>45009789 #

1. meander_water ◴[24 Aug 25 05:51 UTC] No.45001737[source]▶

>>45001234 #

> 1. Analyze the codebase by finding and reading relevant files 2. Create a script to reproduce the issue 3. Edit the source code to resolve the issue 4. Verify your fix works by running your script again 5. Test edge cases to ensure your fix is robust

This prompt snippet from your instance template is quite useful. I use something like this for getting out of debug loops:

> Analyse the codebase and brainstorm a list of potential root causes for the issue, and rank them from most likely to least likely.

Then create scripts or add debug logging to confirm whether your hypothesis is correct. Rule out root causes from most likely to least by executing your scripts and observing the output in order of likelihood.

replies(1): >>45006960 #

2. afro88 ◴[24 Aug 25 19:30 UTC] No.45006960[source]▶

>>45001737 (TP) #

Does this mean it's only useful for issue fixes?

replies(1): >>45008077 #

3. regularfry ◴[24 Aug 25 21:43 UTC] No.45008077[source]▶

>>45006960 #

A feature is just an issue. The issue is that the feature isn't complete yet.

replies(1): >>45012539 #

4. afro88 ◴[25 Aug 25 11:04 UTC] No.45012539{3}[source]▶

>>45008077 #

> 2. Create a script to reproduce the issue