←back to thread

2127 points bakugo | 1 comments | | HN request time: 0.364s | source
Show context
anotherpaulg ◴[] No.43164684[source]
Claude 3.7 Sonnet scored 60.4% on the aider polyglot leaderboard [0], WITHOUT USING THINKING.

Tied for 3rd place with o3-mini-high. Sonnet 3.7 has the highest non-thinking score, taking that title from Sonnet 3.5.

Aider 0.75.0 is out with support for 3.7 Sonnet [1].

Thinking support and thinking benchmark results coming soon.

[0] https://aider.chat/docs/leaderboards/

[1] https://aider.chat/HISTORY.html#aider-v0750

replies(18): >>43164827 #>>43165382 #>>43165504 #>>43165555 #>>43165786 #>>43166186 #>>43166253 #>>43166387 #>>43166478 #>>43166688 #>>43166754 #>>43166976 #>>43167970 #>>43170020 #>>43172076 #>>43173004 #>>43173088 #>>43176914 #
bearjaws ◴[] No.43164827[source]
Thanks for all the work on aider, my favorite AI tool.
replies(1): >>43166454 #
bt1a ◴[] No.43166454[source]
It really is best in slot. Owe it to git, which has a particular synergy with a hallucination-prone but correctable system
replies(1): >>43167992 #
doctoboggan ◴[] No.43167992[source]
I like Aider but I've turned off auto-commit. I just can't seem to let the AI actually commit code for me. Do you regularly let Aider commit for you? How much do you review the code written by it?
replies(4): >>43168124 #>>43168129 #>>43168132 #>>43168166 #
1. itgoon ◴[] No.43168132[source]
I create a feature branch, do the work and let it commit. I check the code as I go. If I don't like it, then I revert to a previous commit. Other times I write some code that it isn't getting right for whatever reason.

When it's ready, I squash merge into main.