I think the benchmark test for these programming agents that I would like to see an Agent making a flawless PR or patch to the BSD / Linux kernel.
This should be possible today and surely Linus would also see this in the future.
replies(1):
This should be possible today and surely Linus would also see this in the future.