This seems like a near-perfect use of coding LLMs and a useful way to implement reinforcement learning.
“Add a major bug to this file that is not covered by existing tests” vs. “Find the bug in this file” vs. “Write a sensible test in this file that protects against this type of bug.”
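To make the reward side of that concrete, here is a rough sketch of how one round could be scored. Everything in it is an assumption for illustration: the names `score_episode`, `passes`, `clamp`, and `clamp_buggy` are hypothetical, the "file" is a single function, and the model calls are left out entirely, so only the checkable outcomes are shown.

```python
from typing import Callable, Dict, List

Impl = Callable[..., object]
Test = Callable[[Impl], None]  # a test raises AssertionError on failure


def passes(test: Test, impl: Impl) -> bool:
    """Run one test against one implementation."""
    try:
        test(impl)
        return True
    except AssertionError:
        return False


def score_episode(
    original: Impl,
    buggy: Impl,
    existing_tests: List[Test],
    new_test: Test,
    finder_correct: bool,
) -> Dict[str, float]:
    """Hypothetical 0/1 rewards for the three roles in one round."""
    # Bug injector: rewarded when the existing suite still passes on the buggy code.
    hidden = all(passes(t, buggy) for t in existing_tests)
    # Test writer: rewarded when the new test passes on the original and fails on the buggy code.
    catches = passes(new_test, original) and not passes(new_test, buggy)
    return {
        "injector": 1.0 if hidden else 0.0,
        "finder": 1.0 if finder_correct else 0.0,  # assumed to be judged elsewhere
        "test_writer": 1.0 if catches else 0.0,
    }


# Toy example: an off-by-one bug at the upper bound of a clamp function.
def clamp(x, lo, hi):
    return max(lo, min(x, hi))


def clamp_buggy(x, lo, hi):
    return max(lo, min(x, hi - 1))


def existing_test(impl):
    assert impl(5, 0, 10) == 5  # does not exercise the upper bound


def new_test(impl):
    assert impl(15, 0, 10) == 10  # exercises the upper bound


rewards = score_episode(clamp, clamp_buggy, [existing_test], new_test, finder_correct=True)
```

One gap in this sketch: the injector is also rewarded for a change that does nothing, so a real setup would need some check that the mutation actually alters behaviour.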
replies(1):