←back to thread

105 points todsacerdoti | 3 comments | | HN request time: 0.599s | source
1. rao-v ◴[] No.43674120[source]
This seems like a near perfect use of coding LLMs and a useful way to implement reinforcement learning.

“Add a major bug to this file that is not covered by existing tests” vs “Find the bug in this file” vs “write a sensible test in this file that protects against this type of bug”

replies(1): >>43675069 #
2. infogulch ◴[] No.43675069[source]
Somehow that reminds me of how diffusion models are trained.
replies(1): >>43679175 #
3. acheong08 ◴[] No.43679175[source]
I'm pretty sure that's the premise for GANs (generative adversarial networks) rather than diffusion. Diffusion is more about noise reduction than pitting models against each other