←back to thread

566 points PaulHoule | 1 comments | | HN request time: 0.481s | source
Show context
mynti ◴[] No.44489785[source]
is there a kind of nanogpt for diffusion language models? i would love to understand them better
replies(1): >>44490381 #
1. nvtop ◴[] No.44490381[source]
This video has a live coding part which implements a masked diffusion generation process: https://www.youtube.com/watch?v=oot4O9wMohw