It'd be curious to see how those AI generated kernels compare to kernels generated by https://github.com/tinygrad/tinygrad
replies(1):
GeoHot didn't want to make it only FlashAttention specific, he worked on FlashAttenrion being automatically generated by the optimizer. It's going well