On a related note, a very good open-source TTS model was released two days ago: https://github.com/SWivid/F5-TTS
It has very good voice cloning capability and runs in under 10 GB of VRAM on an NVIDIA GPU.