←back to thread

257 points amrrs | 1 comments | | HN request time: 0s | source
Show context
mlboss ◴[] No.41843344[source]
On related note a very good open source TTS model was released 2 days back: https://github.com/SWivid/F5-TTS

Very good voice cloning capability. Runs under 10G vram nvidia gpu.

replies(1): >>41843634 #
stavros ◴[] No.41843634[source]
Thanks! Would "under 10G" also include 8 GB, by any chance? Although I do die inside a little every time I see "install Torch for your CUDA version", because I never managed to get that working in Linux.
replies(3): >>41843916 #>>41844115 #>>41845127 #
1. mlboss ◴[] No.41843916[source]
I bought a 10 Tb drive just for these kind of experiments