On related note a very good open source TTS model was released 2 days back:
https://github.com/SWivid/F5-TTSVery good voice cloning capability. Runs under 10G vram nvidia gpu.
Thanks! Would "under 10G" also include 8 GB, by any chance? Although I do die inside a little every time I see "install Torch for your CUDA version", because I never managed to get that working in Linux.
Try out PopOS. They make it really easy. Though it’s named Tensorman it helps with Torch as well.
https://support.system76.com/articles/tensorman/
Thanks, but I don't think I'm going to reinstall my entire OS to run these. I'll see if I can get Docker working, it's been more reliable with CUDA for me.
I haven't tried it, but I notice that it's also in nixpkgs:
https://search.nixos.org/packages?channel=24.05&show=tensorm... That might be a less invasive way to use it, though you'd still have to install nix.