←back to thread

257 points amrrs | 1 comments | | HN request time: 0s | source
Show context
mlboss ◴[] No.41843344[source]
On related note a very good open source TTS model was released 2 days back: https://github.com/SWivid/F5-TTS

Very good voice cloning capability. Runs under 10G vram nvidia gpu.

replies(1): >>41843634 #
stavros ◴[] No.41843634[source]
Thanks! Would "under 10G" also include 8 GB, by any chance? Although I do die inside a little every time I see "install Torch for your CUDA version", because I never managed to get that working in Linux.
replies(3): >>41843916 #>>41844115 #>>41845127 #
linotype ◴[] No.41844115[source]
Try out PopOS. They make it really easy. Though it’s named Tensorman it helps with Torch as well.

https://support.system76.com/articles/tensorman/

replies(1): >>41844746 #
stavros ◴[] No.41844746[source]
Thanks, but I don't think I'm going to reinstall my entire OS to run these. I'll see if I can get Docker working, it's been more reliable with CUDA for me.
replies(1): >>41845003 #
__MatrixMan__ ◴[] No.41845003[source]
I haven't tried it, but I notice that it's also in nixpkgs: https://search.nixos.org/packages?channel=24.05&show=tensorm... That might be a less invasive way to use it, though you'd still have to install nix.
replies(1): >>41846680 #
1. stavros ◴[] No.41846680{3}[source]
That's easier, thank you!