←back to thread

261 points david927 | 1 comments | | HN request time: 0.208s | source

What are you working on? Any new ideas that you're thinking about?
1. robviren ◴[] No.43160037[source]
I've been trying to use genetic algorithms to evolve voice style tensors for Kokoro-82M TTS. My current main barrier is that the scoring function is powered by resemblyzer and whatever it is using to compare the audio data has limitations. The generated tensors over fit and make garbage sounding audio that scores high, but doesn't sound like voice. Considering alternate methods of scoring.