261 points david927 | 1 comments | 23 Feb 25 23:00 UTC

What are you working on? Any new ideas that you're thinking about?

1. robviren ◴[24 Feb 25 14:37 UTC] No.43160037

I've been trying to use genetic algorithms to evolve voice style tensors for Kokoro-82M TTS. My main barrier right now is the scoring function: it's powered by resemblyzer, and whatever it uses to compare the audio data has limitations. The generated tensors overfit and produce garbage audio that scores high but doesn't sound like a voice. I'm considering alternative methods of scoring.
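For illustration only, here is a toy sketch of one possible alternative: keep the similarity score, but subtract a penalty for drifting away from a known-good voice tensor, so the search can't wander into regions that score high but no longer sound like speech. This is not my actual setup. Plain lists stand in for style tensors and embeddings, `cosine` stands in for whatever the speaker encoder computes, and `anchor`, `penalty_weight`, and `evolve` are hypothetical names made up for the example.

```python
import math
import random


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    return dot / (na * nb) if na and nb else 0.0


def distance(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def fitness(candidate, target, anchor, penalty_weight=0.1):
    # Assumption: reward similarity to the target, but penalize drifting
    # far from a known-good voice (the anchor) to discourage overfitting.
    return cosine(candidate, target) - penalty_weight * distance(candidate, anchor)


def evolve(target, anchor, pop_size=40, generations=60, sigma=0.1, seed=0):
    rng = random.Random(seed)
    # Start from the anchor itself plus Gaussian perturbations of it.
    population = [list(anchor)] + [
        [a + rng.gauss(0, sigma) for a in anchor] for _ in range(pop_size - 1)
    ]
    for _ in range(generations):
        population.sort(key=lambda c: fitness(c, target, anchor), reverse=True)
        elite = population[: pop_size // 4]  # elitism: top quarter survives
        children = []
        while len(elite) + len(children) < pop_size:
            p1, p2 = rng.sample(elite, 2)
            # Uniform crossover followed by Gaussian mutation.
            child = [
                (x if rng.random() < 0.5 else y) + rng.gauss(0, sigma)
                for x, y in zip(p1, p2)
            ]
            children.append(child)
        population = elite + children
    return max(population, key=lambda c: fitness(c, target, anchor))
```

Calling `evolve(target, anchor)` with two vectors of the same length returns the best candidate found. In a real run the fitness would have to synthesize audio and embed it, and `penalty_weight` would need tuning by ear, so treat this as a shape for the idea rather than a fix.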


Ask HN: What are you working on? (February 2025)