←back to thread

652 points toebee | 2 comments | | HN request time: 0.411s | source
1. xhkkffbf ◴[] No.43756773[source]
Are there different voices? Or only [s1] and [s2] in the examples?
replies(1): >>43758096 #
2. toebee ◴[] No.43758096[source]
We just clarified in the README, sorry for the confusion ;(

Note that the model was not fine-tuned on a specific voice. Hence, you will get different voices every time you run the model. You can keep speaker consistency by either adding an audio prompt (a guide coming VERY soon - try it with the second example on Gradio or HF Space for now), or fixing the seed.