
448 points lastdong | 1 comments
1. cush ◴[] No.45117583[source]
To me this is like early generative AI art, where the images came out very "smooth" and visually buttery, except here it's the voices that come out with no timbre. Intonation issues aside, these models could use a touch of vocal fry and some body to be more believable.