←back to thread

314 points pretext | 1 comments | | HN request time: 0.307s | source
Show context
binsquare ◴[] No.46220168[source]
Does anyone else find that there's hard to pin down reason of life-lessness in the speech of these voice models?

Especially in the fruit pricing portion of the video for this model. Sounds completely normal but I can immediately tell it is ai. Maybe it's intonation or the overly stable rate of speech?

replies(5): >>46220275 #>>46220301 #>>46220340 #>>46220359 #>>46222792 #
1. vessenes ◴[] No.46222792[source]
I'm not convinced its end-to-end multimodal - in that case, you'll have a speech synthesis section and this will be some of the result. You could test by having it sing or do some accents, or have it talk back to you in an accent you give it.