←back to thread

265 points ctoth | 2 comments | | HN request time: 0s | source
Show context
boznz ◴[] No.43746571[source]
I'm surprised nobody mentioned the video interview. I only watched the first 60 seconds and this is the first time I have seen or heard the author, but if I hadn't been told this was AI generated I would have assumed it was genuine and any 'twitching' was the result of the video compression.
replies(3): >>43746580 #>>43746982 #>>43747086 #
smusamashah ◴[] No.43746982[source]
How???? I can believe the guy in the video being AI because his lips are not perfectly synced. But the woman? Even with continuous silly exaggerated movement I have hard time believing its generated.

A strand of her hair fell on her shoulder, because she was moving continuously (like crazy) it was moving too in a perfectly believable way, and IT EVENTUALLY FELL OFF THE SHOULDER/SHIRT LIKE REAL HAIR and got mixed into other fallen hair. How is that generated? It's too small detail. Are there any artifacts on her side?

Edit: she has to be real. Her lip movements are definitely forced/edited though. It has to be a video recording of her talking. And then a tool/AI has modified her lips to match the voice. If you look at her face and hand movements, her shut lips seem forced.

replies(1): >>43747354 #
1. furyofantares ◴[] No.43747354[source]
> Edit: she has to be real

Nah, having used HeyGen a bit, it's extremely clearly a HeyGen generation. There's a small number of movements and expressions that it continually uses (in forward and reverse).

Edit: I mean, to be clear, it is a real person, just like the author's video is. The way HeyGen works is you record a short clip of you saying some stuff and then you can generate long videos like these of you saying whatever you want. So the stuff you noticed does come from a real video of her, but it's not a real video that's lightly edited by AI, more like the AI has a bunch of clips it can continually mesh together while fixing up the mouth to continually generate video.

replies(1): >>43754937 #
2. smusamashah ◴[] No.43754937[source]
If its a bunch of clips meshed together, the AI isn't doing a good job of meshing in the silence pose. I looked at HeyGen site, these are probably called interactive avatars. A woman's throat is moving (as well as her hands) as if she is talking, but her lips as shut. Whatever they are doing, they are not handling silence/listening very well.