(qwen.ai)

314 points pretext | 1 comments | 10 Dec 25 16:13 UTC

sosodev ◴[10 Dec 25 16:55 UTC] No.46220123[source]▶

>>46219538 (OP) #

Does Qwen3-Omni support real-time conversation like GPT-4o? Looking at their documentation, it doesn't seem to.

Are there any open-weight models that do? I'm not talking about a speech-to-text -> LLM -> text-to-speech pipeline, btw; I mean a real voice <-> language model.

edit:

It does support real-time conversation! Has anybody here gotten that to work on local hardware? I'm particularly curious whether anybody has run it on a non-NVIDIA setup.

replies(4): >>46220228 #>>46222544 #>>46223129 #>>46224919 #

dsrtslnd23 ◴[10 Dec 25 17:01 UTC] No.46220228[source]▶

>>46220123 #

It seems to be able to do native speech-to-speech.

replies(1): >>46220381 #

1. sosodev ◴[10 Dec 25 17:12 UTC] No.46220381[source]▶

>>46220228 #

It does for sure. I did some more digging and it does real-time too. That's fascinating.

Qwen3-Omni-Flash-2025-12-01: a next-generation native multimodal large model