←back to thread

High-fidelity simultaneous speech-to-speech translation

(arxiv.org)

112 points Bluestein | 5 comments | 03 Jul 25 20:27 UTC | HN request time: 0.912s | source

Show context

cs702 ◴[03 Jul 25 22:05 UTC] No.44459551[source]▶

>>44458877 (OP) #

Nice. I'm impressed.

Translator jobs are going to go poof! overnight.

Just sayin'.

replies(2): >>44459991 #>>44460001 #

1. desultir ◴[03 Jul 25 23:35 UTC] No.44460001[source]▶

Translators sure, interpreters no.

Interpreters also have to factor in cultural context and customs, ensuring that meaning is conveyed without offence being given in formal contexts.

replies(2): >>44460339 #>>44460346 #

2. esafak ◴[04 Jul 25 01:01 UTC] No.44460339[source]▶

>>44460001 (TP) #

I don't see why software couldn't do that, if you give them the context.

replies(1): >>44461948 #

3. cortesoft ◴[04 Jul 25 01:04 UTC] No.44460346[source]▶

>>44460001 (TP) #

That seems like something LLMs could eventually get good at

replies(1): >>44463515 #

4. yorwba ◴[04 Jul 25 07:15 UTC] No.44461948[source]▶

The end-user is unlikely to know which part of the context is relevant, and it may also change from moment to moment depending on who is speaking to whom. Of course you could imagine an AI interpreter that has cameras for situational awareness and asks for clarification if anything important is unclear while smoothing over minor stuff without interrupting, but you could equally easily imagine an AGI, so it's not clear that this could be built to a reasonable quality standard with current technology.

5. nottorp ◴[04 Jul 25 11:21 UTC] No.44463515[source]▶

They'll just push everyone to use corporate wooden language and then they won't have to worry about tone and implied meanings :)