Real-time translation

Translating speech as it is spoken, with only a sub-second delay, so a conversation flows naturally.

Real-time translation converts speech from one language to another while a person is still talking, streaming the result with only a fraction of a second of delay. Unlike batch translation, where you submit text and wait, real-time systems process audio continuously so both sides can hold a normal back-and-forth conversation.

It chains three steps, speech recognition, machine translation, and speech synthesis, and optimises each for latency rather than waiting for a full sentence. SimulSpeak targets sub-second added latency so translated calls feel like ordinary calls.

How real-time voice translation works (and why latency matters)
How to run multilingual business calls without an interpreter

Talk to anyone, in any language

Real-time translated calls in your own voice, across 24 languages.

View pricing

Real-time translation

Related terms

Read more