As a visual aid to follow the speech, live audio is transcribed into text through AI-powered Automated Speech Recognition (ASR), or often also referred to as "speech-to-text" technology.
Interprefy Captions are generated off the audio speech of each speaker (and interpreter, if active) using Automated Speech Recognition (ASR) technology powered by Artificial Intelligence (AI).
This technology combination uses speech-to-text processing technology to provide text directly from the words being spoken. Just like interpretation, captions will follow as live transcription slightly after the speaker has delivered their words.
In the diagram below, you can see how ASR works, when you have an English speaker, and a Spanish interpreter connected:
