<img src="https://ws.zoominfo.com/pixel/ODemgiDEhQshzjvCQ1qL" width="1" height="1" style="display: none;">

Automatic live captioning for meetings that matter.

Remove communication barriers in your meetings and events with highly accurate live captions, automatically generated in real-time. We combine the leading speech-to-text technology with professional glossary curation to create results that go beyond the standard.

Interprefy banner_automatic captioning-min

Artificial Intelligence with a human touch

Automatic Speech Recognition (ASR) is a powerful way to automatically transcribe your meeting speech and provide your audience with live closed captioning. But we go one step further, by adding an additional layer of professional glossary customization to make sure the meeting-specific key terminology that matters is captured accurately.

Interprefy_visual_automatic captions_shadow-min

Unlock the full potential of live captioning

Custom Glossaries

Our custom glossaries – created by a highly skilled team of linguists – prepare the ASR engine to capture key names, abbreviations, and phrases, as well as pronunciations that are relevant to your session and your particular subject matter.

The glossaries increase accuracy for particularly challenging terms and apply customer-specific standards, and profanity filtering requirements to create the best ASR captions possible.

Human Curation

A team of captioning and event accessibility experts supports you in compiling names, terms, phrases, spellings, and pronunciations relevant to your session and feeds them into your very own custom glossary and captioning filter.

30+ Languages Supported

Our speech-to-text engines support over 30 languages, including the world's most spoken and beyond.

List of supported languages

Enjoy end-to-end captioning services

From set-up to monitoring and reporting, we help you create a more engaging, memorable, inclusive and accessible event experience for everyone.


Deliver captions anywhere

We integrate with over 70 of the leading online meeting platforms and provide a range of streaming options to enable language access anywhere - on-site and online. 


Transcribe interpretation

If you have interpreters in the meeting, easily transcribe their speech into live text.


Adjust font size

Users can easily choose the size of the captions for a customized reading experience. 


Download transcripts

Download your automatic transcripts after the meeting.

crissilia and andres-min

Want to learn more?

Book a 15-minute introduction meeting with us today to learn more about how we can help you add automatic captions and translated subtitles to your upcoming meetings.

Frequently Asked Questions

Find more answers in our knowledge base

How does Automatic Speech Recognition (ASR) work?

As a visual aid to follow the speech, live audio is transcribed into text through AI-powered Automated Speech Recognition (ASR), or often also referred to as "speech-to-text" technology.

Interprefy Captions are generated off the audio speech of each speaker (and interpreter, if active) using Automated Speech Recognition (ASR) technology powered by Artificial Intelligence (AI).

This technology combination uses speech-to-text processing technology to provide text directly from the words being spoken. Just like interpretation, captions will follow as live transcription slightly after the speaker has delivered their words.

In the diagram below, you can see how ASR works, when you have an English speaker, and a Spanish interpreter connected:

How does automatic captioning work


How accurate are Interprefy Captions?

We only support languages that have been tested thoroughly in collaboration with a linguistic partner and meet rigorous quality standards. Our system uses human-curated custom glossaries to add a layer of human refinement to raw ASR output, leading to better accuracy for you and your audience. Audio input factors such as heavy accents or low audio quality can impact captioning accuracy.

What are custom glossaries and how do they work?

Custom glossaries are lists of terms and phrases specific to your sessions, that our project team uses to guide an ASR engine, so it produces the words correctly when it ‘hears’ them. After collecting and compiling key names and terms, professional linguists use their in-depth language knowledge to program phonetic pronunciations into the system. This process makes the automatic live captions more accurate.

How does the speed of Interprefy Captions compare to human-generated live captions?

Interprefy Captions have a similar or even shorter time delay than human-generated live captions. The delay between the audio and the Interprefy Captions is usually around two to four seconds, depending on the preferred setting, whereas human-generated captions are usually delayed by around four to seven seconds.

Interprefy Captions can be enabled in two different modes. By default, the text will appear within 4 seconds of the speaker having completed a sentence. If 'instant mode' is activated, the text will appear in real-time with rapid auto-correction.

How is Interprefy's ASR solution different from others?

We are a language technology company with a strong belief that humans and machines working together lead to outstanding results. We use best-of-breed ASR technology and continuously benchmark existing and new ASR systems to offer customers solutions that are best suited for their needs. Our system utilizes linguistic expertise to improve these engines and guide them to capture customer-specific terms and phrases that are typically difficult to capture.

What's more, we provide end-to-end services to support you at every step of the way. To make sure the process and delivery are as smooth as possible.

Can Interprefy Captions be combined with simultaneous interpretation?

Absolutely. We can deliver simultaneous interpretation alongside live captions, and also turn audio interpretation into text for your audience to follow.

Which languages does Interprefy's ASR solution support?

We can automatically recognize and transcribe speech in over 30 languages. Click here for the latest list of supported languages.