Question 1

Does the transcriber support multiple speakers?

Accepted Answer

Yes. When a HuggingFace token is configured (required for diarization), the AI identifies and labels each speaker in the conversation. Each segment in the transcript will be prefixed with SPEAKER_00, SPEAKER_01, etc.

Question 2

What audio formats are supported?

Accepted Answer

Meeting Transcriber accepts MP3, WAV, M4A, and OGG files up to 20MB. For best accuracy, use recordings with minimal background noise and clear audio levels.

Question 3

What languages are supported?

Accepted Answer

WhisperX supports over 90 languages and automatically detects the language from the audio. You can also manually specify a language code (e.g. en, es, fr) for faster processing.

Question 4

What is the .srt file format used for?

Accepted Answer

SRT (SubRip Subtitle) is the standard format for video subtitles. It contains numbered entries with start/end timestamps and text. You can import .srt files into video editors like DaVinci Resolve, Premiere Pro, or CapCut to add captions to your videos.

Question 5

How accurate is the transcription?

Accepted Answer

WhisperX achieves near-human accuracy on clean audio recordings. Accuracy decreases with heavy accents, overlapping speech, or low-quality recordings with background noise. The output is editable so you can correct any errors.

Meeting Transcriber

Meeting Transcriber

How it works

Use cases

Frequently asked questions

Related AI Tools