Overview
Gladia is a speech-to-text platform built for production, turning raw audio into structured outputs that power real workflows like meeting summaries, CRM enrichment, contact center QA, and real-time voice assistants.
With support for 99+ languages and the ability to handle messy real-world audio—overlapping speakers, accents, code-switching, domain-specific terminology—Gladia is designed for the complexity of actual conversations, not clean studio recordings.
Key Features
- Real-time And Async Transcription
- Named Entity Recognition
- Multi-lingual Support (99+ Languages)
- Speaker Diarization
- Code-switching
- Custom Vocabulary
- Multi-region Support
Releases
Top alternatives
-
Mery🙏 82 karmaMay 16, 2025@AssemblyAIOne of the most accurate API's I've used for speech to text and summarization. Cost effective w/ bulk contracts too. -
Unlimited transcripts, summaries, 99.8% accuracy, speaker recognition, superfastOpenI already have another transcription tool, but this one is much better. I love the different features such as the summary, quiz, and chapters. It does a great job of them. I've only done one transcript so far to try it out, but I'm truly impressed and am going to grab another code. A couple things that would make it even better are: - the ability to rename the files and organize them through folders. - the ability to download a copy of the other features as well as the transcript. Copying and pasting it works, but doesn't keep the format. -
🎯 3 free transcripts every day. 🔥 Unlimited transcription starting at $10/mo.OpenNo other tool quite like this, it's pretty straightforward. Needed to extract a long interview from YouTube and it extracted everything, providing it in different meaningful formats in less than two minutes. Awesome
-
OpenThis is my favourite, so handy and works brilliant -
OpenHi there! It worked fine for me, even with longer videos. It might have been a temporary bug, try again
MongoDB - Build AI That Scales


