VibeVoice TTS 1.5B

VibeVoice is a family of open-source frontier voice AI models from Microsoft covering long-form ASR, multi-speaker TTS, and lightweight real-time TTS. It uses continuous acoustic and semantic speech tokenizers operating at 7.5 Hz together with a next-token diffusion head. With this design it can transcribe up to 60 minutes of audio in a single pass and synthesize up to 90 minutes of natural, multi-speaker speech efficiently.
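
To give a feel for what a 7.5 Hz tokenizer rate means over the durations quoted above, the following minimal sketch counts tokenizer frames for 60 and 90 minutes of audio. It is plain arithmetic, not VibeVoice code: the function name is illustrative, the 50 Hz comparison rate is an assumption chosen only for contrast, and the real model may consume more than one token per frame.

```python
# Back-of-the-envelope frame counts at the tokenizer rate stated in the description.
# This is only arithmetic; it does not call or reproduce any VibeVoice API.

FRAME_RATE_HZ = 7.5           # tokenizer frame rate from the description
HYPOTHETICAL_RATE_HZ = 50.0   # assumption for comparison only, not from the source


def frames_for_minutes(minutes: float, rate_hz: float = FRAME_RATE_HZ) -> int:
    """Return the number of tokenizer frames covering `minutes` of audio."""
    return round(minutes * 60 * rate_hz)


if __name__ == "__main__":
    for label, minutes in (("ASR, 60 min", 60), ("TTS, 90 min", 90)):
        low = frames_for_minutes(minutes)
        high = frames_for_minutes(minutes, HYPOTHETICAL_RATE_HZ)
        print(f"{label}: {low} frames at 7.5 Hz, {high} at the hypothetical 50 Hz")
```

Under these assumptions, 90 minutes of speech corresponds to 40,500 frames at 7.5 Hz, which suggests why a low frame rate helps keep very long audio within a single sequence.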
Released: August 25, 2025

Overview

VibeVoice is Microsoft's open-source frontier voice AI family, unifying long-form ASR, multi-speaker TTS and low-latency streaming TTS using continuous speech tokenizers and next-token diffusion for efficient, high-fidelity speech over very long audio.

About Microsoft

Microsoft is a technology company that offers a wide range of software, cloud computing services, hardware, and artificial intelligence solutions.

Industry: Technology, Information and Internet
Company Size: 228,000+
Location: Redmond, Washington, US

Tools using VibeVoice TTS 1.5B

No tools found for this model yet.

Last updated: February 25, 2026