Microsoft launches VibeVoice

Microsoft Research has launched VibeVoice, an open-source TTS model enabling expressive, multi-speaker dialogues. Capable of generating 90-minute conversations with up to four speakers, it uses continuous speech tokenisers at 7.5 Hz for high-quality audio. Powered by Qwen2.5-1.5B and a 123M-parameter diffusion head, VibeVoice supports English and Chinese, embeds watermarks for safety, and is available on GitHub (MIT License) for research use.

What's Hot

93% of Organizations Have Experienced AI-Caused Infrastructure Incidents as ‘Vibe Coding’ Spreads to Infrastructure – Spacelift Survey

Foxconn Singapore Increases Stake in India Unit With $37.2 Million Investment

IBM unveils sub-1 nanometer chip technology, touts nanostack design

Microsoft has launched VibeVoice, a Open-Source Text-to-Speech Model

The Great Betrayal: Microsoft, OpenAI and Amazon’s Shifting Alliances

Why Startups Fail: Distribution Matters More Than Product

How India, Dubai, and Singapore Are Quietly Building the World’s Next Startup Power Corridor

Ind-Ego: The Crisis IndiGo Saw Coming and Still Walked Into

AllHome Closes Rs 200 Crore Round at Rs 2,000 Crore Valuation

AI-Generated Code Triggers Infrastructure Incidents at 93% of Organizations: Report

Foxconn Singapore Increases Stake in India Unit With $37.2 Million Investment

Explore On The Go !

What's Hot

Microsoft has launched VibeVoice, a Open-Source Text-to-Speech Model

Keep Reading

Explore On The Go !