Microsoft Research has launched VibeVoice, an open-source TTS model enabling expressive, multi-speaker dialogues. Capable of generating 90-minute conversations with up to four speakers, it uses continuous speech tokenisers at 7.5 Hz for high-quality audio. Powered by Qwen2.5-1.5B and a 123M-parameter diffusion head, VibeVoice supports English and Chinese, embeds watermarks for safety, and is available on GitHub (MIT License) for research use.
Trending
- AllHome Closes Rs 200 Crore Round at Rs 2,000 Crore Valuation
- AI-Generated Code Triggers Infrastructure Incidents at 93% of Organizations: Report
- 93% of Organizations Have Experienced AI-Caused Infrastructure Incidents as ‘Vibe Coding’ Spreads to Infrastructure – Spacelift Survey
- Foxconn Singapore Increases Stake in India Unit With $37.2 Million Investment
- Foxconn Singapore Increases Stake in India Unit With $37.2 Million Investment
- IBM unveils sub-1 nanometer chip technology
- IBM unveils sub-1 nanometer chip technology, touts nanostack design
- JiviAI Closes Down; Ankur Jain’s BharatPe Return Under Discussion

