Social Media

Light
Dark

Leave a Reply

Your email address will not be published. Required fields are marked *

Light
Dark

Microsoft Research has launched VibeVoice, an open-source TTS model enabling expressive, multi-speaker dialogues. Capable of generating 90-minute conversations with up to four speakers, it uses continuous speech tokenisers at 7.5 Hz for high-quality audio. Powered by Qwen2.5-1.5B and a 123M-parameter diffusion head, VibeVoice supports English and Chinese, embeds watermarks for safety, and is available on GitHub (MIT License) for research use.