Xiaomi Group AI Lab releases ZipVoice series of speech synthesis (TTS) models

2025-09-12 10:23:52

According to Xiaomi Technology, the Next-Generation Kaldi team at Xiaomi Group's AI Lab recently released ZipVoice, a series of text-to-speech (TTS) models built on flow matching. The series comprises ZipVoice, a zero-shot single-speaker TTS model, and ZipVoice-Dialog, a zero-shot conversational TTS model. ZipVoice targets the large parameter counts and slow synthesis speed of existing zero-shot TTS models, while ZipVoice-Dialog targets the stability and inference-speed bottlenecks of existing conversational TTS models.

