Main Page
Welcome to TTS Wiki
TTS Wiki is a collaborative knowledge base dedicated to documenting and comparing latest Text-to-Speech (TTS) models and technologies. Our mission is to provide comprehensive, up-to-date information about the rapidly evolving landscape of speech synthesis.
Getting Started
For Developers
- Installation Guides - Setup instructions for various TTS models
- Finetuning Guides - Walkthroughs for fine-tuning TTS models
- Licensing Overview - Commercial usage rights and restrictions
Model Categories
Open Source Models
Model | License | Languages | Voice Cloning | Conversational | Fine-Tuning | Date Released |
---|---|---|---|---|---|---|
F5-TTS | MIT, CC-BY-NC | English, Chinese | ✅ | ❌ | ✅ | 2024 |
MaskGCT | MIT, CC-BY-NC | English, Chinese, Korean, Japanese, French, German | ✅ | ❌ | ❌ | 2024 |
StyleTTS 2 | MIT | English | ✅ | ❌ | ✅ | 2024 |
VibeVoice | MIT | English, Chinese | ❌ | ✅ | ✅ | 2025 |
Commercial Services
Coming soon
Contributing
Contributions are welcome! Here's how you can help:
- Add new models - Document recently released TTS systems
- Update comparisons - Share performance benchmarks and quality tests
- Write tutorials and guides - Help others learn to use different TTS tools
- Upload samples - Provide audio examples for model comparisons (please do not upload copyrighted content!)
- Fix information - Correct outdated or inaccurate details
Disclaimer: This wiki is maintained by the community and information may not always be current. Always verify details with official sources before making production decisions. Voice cloning should only be used with proper consent and for ethical purposes.