Welcome to TTS Wiki

TTS Wiki is a collaborative knowledge base dedicated to documenting and comparing latest Text-to-Speech (TTS) models and technologies. Our mission is to provide comprehensive, up-to-date information about the rapidly evolving landscape of speech synthesis.

Getting Started

For Developers

Installation Guides - Setup instructions for various TTS models
Finetuning Guides - Walkthroughs for fine-tuning TTS models
Licensing Overview - Commercial usage rights and restrictions

Model Categories

Open Source Models

Model	License	Languages	Voice Cloning	Conversational	Fine-Tuning	Date Released
F5-TTS	MIT, CC-BY-NC	English, Chinese	✅	❌	✅	2024
MaskGCT	MIT, CC-BY-NC	English, Chinese, Korean, Japanese, French, German	✅	❌	❌	2024
StyleTTS 2	MIT	English	✅	❌	✅	2024
VibeVoice	MIT	English, Chinese	❌	✅	✅	2025

Commercial Services

Coming soon

Contributing

Contributions are welcome! Here's how you can help:

Add new models - Document recently released TTS systems
Update comparisons - Share performance benchmarks and quality tests
Write tutorials and guides - Help others learn to use different TTS tools
Upload samples - Provide audio examples for model comparisons (please do not upload copyrighted content!)
Fix information - Correct outdated or inaccurate details

Disclaimer: This wiki is maintained by the community and information may not always be current. Always verify details with official sources before making production decisions. Voice cloning should only be used with proper consent and for ethical purposes.

Main Page

Contents

Welcome to TTS Wiki

Getting Started

For Developers

Model Categories

Open Source Models

Commercial Services

Contributing

Navigation menu

Main Page

Welcome to TTS Wiki

Getting Started

For Developers

Model Categories

Open Source Models

Commercial Services

Contributing

Navigation menu

Search