New pages
- 02:51, 22 September 2025 Mean Opinion Score [13,296 bytes] Ttswikiadmin (Add MOS) Tag: Visual edit
- 02:39, 22 September 2025 VITS [5,535 bytes] Ttswikiadmin (Created page with "'''VITS''' ('''Variational Inference with adversarial learning for end-to-end Text-to-Speech''') and '''VITS2''' are neural text-to-speech synthesis models that generate speech directly from text input using end-to-end training. VITS was first introduced by researchers at Kakao Enterprise in June 2021, while VITS2 was developed by SK Telecom and published in July 2023 as an improvement over the original model. == Overview == Traditional text-to-speech systems typically...") Tag: Visual edit: Switched
- 20:46, 21 September 2025 IndexTTS2 [8,170 bytes] Ttswikiadmin (Add IndexTTS 2) Tag: Visual edit
- 20:48, 20 September 2025 ElevenLabs [9,072 bytes] Ttswikiadmin (Add ElevenLabs page) Tag: Visual edit
- 20:10, 20 September 2025 MIT License [3,777 bytes] Ttswikiadmin (Created page with "'''MIT License''' is a permissive free software license originally developed at the Massachusetts Institute of Technology (MIT). It is one of the most popular open-source licenses used in software development and is commonly used for licensing both code and model weights. == Overview == The MIT License is characterized by its simplicity and permissive nature. It allows users to do almost anything with the licensed software, including using, copying, modifying, merg...") Tag: Visual edit: Switched
- 16:27, 20 September 2025 Chatterbox [5,234 bytes] Ttswikiadmin (Add Chatterbox) Tag: Visual edit
- 16:04, 20 September 2025 Orpheus TTS [3,835 bytes] Ttswikiadmin (Add Orpheus TTS) Tag: Visual edit
- 01:48, 20 September 2025 VibeVoice [6,709 bytes] Ttswikiadmin (Created page with "'''VibeVoice''' is an experimental text-to-speech (TTS) framework developed by Microsoft Research for generating long-form, multi-speaker conversational audio. It was released in August 2025 and is designed to synthesize long-form speech content such as podcasts and audiobooks with up to 4 speakers and with support for voice cloning.<ref>https://github.com/microsoft/VibeVoice</ref> == Development and Release == VibeVoice was developed by a team at Microsoft Res...") Tag: Visual edit
- 03:42, 19 September 2025 Emilia Dataset [4,466 bytes] Ttswikiadmin (Add Emilia dataset) Tag: Visual edit