Discover TTS WebUI Extensions
Enhance your Text-to-Speech WebUI with powerful extensions from the community
61 extensions found
Featured Extensions
Omnivoice (uv)
State-of-the-art massive multilingual zero-shot text-to-speech model supporting 600+ languages with voice cloning and voice design.
by rsxdalv
Text-to-Speech
ACE-Step 1.5
The most powerful local music generation model that outperforms most commercial alternatives, supporting Mac, AMD, Intel, and CUDA devices.
by rsxdalv
Audio & Music
Foundation-1 (uv)
Foundation-1 music generation model by RoyalCities, built on stable-audio-tools
by rsxdalv
Audio & Music
Parakeet
Speech transcription via Nvidia Parakeet model
by mefi
Tools
Bark Legacy
This is the legacy UI of Bark from TTS-WebUI
by rsxdalv
Text-to-Speech
PyVideoTrans TTS API
PyVideoTrans text-to-speech API with WebUI integration.
by rsxdalv
Tools
SRT Tools
Import and parse multiple SRT files into JSON segments for later TTS batching.
by rsxdalv
Tools
Pip Install UI
Install and uninstall Python packages from the web UI. Disable when not in use for security.
by rsxdalv
Tools
Omnivoice (uv)
State-of-the-art massive multilingual zero-shot text-to-speech model supporting 600+ languages with voice cloning and voice design.
by rsxdalv
Text-to-Speech
ACE-Step 1.5
The most powerful local music generation model that outperforms most commercial alternatives, supporting Mac, AMD, Intel, and CUDA devices.
by rsxdalv
Audio & Music
Foundation-1 (uv)
Foundation-1 music generation model by RoyalCities, built on stable-audio-tools
by rsxdalv
Audio & Music
AudioCraft Plus (uv)
AudioCraft Plus is an all-in-one WebUI for the original AudioCraft, adding many quality features on top.
by rsxdalv
Audio & Music
Tortoise TTS (uv) [broken]
Tortoise TTS is a high-quality text-to-speech model with voice cloning capabilities
by rsxdalv
Text-to-Speech
Log Viewer
View, search, and manage log files from the TTS Generation WebUI. Browse installation logs, filter by keywords, and clean up old logs.
by rsxdalv
Settings
Vall-E-X
Multilingual text-to-speech model supporting English, Chinese, and Japanese
by rsxdalv
Text-to-Speech
StyleTTS2
StyleTTS2 is a text-to-speech model that generates high-quality speech with controllable style
by rsxdalv
Text-to-Speech
Seamless M4T
SeamlessM4T is a multilingual and multimodal translation model supporting text and speech
by rsxdalv
Text-to-Speech
MMS
MMS (Massively Multilingual Speech) is a text-to-speech model supporting over 1000 languages
by rsxdalv
Text-to-Speech
Tortoise TTS
Tortoise TTS is a high-quality text-to-speech model with voice cloning capabilities
by rsxdalv
Text-to-Speech
F5-TTS
F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching.
by rsxdalv
Text-to-Speech
Chatterbox
Chatterbox, Resemble AI's first production-grade open source TTS model
by rsxdalv
Text-to-Speech
Kokoro
Kokoro: A small, fast, and high-quality TTS model
by rsxdalv
Text-to-Speech
Bark
Bark: A text-to-speech model
by rsxdalv
Text-to-Speech
XTTS
XTTS-Simple is a Gradio UI for XTTSv2
by rsxdalv
Text-to-Speech
Parler-TTS
Parler-TTS is a training and inference library for high-fidelity text-to-speech (TTS) models.
by rsxdalv
Text-to-Speech
CosyVoice [Unstable]
CosyVoice: High-quality text-to-speech synthesis.
by rsxdalv
Text-to-Speech
MARS5
MARS5: A novel speech model for insane prosody
by rsxdalv
Text-to-Speech
DIA
DIA: A text-to-dialogue model
by rsxdalv
Text-to-Speech
GPT-SoVITS (uv)
GPT-SoVITS: A TTS solution powered by GPT and SoftVC VITS Singing Voice Conversion.
by rsxdalv
Text-to-Speech
Maha TTS
Maha TTS allows generating speech from text using the MahaTTS model.
by rsxdalv
Text-to-Speech
OpenVoice (uv)
OpenVoice: A versatile instant voice cloning approach
by rsxdalv
Text-to-Speech
OpenVoice V2
OpenVoice: A versatile instant voice cloning approach
by rsxdalv
Text-to-Speech
Piper TTS
Piper TTS is a text-to-speech model
by rsxdalv
Text-to-Speech
Higgs V2 (Early Access)
Higgs V2
by rsxdalv
Text-to-Speech
VibeVoice (Early Access)
A template extension for TTS Generation WebUI
by rsxdalv
Text-to-Speech
Kitten TTS
A template extension for TTS Generation WebUI
by rsxdalv
Text-to-Speech
Index TTS (uv)
A template extension for TTS Generation WebUI
by rsxdalv
Text-to-Speech
VoxCPM (Beta)
A template extension for TTS Generation WebUI
by rsxdalv
Text-to-Speech
FireRedTTS2 (Beta)
A template extension for TTS Generation WebUI
by rsxdalv
Text-to-Speech
MegaTTS3 (Alpha)
A template extension for TTS Generation WebUI
by rsxdalv
Text-to-Speech
ACE-Step
ACE-Step: A Step Towards Music Generation Foundation Model
by rsxdalv
Audio & Music
Stable Audio
Stable Audio is a text-to-audio model for generating high-quality music and sound effects
by rsxdalv
Audio & Music
Audiocraft
Audiocraft provides MusicGen and MAGNeT models for high-quality music and audio generation
by rsxdalv
Audio & Music
AudioCraft Plus
AudioCraft Plus is an all-in-one WebUI for the original AudioCraft, adding many quality features on top.
by rsxdalv
Audio & Music
Riffusion
Riffusion allows generating music from text.
by rsxdalv
Audio & Music
MusicGen (Mac)
MusicGen allows generating music from text
by rsxdalv
Audio & Music
Songbloom (Beta)
A template extension for TTS Generation WebUI
by rsxdalv
Audio & Music
Vocos
Vocos is a neural audio codec for high-quality audio compression and reconstruction
by rsxdalv
Audio Conversion
RVC
RVC: Retrieval-based Voice Conversion
by rsxdalv
Audio Conversion
Demucs
Demucs is a music source separation model that can separate drums, bass, vocals, and other instruments
by rsxdalv
Audio Conversion
Audio Separator
Audio Separator splits an audio file into multiple separate audio files.
by rsxdalv
Audio Conversion
Resemble Enhance
Resemble Enhance allows enhancing audio files.
by rsxdalv
Audio Conversion
AP-BWE Bandwidth Extension
AP-BWE: An audio bandwidth extension solution using Amplitude-Phase Bandwidth Extension models.
by rsxdalv
Audio Conversion
PyRNNoise
A template extension for TTS Generation WebUI
by rsxdalv
Audio Conversion
OpenAI TTS API
OpenAI compatible TTS API with support for multiple TTS models
by rsxdalv
Tools
XTTS Fine-tuning Demo
XTTS fine-tuning demo
by rsxdalv
Tools
RVC Training (Not available yet)
RVC Training
by rsxdalv
Tools
Bark Voice Clone
Bark Voice Clone allows cloning voices for use with Bark TTS
by rsxdalv
Tools
Ebook2Audiobook (Not available yet)
Ebook2Audiobook allows converting ebooks to audiobooks
by rsxdalv
Tools
EPub2TTS (Not available yet)
EPub2TTS allows converting ebooks to audiobooks
by rsxdalv
Tools
Audiobook Generator (Not available yet)
Audiobook Generator allows converting ebooks to audiobooks
by rsxdalv
Tools
CUDA Toolkit
CUDA Toolkit
by rsxdalv
Tools
Kimi Audio
Kimi Audio is a powerful text-to-speech and speech-to-text model by Moonshot AI
by rsxdalv
Conversational AI
MiMo-Audio
A template extension for TTS Generation WebUI
by rsxdalv
Conversational AI