Discover TTS WebUI Extensions

Enhance your Text-to-Speech WebUI with powerful extensions from the community

75 extensions found

Featured Extensions

Vall-E-X
text-to-speech
Multilingual text-to-speech model supporting English, Chinese, and Japanese
by rsxdalv
StyleTTS2
text-to-speech
StyleTTS2 is a text-to-speech model that generates high-quality speech with controllable style
by rsxdalv
Seamless M4T
text-to-speech
SeamlessM4T is a multilingual and multimodal translation model supporting text and speech
by rsxdalv
Extensions List
settings
Extensions List shows the list of interface extensions in the web UI
by rsxdalv
Decorator Extensions List
settings
Decorator Extensions List shows the list of decorator extensions in the web UI
by rsxdalv
Gradio Settings
settings
Gradio Settings allows to configure Gradio interface options from the web UI
by rsxdalv
GPU Info
settings
Display GPU information such as VRAM, CUDA version, and more.
by openai
Installed Packages
settings
Pip List shows the list of installed packages in the web UI
by rsxdalv
Model Location Settings
settings
Model Location Settings allows changing the location of the model cache directories used by Hugging Face and Torch Hub.
by rsxdalv
External Extensions Installer
settings
Add external extension entries via JSON and install them without restarts.
by rsxdalv
Log Viewer
settings
View, search, and manage log files from the TTS Generation WebUI. Browse installation logs, filter by keywords, and clean up old logs.
by rsxdalv
Vall-E-X
text-to-speech
Multilingual text-to-speech model supporting English, Chinese, and Japanese
by rsxdalv
StyleTTS2
text-to-speech
StyleTTS2 is a text-to-speech model that generates high-quality speech with controllable style
by rsxdalv
Seamless M4T
text-to-speech
SeamlessM4T is a multilingual and multimodal translation model supporting text and speech
by rsxdalv
MMS
text-to-speech
MMS (Massively Multilingual Speech) is a text-to-speech model supporting over 1000 languages
by rsxdalv
Tortoise TTS
text-to-speech
Tortoise TTS is a high-quality text-to-speech model with voice cloning capabilities
by rsxdalv
F5-TTS
text-to-speech
F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching.
by rsxdalv
Chatterbox
text-to-speech
Chatterbox, Resemble AI's first production-grade open source TTS model
by rsxdalv
Kokoro
text-to-speech
Kokoro: A small, fast, and high-quality TTS model
by rsxdalv
Bark
text-to-speech
Bark: A text-to-speech model
by rsxdalv
XTTS
text-to-speech
XTTS-Simple is a Gradio UI for XTTSv2
by rsxdalv
Parler-TTS
text-to-speech
Parler-TTS is a training and inference library for high-fidelity text-to-speech (TTS) models.
by rsxdalv
CosyVoice [Unstable]
text-to-speech
CosyVoice: High-quality text-to-speech synthesis.
by rsxdalv
MARS5
text-to-speech
MARS5: A novel speech model for insane prosody
by rsxdalv
DIA
text-to-speech
DIA: A text-to-dialogue model
by rsxdalv
GPT-SoVITS [low compatibility]
text-to-speech
GPT-SoVITS: A TTS solution powered by GPT and SoftVC VITS Singing Voice Conversion.
by rsxdalv
Maha TTS
text-to-speech
Maha TTS allows generating speech from text using the MahaTTS model.
by rsxdalv
OpenVoice
text-to-speech
OpenVoice: A versatile instant voice cloning approach
by rsxdalv
OpenVoice V2
text-to-speech
OpenVoice: A versatile instant voice cloning approach
by rsxdalv
Piper TTS
text-to-speech
Piper TTS is a text-to-speech model by rsxdalv
by rsxdalv
Higgs V2 (Early Access)
text-to-speech
Higgs V2
by rsxdalv
VibeVoice (Early Access)
text-to-speech
A template extension for TTS Generation WebUI
by rsxdalv
Kitten TTS
text-to-speech
A template extension for TTS Generation WebUI
by rsxdalv
Index TTS (Beta)
text-to-speech
A template extension for TTS Generation WebUI
by rsxdalv
VoxCPM (Beta)
text-to-speech
A template extension for TTS Generation WebUI
by rsxdalv
FireRedTTS2 (Beta)
text-to-speech
A template extension for TTS Generation WebUI
by rsxdalv
MegaTTS3 (Alpha)
text-to-speech
A template extension for TTS Generation WebUI
by rsxdalv
ACE-Step
audio-music-generation
ACE-Step: A Step Towards Music Generation Foundation Model
by rsxdalv
Stable Audio
audio-music-generation
Stable Audio is a text-to-audio model for generating high-quality music and sound effects
by rsxdalv
Audiocraft
audio-music-generation
Audiocraft provides MusicGen and MAGNeT models for high-quality music and audio generation
by rsxdalv
AudioCraft Plus
audio-music-generation
AudioCraft Plus is an all-in-one WebUI for the original AudioCraft, adding many quality features on top.
by rsxdalv
Riffusion
audio-music-generation
Riffusion allows generating music from text.
by rsxdalv
MusicGen (Mac)
audio-music-generation
MusicGen allows generating music from text
by rsxdalv
Songbloom (Beta)
audio-music-generation
A template extension for TTS Generation WebUI
by rsxdalv
Vocos
audio-conversion
Vocos is a neural audio codec for high-quality audio compression and reconstruction
by rsxdalv
RVC
audio-conversion
RVC: Retrieval-based Voice Conversion
by rsxdalv
Demucs
audio-conversion
Demucs is a music source separation model that can separate drums, bass, vocals, and other instruments
by rsxdalv
Audio Separator
audio-conversion
Audio Separator allows separating audio files into multiple audio files.
by rsxdalv
Resemble Enhance
audio-conversion
Resemble Enhance allows enhancing audio files.
by rsxdalv
AP-BWE Bandwidth Extension
audio-conversion
AP-BWE: An audio bandwidth extension solution using Amplitude-Phase Bandwidth Extension models.
by rsxdalv
PyRNNoise
audio-conversion
A template extension for TTS Generation WebUI
by rsxdalv
History
outputs
Outputs Tab for TTS WebUI
by rsxdalv
Gallery History
outputs
Gallery History allows selecting previously generated audio files by looking at their waveforms
by rsxdalv
FFMPEG Metadata
outputs
FFMPEG Metadata allows loading metadata from audio files.
by rsxdalv
OpenAI TTS API
tools
OpenAI compatible TTS API with support for multiple TTS models
by rsxdalv
Whisper
tools
Whisper allows transcribing audio files.
by rsxdalv
XTTS Fine-tuning Demo
tools
XTTS fine-tuning demo
by rsxdalv
Huggingface Cache Manager
tools
Huggingface Cache Manager allows managing the Huggingface cache.
by rsxdalv
Model Downloader
tools
Model Downloader allows downloading models from the Huggingface model hub.
by rsxdalv
Simple Remixer
tools
Simple remixer allows concatenating multiple audio files and mixing them together.
by rsxdalv
Conda Storage Optimizer
tools
Conda Storage Optimizer allows cleaning up conda storage to free disk space.
by rsxdalv
RVC Training (Not available yet)
tools
RVC Training
by rsxdalv
Bark Voice Clone
tools
Bark Voice Clone allows cloning voices for use with Bark TTS
by rsxdalv
Ebook2Audiobook (Not available yet)
tools
Ebook2Audiobook allows converting ebooks to audiobooks
by rsxdalv
EPub2TTS (Not available yet)
tools
EPub2TTS allows converting ebooks to audiobooks
by rsxdalv
Audiobook Generator (Not available yet)
tools
Audiobook Generator allows converting ebooks to audiobooks
by rsxdalv
CUDA Toolkit
tools
CUDA Toolkit
by rsxdalv
Kimi Audio
conversational-ai
Kimi Audio is a powerful text-to-speech and speech-to-text model by Moonshot AI
by rsxdalv
MiMo-Audio
conversational-ai
A template extension for TTS Generation WebUI
by rsxdalv
YouTube Tutorials
tutorials
YouTube Tutorials shows a list of YouTube tutorials in the web UI
by rsxdalv
Parakeet
tools
Speech transcription via Nvidia Parakeet model
by mefi
Bark Legacy
text-to-speech
This is the legacy UI of Bark from TTS-WebUI
by rsxdalv
PyVideoTrans TTS API
tools
PyVideoTrans text-to-speech API with WebUI integration.
by rsxdalv
SRT Tools
tools
Import and parse multiple SRT files into JSON segments for later TTS batching.
by rsxdalv
Pip Install UI
tools
Install and uninstall Python packages from the web UI. Disable when not in use for security.
by rsxdalv
Save Ogg
outer
Decorator Save Ogg
by rsxdalv
Save Waveform
outer
Decorator Save Waveform
by rsxdalv
Average Time
outer
Decorator Average Execution Time
by rsxdalv