🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
-
Updated
Aug 16, 2024 - Python
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Persian/Farsi text to speech(TTS) training using coqui tts
TTS models for Arabic (Tacotron2, FastPitch)
支持各种感情的男女声音,支持实时和离线文本合成tts语音;支持单模特声音变声,语音速率调整,语音音量大小调整;支持自定义语音模型。
zero-shot realtime TTS system, fully offline, free and open source
Persian text-to-speech streamlit interface
TTS for Arabic (FastPitch, Mixer-TTS) in the ONNX format
This was created using NextJS and Typescript. This app takes 4 of the OpenAi models: GPT-4 (chat), Dalle-3 (image generator), Vision (image analysis), and TTS-1 (text-to-speech) and allows the user to transform the way they approach everyday tasks.
Docker镜像自动构建并上传到阿里云
Voice Agent responds like humans for the sales teams to qualify the leads and different use cases
Mixer-TTS for efficient TTS
This project includes a Python script for fine-tuning a text-to-speech (TTS) model. The script utilizes custom datasets and use CUDA for accelerated training.
Audiobook Simplifier is a tool that creates audiobooks from text documents or eBooks using TTS (Text-to-Speech) technology.
XTTS fine-tuning via CLI
Assignment 2: Fine-tuning Text-to-Speech (TTS) Models for English Technical Speech and Regional Languages
A Streamlit web app for AI-powered voice cloning using Coqui XTTS v2. Record or upload reference voices, clone speech in multiple languages, and generate natural audio outputs.
Add a description, image, and links to the tts-model topic page so that developers can more easily learn about it.
To associate your repository with the tts-model topic, visit your repo's landing page and select "manage topics."