tts

Star

Here are 1,816 public repositories matching this topic...

CorentinJ / Real-Time-Voice-Cloning

Star

Clone a voice in 5 seconds to generate arbitrary speech in real-time

python deep-learning tensorflow pytorch tts voice-cloning

Updated Jul 8, 2023
Python

coqui-ai / TTS

Star

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

python text-to-speech deep-learning speech pytorch tts speech-synthesis voice-conversion vocoder voice-synthesis tacotron voice-cloning speaker-encodings melgan speaker-encoder multi-speaker-tts glow-tts hifigan tts-model

Updated Aug 28, 2023
Python

🤖 Self-hosted, community-driven, local OpenAI compatible API. Drop-in replacement for OpenAI running LLMs on consumer-grade hardware. Free Open Source OpenAI alternative. No GPU required. Runs ggml, gguf, GPTQ, onnx, TF compatible models: llama, llama2, gpt4all, rwkv, whisper, vicuna, koala, cerebras, falcon, dolly, starcoder, and many others

api kubernetes bloom ai containers falcon tts api-rest llama alpaca vicuna guanaco gpt-neox llm stable-diffusion rwkv gpt4all

Updated Aug 30, 2023
Go

PaddlePaddle / PaddleSpeech

Star

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

Updated Aug 24, 2023
Python

mozilla / TTS

Star

🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

python text-to-speech deep-learning speech pytorch tts vocoder tacotron tensorflow2 tacotron2 melgan speaker-encoder dataset-analysis glow-tts multiband-melgan gantts

Updated Jan 9, 2023
Jupyter Notebook

NVIDIA / NeMo

Star

NeMo: a toolkit for conversational AI

nlp text-to-speech deep-learning neural-network machine-translation tts speech-synthesis speech-recognition speech-to-text nmt language-model speaker-recognition nlp-machine-learning asr speaker-diarization text-normalization

Updated Aug 30, 2023
Python

jaywalnut310 / vits

Star

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

text-to-speech deep-learning pytorch tts speech-synthesis

Updated Jul 4, 2023
Python

wzpan / wukong-robot

Sponsor

Star

🤖 wukong-robot 是一个简单、灵活、优雅的中文语音对话机器人/智能音箱项目，支持ChatGPT多轮对话能力，还可能是首个支持脑机交互的开源智能音箱项目。

alexa ai amazon-echo muse tts openai google-home unit bci speaker homeassistant snowboy asr anyq raspeberry-pi gpt3 chatgpt

Updated Aug 14, 2023
Python

snakers4 / silero-models

Star

Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple

Updated Aug 17, 2023
Jupyter Notebook

MoonInTheRiver / DiffSinger

Star

DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code

text-to-speech midi tts speech-synthesis diffusion-model singing-voice singing-synthesis singing-voice-synthesis singing-voice-database aaai2022 diffusion-speedup

Updated May 2, 2023
Python

LokerL / tts-vue

Star

🎤 微软语音合成工具，使用 Electron + Vue + ElementPlus + Vite 构建。

electron vue tts element-plus

Updated Aug 8, 2023
TypeScript

TensorSpeech / TensorFlowTTS

Star

😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)

Updated Nov 25, 2022
Python

Plachtaa / VALL-E-X

Star

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io

text-to-speech tts gpt transformer-architecture emotional-speech voice-clone vall-e

Updated Aug 30, 2023
Python

keithito / tacotron

Star

A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)

python machine-learning tensorflow tts speech-synthesis tacotron

Updated Jul 6, 2023
Python

tensorflow / lingvo

Star

Lingvo

nlp research translation tensorflow machine-translation speech distributed tts speech-synthesis mnist speech-recognition lm seq2seq speech-to-text gpu-computing language-model asr

Updated Aug 30, 2023
Python

zzw922cn / awesome-speech-recognition-speech-synthesis-papers

Star

Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)