#

multi-speaker

Here are 9 public repositories matching this topic...

mikebrady / shairport-sync

Sponsor

AirPlay audio player. Shairport Sync adds multi-room capability with Audio Synchronisation

audio audio-player embedded-systems audio-streaming multi-room-audio airplay multi-speaker synchronized-audio airplay-2

Updated Apr 5, 2022
C

r9y9 / deepvoice3_pytorch

Sponsor

Open

Multi GPU Support

4

tanmayb123 commented Mar 4, 2018

I'd like to train this model on 8 V100 GPUs - does it support multi GPU training?

Read more

enhancement help wanted good first issue

aishoot / LSTM_PIT_Speech_Separation

Two-talker Speech Separation with LSTM/BLSTM by Permutation Invariant Training method.

multi-speaker audio-separation speech-separation speech-enhancement permutation-invariant-training robust-speech-recognition

Updated Jan 6, 2022
Jupyter Notebook

ranchlai / mandarin-tts

Chinese Mandarin tts text-to-speech 中文 (普通话) 语音合成 , by fastspeech 2 , implemented in pytorch, using waveglow as vocoder, with biaobei and aishell3 datasets

pytorch tts multi-speaker tacotron fastspeech2 tts-chinese tts-hanzi aishell3

Updated Feb 3, 2022
Python

keonlee9420 / Comprehensive-Transformer-TTS

A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate TTS

text-to-speech deep-learning unsupervised pytorch tts speech-synthesis transformer supervised multi-speaker sota comprehensive single-speaker neural-tts non-autoregressive fastspeech fastspeech2 hifi-gan non-ar mel-gan ultimate-tts

Updated Mar 6, 2022
Python

Totoketchup / Adaptive-MultiSpeaker-Separation

Adaptive and Focusing Neural Layers for Multi-Speaker Separation Problem

tensorflow adaptive-learning deeplearning multi-speaker source-separation audio-separation speech-separation deep-learning-architectures

Updated Jul 7, 2018
Jupyter Notebook

keonlee9420 / Comprehensive-Tacotron2

PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single-, multi-speaker TTS and several techniques to enforce the robustness and efficiency of the model.

text-to-speech deep-learning efficiency pytorch tts speech-synthesis autoregressive multi-speaker robustness comprehensive tacotron single-speaker neural-tts tacotron2 reduction-factor hifi-gan mel-gan diagonal-guided-attention

Updated Feb 20, 2022
Python

nikitashvarts / CocktailPartySpeakerRecognition

An Algorithm for Speaker Recognition in a Multi-Speaker Environment

deep-learning lstm speaker-recognition multi-speaker cocktail-party-problem

Updated Aug 14, 2020
Python

ZoraizQ / urdu-speech-recognition

Urdu Speech Recognition using Kaldi ASR, by training Triphone Acoustic GMMs using the PRUS dataset.

speech-recognition multi-speaker urdu kaldi-asr prus

Updated Sep 24, 2021
Shell

Improve this page

Add a description, image, and links to the multi-speaker topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the multi-speaker topic, visit your repo's landing page and select "manage topics."