vocoder

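Polyphonic characters are currently handled with pypinyin or g2pM, whose accuracy is limited. The goal is a polyphone prediction model based on BERT (or ERNIE). Put simply: suppose a language has 100 polyphonic characters, each with at most 3 pronunciations. You can then attach 100 three-way classifiers (a simple fc layer each is enough) on top of BERT, and at prediction time pick the classifier that belongs to the character in question and classify with it.
Reference paper:
tencent_polyphone.pdf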

For data, the dataset provided by https://github.com/kakaobrain/g2pM can be used.
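
The per-character classifier routing described above can be sketched in a few lines. This is a minimal illustration using only the standard library, not the proposed implementation: `fake_encoder` is a hypothetical stand-in for BERT/ERNIE that returns random vectors, the heads are untrained, and the small `READINGS` inventory (3 characters instead of 100) is made up for the example. It only shows how each polyphonic character selects its own fc head and how unused output slots are ignored.

```python
import random

HIDDEN = 8        # stand-in for the encoder hidden size (assumption)
MAX_READINGS = 3  # from the description: at most 3 pronunciations per character

# Hypothetical inventory: polyphonic character -> candidate readings.
READINGS = {
    "行": ["xing2", "hang2"],
    "长": ["chang2", "zhang3"],
    "和": ["he2", "he4", "huo4"],
}


def make_head(rng):
    """One fc layer: a MAX_READINGS x HIDDEN weight matrix plus a bias vector."""
    weights = [[rng.uniform(-1, 1) for _ in range(HIDDEN)]
               for _ in range(MAX_READINGS)]
    bias = [0.0] * MAX_READINGS
    return weights, bias


def fake_encoder(sentence, rng):
    """Placeholder for BERT/ERNIE: one HIDDEN-dim vector per character."""
    return [[rng.uniform(-1, 1) for _ in range(HIDDEN)] for _ in sentence]


def predict(sentence, heads, encoder):
    """Route every polyphonic character to its own head and take the argmax."""
    hidden = encoder(sentence)
    result = []
    for char, vec in zip(sentence, hidden):
        if char not in heads:
            continue  # not a polyphonic character, nothing to classify
        weights, bias = heads[char]
        logits = [sum(w * x for w, x in zip(row, vec)) + b
                  for row, b in zip(weights, bias)]
        # Only the first len(READINGS[char]) outputs are valid for this character.
        valid = len(READINGS[char])
        best = max(range(valid), key=lambda i: logits[i])
        result.append((char, READINGS[char][best]))
    return result


rng = random.Random(0)
heads = {char: make_head(rng) for char in READINGS}
predictions = predict("银行很长", heads, lambda s: fake_encoder(s, rng))
print(predictions)
```

In a real model the heads would be trained jointly with the encoder, and the loss for each training example would be computed only on the head of the labeled character.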

Advanced: a multi-task BERT
![image](https://user-images.githubusercontent.com/24568452

vocoder

Here are 88 public repositories matching this topic...

mozilla / TTS

coqui-ai / TTS

PaddlePaddle / PaddleSpeech

[tts] Implement polyphone prediction for the TTS text frontend with BERT

[tts] Implement pause prediction for the TTS text frontend with BERT

[tts] Reproduce a simple music_generation

TensorSpeech / TensorFlowTTS

kan-bayashi / ParallelWaveGAN

mmorise / World

jik876 / hifi-gan

ivanvovk / WaveGrad

lmnt-com / diffwave

rishikksh20 / VocGAN

szechyjs / mbelib

haoheliu / voicefixer

lmnt-com / wavegrad

HidekiKawahara / legacy_STRAIGHT

geneing / WaveRNN-Pytorch

descriptinc / cargan

mindslab-ai / univnet

xcmyz / FastVocoder

Rongjiehuang / FastDiff

syang1993 / FFTNet

erogol / FFTNet

tuan3w / cnn_vocoder

CSTR-Edinburgh / magphase

rishikksh20 / Fre-GAN-pytorch

Rongjiehuang / Multi-Singer

zceng / LVCNet

magnetophon / VoiceOfFaust

sh123 / codec2_talkie

azraelkuan / FFTNet

BogiHsu / WG-WaveNet

