-
Updated
Apr 29, 2022 - Python
#
asr
Here are 475 public repositories matching this topic...
A PyTorch-based Speech Toolkit
audio
deep-learning
transformers
pytorch
voice-recognition
speech-recognition
speech-to-text
language-model
speaker-recognition
speaker-verification
speech-processing
audio-processing
asr
speaker-diarization
speechrecognition
speech-separation
speech-enhancement
spoken-language-understanding
huggingface
speech-toolkit
good first issue
Good for newcomers
1
3
alexa
ai
amazon-echo
muse
tts
google-home
unit
bci
speaker
homeassistant
snowboy
asr
anyq
raspeberry-pi
-
Updated
Apr 19, 2022 - Python
Lingvo
nlp
research
translation
tensorflow
machine-translation
speech
distributed
tts
speech-synthesis
mnist
speech-recognition
lm
seq2seq
speech-to-text
gpu-computing
language-model
asr
-
Updated
Apr 29, 2022 - Python
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
deep-neural-networks
deep-learning
speech
dnn
pytorch
recurrent-neural-networks
lstm
gru
speech-recognition
rnn
kaldi
rnn-model
asr
lstm-neural-networks
multilayer-perceptron-network
timit
dnn-hmm
-
Updated
Mar 14, 2022 - Python
Production First and Production Ready End-to-End Speech Recognition Toolkit
pytorch
transformer
speech-recognition
automatic-speech-recognition
production-ready
asr
conformer
e2e-models
-
Updated
Apr 29, 2022 - C++
Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
text-to-speech
german
speech
pytorch
tts
speech-synthesis
english
speech-recognition
spanish
colab
speech-to-text
pretrained-models
stt
asr
capitalization
onnx
stt-benchmark
tts-models
torch-hub
repunctuation
-
Updated
Apr 28, 2022 - Jupyter Notebook
DELTA is a deep learning based natural language and speech processing platform.
nlp
front-end
ops
deep-learning
text-classification
tensorflow
nlu
speech
inference
text-generation
speech-recognition
seq2seq
sequence-to-sequence
speaker-verification
asr
tensorflow-serving
emotion-recognition
custom-ops
serving
tensorflow-lite
-
Updated
Apr 27, 2022 - Python
BitBarrel
commented
Sep 19, 2021
Creating CSV files manually is a lot of work. This could be automated by a script if the name of the WAV file is the same as the transcript.
The same could be done for creating a language model input text file. A script could pull the transcript from the WAV file name.
SincNet is a neural architecture for efficiently processing raw audio samples.
audio
python
deep-learning
signal-processing
waveform
cnn
pytorch
artificial-intelligence
speech-recognition
neural-networks
convolutional-neural-networks
digital-signal-processing
filtering
speaker-recognition
speaker-verification
speech-processing
audio-processing
asr
timit
speaker-identification
-
Updated
Apr 28, 2021 - Python
Espresso: A Fast End-to-End Neural Speech Recognition Toolkit
-
Updated
Apr 29, 2022 - Python
A Python wrapper for Kaldi
python
wrapper
numpy
speech
feature-extraction
speech-recognition
kaldi
language-model
asr
openfst
clif
-
Updated
Mar 10, 2022 - Python
The official repository of the Eesen project
-
Updated
May 23, 2019 - C++
an open-source implementation of sequence-to-sequence based speech processing engine
deployment
tensorflow
tts
speech-synthesis
transformer
speech-recognition
sequence-to-sequence
unsupervised-learning
speaker-recognition
asr
ctc
wfst
-
Updated
Mar 20, 2022 - Python
Open STT
-
Updated
Mar 11, 2022 - Python
Open
Design a Logo
iceychris
commented
Nov 16, 2020
Design a logo for LibreASR and share it here.
To make an open source project cool, it should have a logo
good first issue
Good for newcomers
Open
Raspberry Pi Support
A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.
-
Updated
May 7, 2020 - Python
End-to-end ASR/LM implementation with PyTorch
streaming
speech
language-modeling
pytorch
transformer
speech-recognition
seq2seq
attention
automatic-speech-recognition
sequence-to-sequence
language-model
attention-mechanism
asr
ctc
rnn-transducer
transformer-xl
-
Updated
Aug 30, 2021 - Python
Open
Create REST server
1
nshmyrev
commented
Sep 26, 2021
good first issue
Good for newcomers
On-device streaming speech-to-text engine powered by deep learning
android
python
c
raspberry-pi
iot
ios
machine-learning
arm
deep-learning
offline
webassembly
voice-recognition
speech-recognition
speech-to-text
stt
asr
-
Updated
Apr 14, 2022 - Java
Chinese text normalization for speech processing
-
Updated
Apr 29, 2022 - Python
Open tools and data for cloudless automatic speech recognition
-
Updated
Mar 30, 2021 - Python
Sequence-to-Sequence Framework in PyTorch
deep-learning
cnn
pytorch
speech-recognition
seq2seq
neural-machine-translation
nmt
multimodality
asr
-
Updated
Jul 13, 2021 - Jupyter Notebook
OleguerCanal
commented
Apr 9, 2022
1
python
pypi
speech-recognition
nlp-library
asr
nlp-tool
arabic-numbers
arabic-numerals
chinese-numerals
cn2an
-
Updated
Apr 23, 2022 - Python
Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.
end-to-end
pytorch
transformer
speech-recognition
las
seq2seq
jasper
asr
conformer
attention-is-all-you-need
korean-speech
e2e-asr
las-models
ksponspeech
-
Updated
Sep 16, 2021 - Python
PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)
recognition
speech
cnn
pytorch
transformer
speech-recognition
conv
convolution
augmented
asr
conformer
transformer-xl
-
Updated
Mar 15, 2022 - Python
基于PaddlePaddle实现的语音识别,中文语音识别。项目完善,识别效果好。支持Windows,Linux下训练和预测,支持Nvidia Jetson开发板预测。
docker
deep-learning
speech-recognition
chinese
speech-to-text
nvidia-docker
asr
paddlepaddle
deepspeech2
deepspeech
-
Updated
Apr 15, 2022 - Python
Improve this page
Add a description, image, and links to the asr topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the asr topic, visit your repo's landing page and select "manage topics."

As implemented in Python in
alphacep/vosk-api@5e46825