Tracking the progress in end-to-end speech translation
Updated Jan 12, 2023
A PyPI package for fast word/character error rate (WER/CER) calculation
Code for the paper "Cross-modal Contrastive Learning for Speech Translation" (NAACL 2022)
Praaline is an open-source system to manage, annotate, visualise and analyse spoken language corpora
An implementation of the paper "End-to-end Speech Translation via Cross-modal Progressive Training" (Interspeech 2021)
A recognition system for 10 digits, based on DTW, HMM and GMM
This project demonstrates the use of generic bi-directional LSTM models for predicting the importance of words in a spoken dialogue for understanding its meaning. The model is trained and evaluated on a human-annotated corpus of word importance, which can be downloaded from: http://latlab.ist.rit.edu/lrec2018
Software that analyzes speech utterances
Example code for my PhD work on recognizing dimensional emotions in spoken dialogue
Speech subtask of the 2017 NLI Shared Task
Convex combination of phonotactics for large-scale spoken language identification
Code for the paper "Learning English with Peppa Pig" https://doi.org/10.48550/arXiv.2202.12917
RNN for Spoken Language Understanding
A guide to spoken language processing