End-to-end Automatic Speech Recognition for Mandarin and English in TensorFlow
Updated Feb 24, 2020 - Python
Speech synthesis, voice conversion, self-supervised learning, music generation, automatic speech recognition, speaker verification, language modeling
Open STT
This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )
End-to-End speech recognition implementation based on TensorFlow (CTC, Attention, and MTL training)
End-to-end ASR/LM implementation with PyTorch
Deep-learning-based Automatic Speech Recognition with attention for the NVIDIA Jetson.
Evaluate your speech-to-text system with similarity measures such as word error rate (WER)
Trained models for automatic speech recognition (ASR). A library to quickly build applications that require speech to text conversion.
A PyTorch implementation for the ZeroSpeech 2019 challenge.
Mongolian speech recognition with PyTorch
Speech Recognition model based on a FAIR research paper, built using PyTorch.
Python implementation of pre-processing for End-to-End speech recognition
A Polymer 3+ web component / button for doing speech recognition
Automatic Speech Recognition using TensorFlow
Vietnamese Automatic Speech Recognition
A Sweet Automatic Speech Recognition like Tiramisu Cake using TensorFlow 2
Training an n-gram based Language Model using KenLM toolkit for Deep Speech 2
End-to-End Speech Recognition using Neural Networks.
End-to-End Speech Recognition Using TensorFlow
Long audio alignment using Kaldi
UPC Deep Learning for Speech and Language 2018
Code for "Completely Unsupervised Speech Recognition By A Generative Adversarial Network Harmonized With Iteratively Refined Hidden Markov Models"
This repository contains my attempt to use two well-known speech recognition frameworks (Kaldi, CMU Sphinx4) for the Arabic language using the publicly available dataset "Arabic Corpus of Isolated Words"
A Python 2.7 implementation of Mel Frequency Cepstral Coefficients (MFCC) and Dynamic Time Warping (DTW) algorithms for Automated Speech Recognition (ASR).
A simple Automatic Speech Recognition (ASR) model in TensorFlow that lets you focus on the deep neural network itself. It's easy to test popular cells (mostly LSTM and its variants) and models (unidirectional RNN, bidirectional RNN, ResNet and so on). You are also welcome to plug in self-defined cells or models.
Make Smart Things with TensorFlow
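
One entry above mentions evaluating a speech-to-text system with word error rate (WER). As a rough illustration of what that metric computes, here is a minimal sketch in plain Python. It is not taken from any of the listed repositories; the function name `word_error_rate` is hypothetical, and the sketch assumes whitespace tokenization with no text normalization (no lowercasing or punctuation stripping). WER is taken as the word-level edit distance (substitutions, deletions, insertions) divided by the number of reference words.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length.

    Assumption: words are separated by whitespace and compared exactly.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,                      # deletion of a reference word
                curr[j - 1] + 1,                  # insertion of a hypothesis word
                prev[j - 1] + substitution_cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # One reference word ("the") is missing from the hypothesis.
    print(word_error_rate("the cat sat on the mat", "the cat sat on mat"))
```

Because insertions count as errors, the value can exceed 1.0 when the hypothesis is much longer than the reference. Real evaluation tools usually normalize the text before scoring, which this sketch leaves out.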