kaldi-asr/kaldi is the official location of the Kaldi project.
-
Updated
Aug 3, 2021 - Shell
{{ message }}
kaldi-asr/kaldi is the official location of the Kaldi project.
Pre-trained and Reproduced Deep Learning Models (『飞桨』官方模型库,包含多种学术前沿和工业场景验证的深度学习模型)
Code examples for new APIs of iOS 10.
Lingvo
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
WaveNet vocoder
A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model
DELTA is a deep learning based natural language and speech processing platform.
A PaddlePaddle Speech to Any toolkit.
Python library and CLI tool to interface with Google Translate's text-to-speech API
Open-Source Large Vocabulary Continuous Speech Recognition Engine
hi,
as you know, in SoLoud, the number of filters are limited
we should implement more like different reverbs, fir and irr filters, (these could be used to implement HRTF support), Chorus, One Poll, One Zero, Pole Zero, Two Pole, Two Zero, etc
a library exists called stk under zlib license which already implemented these maybe we can implement some of these out
A TensorFlow Implementation of DC-TTS: yet another text-to-speech model
Silero Models: pre-trained speech-to-text, text-to-speech models and benchmarks made embarrassingly simple
A Python wrapper for Kaldi
The neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)
Speech Enhancement Generative Adversarial Network in TensorFlow
Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
Node.js client for Google Cloud Speech: Speech to text conversion powered by machine learning.
A neural network for end-to-end speech denoising
I thought about rearranging items in sentence (using drag & drop) in case of misclick or user fault.
Add a description, image, and links to the speech topic page so that developers can more easily learn about it.
To associate your repository with the speech topic, visit your repo's landing page and select "manage topics."
I have observed that Example codes are missing from Documentation for the below mentioned Classes from torchaudio, Can I Raise PR for it
Group A