-
Updated
Jun 4, 2020 - Python
Lingvo
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
DELTA is a deep learning based natural language and speech processing platform.
A Chinese Deep Speech Recognition System 包括基于深度学习的声学模型和基于深度学习的语言模型
The official repository of the Eesen project
Espresso: A Fast End-to-End Neural Speech Recognition Toolkit
SincNet is a neural architecture for efficiently processing raw audio samples.
Open STT
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.
Thanks for this wonderful library, but it would be much more intuitive for users to get started by providing some simple but clearly training process on self-defined dataset.
On-device streaming speech-to-text engine powered by deep learning
Open tools and data for cloudless automatic speech recognition
End-to-End speech recognition implementation base on TensorFlow (CTC, Attention, and MTL training)
The README.md file says to run the following command:
find /media/kaldi_models/ -type f | xargs sed -i 's:test:/opt:g'
The intent seems to be to change paths in ivector_extractor.conf and online_nnet2_decoding.conf from test/models/... to /opt/models/... where they will work with the docker image. However, that command also changes all instances of "test" in the wordlist words.txt! So for
A Keras CTC implementation of Baidu's DeepSpeech for model experimentation
Kaldi-based Korean ASR (한국어 음성인식) open-source project
End-to-end ASR/LM implementation with pytorch.
Python module for evaluating ASR hypotheses (e.g. word error rate, word recognition rate).
An opensource speech-to-text software written in tensorflow
使用FreeSWITCH接受用户手机呼叫,通过UniMRCP Server集成讯飞开放平台(xfyun)插件将用户语音进行语音识别(ASR),并根据自定义业务逻辑调用语音合成(TTS),构建简单的端到端语音呼叫中心。
an open-source implementation of sequence-to-sequence base speech processing engine
Chinese text normalization for speech processing
Can you check the script "data/aishell.py" ?
I got this error"
Traceback (most recent call last):
File "data/aishell.py", line 188, in <module>
tr_file_list = traverse(root, "transcript/train", search_fix="")
File "data/aishell.py", line 19, in traverse
for s_p in sorted(os.listdir(p)):
FileNotFoundError: [Errno 2] No such file or directory: 'Aishell_dataset/transcAdd a description, image, and links to the asr topic page so that developers can more easily learn about it.
To associate your repository with the asr topic, visit your repo's landing page and select "manage topics."
I think the filenames in models.sh referred to on lines 4-9 should refer to kaldi-generic-en-tdnn_f-r20190609* which is downloaded on line 3.