DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
-
Updated
Dec 18, 2020 - C++
{{ message }}
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
kaldi-asr/kaldi is the official location of the Kaldi project.
Facebook AI Research's Automatic Speech Recognition Toolkit
Speech recognition module for Python, supporting several engines and APIs, online and offline.
A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统
End-to-End Speech Processing Toolkit
PocketSphinx is a lightweight speech recognition engine, specifically tuned for handheld and mobile devices, though it works equally well on the desktop
End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow
NeMo: a toolkit for conversational AI
Lingvo
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
Alias is a teachable “parasite” that is designed to give users more control over their smart assistants, both when it comes to customisation and privacy. Through a simple app the user can train Alias to react on a custom wake-word/sound, and once trained, Alias can take control over your home assistant by activating it for you.
List of Machine Learning, AI, NLP solutions for iOS. The most recent version of this article can be found on my blog.
Kalliope is a framework that will help you to create your own personal assistant.
DELTA is a deep learning based natural language and speech processing platform.
Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP
A Chinese Deep Speech Recognition System 包括基于深度学习的声学模型和基于深度学习的语言模型
Machine Learning Resources, Practice and Research
Open-Source Large Vocabulary Continuous Speech Recognition Engine
一个非常方便的python录音程序,专门为MASR量身定做:
按回车开始录音,说完话后再按Enter结束录音并显示识别结果,录音文件会以识别的文本命名保存,方便后期统计识别率。
代码地址:
https://github.com/deepxuexi/ARFASR
如果觉得好用,给我点个star,谢谢!
A PaddlePaddle implementation of DeepSpeech2 architecture for ASR.
the open-source virtual assistant for Ubuntu based Linux distributions
One can use https://github.com/s-yata/marisa-trie to save a lot of space for symbols.
A voice control - voice commands - speech recognition and speech synthesis javascript library. Create your own siri,google now or cortana with Google Chrome within your website.
Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framwork.
Add a description, image, and links to the speech-recognition topic page so that developers can more easily learn about it.
To associate your repository with the speech-recognition topic, visit your repo's landing page and select "manage topics."
Documentation Is:
E