DeepSpeech is an open source speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
-
Updated
Sep 18, 2020 - C++
{{ message }}
DeepSpeech is an open source speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
kaldi-asr/kaldi is the official location of the Kaldi project.
Speech recognition module for Python, supporting several engines and APIs, online and offline.
A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统
Lingvo
Gathers machine learning and Tensorflow deep learning models for NLP problems, 1.13 < Tensorflow < 2.0
Kalliope is a framework that will help you to create your own personal assistant.
Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP
the open-source virtual assistant for Ubuntu based Linux distributions
A PaddlePaddle implementation of DeepSpeech2 architecture for ASR.
A voice control - voice commands - speech recognition and speech synthesis javascript library. Create your own siri,google now or cortana with Google Chrome within your website.
A TensorFlow Implementation of DC-TTS: yet another text-to-speech model
hi,
as you know, in SoLoud, the number of filters are limited
we should implement more like different reverbs, fir and irr filters, (these could be used to implement HRTF support), Chorus, One Poll, One Zero, Pole Zero, Two Pole, Two Zero, etc
a library exists called stk under zlib license which already implemented these maybe we can implement some of these out
Descriptive Deep Learning
Stephanie is an open-source platform built specifically for voice-controlled applications as well as to automate daily tasks imitating much of an virtual assistant's work.
The official repository of the Eesen project
An asynchronized Python library to automate solving ReCAPTCHA v2 using audio
Adapt Intent Parser
One can use https://github.com/s-yata/marisa-trie to save a lot of space for symbols.
@voicybot Telegram bot main repository
Open STT
Node.js client for Google Cloud Speech: Speech to text conversion powered by machine learning.
The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.
speech to text benchmark framework
Add a description, image, and links to the speech-to-text topic page so that developers can more easily learn about it.
To associate your repository with the speech-to-text topic, visit your repo's landing page and select "manage topics."
Documentation Is:
E