A TensorFlow implementation of Baidu's DeepSpeech architecture
Updated Jul 17, 2020 - C++
This is the official location of the Kaldi project.
Speech recognition module for Python, supporting several engines and APIs, online and offline.
A Deep-Learning-Based Chinese Speech Recognition System
Lingvo
Gathers machine learning and TensorFlow deep learning models for NLP problems, 1.13 < TensorFlow < 2.0
Kalliope is a framework that will help you to create your own personal assistant.
Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP
The open-source virtual assistant for Ubuntu-based Linux distributions
A voice control, voice command, speech recognition, and speech synthesis JavaScript library. Create your own Siri, Google Now, or Cortana with Google Chrome within your website.
A PaddlePaddle implementation of DeepSpeech2 architecture for ASR.
A TensorFlow Implementation of DC-TTS: yet another text-to-speech model
Free, easy, portable audio engine for games
Descriptive Deep Learning
Stephanie is an open-source platform built specifically for voice-controlled applications and for automating daily tasks, imitating much of a virtual assistant's work.
The official repository of the Eesen project
Adapt Intent Parser
An asynchronous Python library to automate solving ReCAPTCHA v2 using audio
@voicybot Telegram bot main repository
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Open STT
The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.
Speech-to-text benchmark framework
Node.js client for Google Cloud Speech: Speech to text conversion powered by machine learning.