mozilla / DeepSpeech

Star

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

machine-learning embedded deep-learning offline tensorflow speech-recognition neural-networks speech-to-text deepspeech on-device

Updated Dec 18, 2020
C++

kaldi-asr / kaldi

Star

kaldi-asr/kaldi is the official location of the Kaldi project.

shell c-plus-plus cuda speech speech-recognition speech-to-text kaldi speaker-verification speaker-id

Updated Dec 18, 2020
Shell

leon-ai / leon

Star

Open

Add a tutorial to install on a Raspberry Pi

19

borisrorsvort commented Feb 22, 2019

Documentation Is:

Missing
Needed
Confusing
Not Sure?

E

Time module

12

Open

Hardware / Scaling requirements

13

Find more good first issues →

TalAter / annyang

Star

💬 Speech recognition for your site

demo gui tutorial voice speech speech-recognition speech-to-text hacktoberfest

Updated Dec 2, 2020
JavaScript

facebookresearch / wav2letter

Star

Facebook AI Research's Automatic Speech Recognition Toolkit

deep-learning cpp end-to-end speech-recognition wav2letter

Updated Dec 17, 2020
C++

Uberi / speech_recognition

Star

Speech recognition module for Python, supporting several engines and APIs, online and offline.

audio python speech-recognition speech-to-text

Updated Dec 6, 2020
Python

nl8590687 / ASRT_SpeechRecognition

Star

A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统

python tensorflow keras cnn speech-recognition speech-to-text ctc chinese-speech-recognition

Updated Oct 23, 2020
Python

espnet / espnet

Star

End-to-End Speech Processing Toolkit

deep-learning chainer end-to-end machine-translation pytorch speech-synthesis speech-recognition kaldi voice-conversion speech-separation speech-enhancement speech-translation

Updated Dec 18, 2020
Python

cmusphinx / pocketsphinx

Star

PocketSphinx is a lightweight speech recognition engine, specifically tuned for handheld and mobile devices, though it works equally well on the desktop

python c speech-recognition

Updated Mar 28, 2020
C

zzw922cn / Automatic_Speech_Recognition

Star

End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow

audio deep-learning tensorflow paper end-to-end evaluation cnn lstm speech-recognition rnn automatic-speech-recognition feature-vector data-preprocessing phonemes timit-dataset layer-normalization rnn-encoder-decoder chinese-speech-recognition

Updated Nov 13, 2020
Python

NVIDIA / NeMo

Star

NeMo: a toolkit for conversational AI

nlp deep-learning neural-network speech-recognition nlp-machine-learning

Updated Dec 19, 2020
Jupyter Notebook

tensorflow / lingvo

Star

Lingvo

nlp research translation tensorflow machine-translation speech distributed tts speech-synthesis mnist speech-recognition lm seq2seq speech-to-text gpu-computing language-model asr

Updated Dec 17, 2020
Python

pannous / tensorflow-speech-recognition

Star

🎙Speech recognition using the tensorflow deep learning framework, sequence-to-sequence neural networks

deep-learning neural-network tensorflow speech-recognition speech-to-text stt

Updated Nov 20, 2018
Python

mravanelli / pytorch-kaldi

Star

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.

deep-neural-networks deep-learning speech dnn pytorch recurrent-neural-networks lstm gru speech-recognition rnn kaldi rnn-model asr lstm-neural-networks multilayer-perceptron-network timit dnn-hmm

Updated Jun 11, 2020
Python

astorfi / lip-reading-deeplearning

Sponsor Star

🔓 Lip Reading - Cross Audio-Visual Recognition using 3D Architectures

computer-vision deep-learning tensorflow speech-recognition 3d-convolutional-network

Updated Mar 3, 2020
Python

bjoernkarmann / project_alias

Star

Alias is a teachable “parasite” that is designed to give users more control over their smart assistants, both when it comes to customisation and privacy. Through a simple app the user can train Alias to react on a custom wake-word/sound, and once trained, Alias can take control over your home assistant by activating it for you.

raspberry-pi machine-learning hack smarthome microphone speech-recognition classification alias sound-synthesis wakeword

Updated Apr 5, 2020
Python

alexsosn / iOS_ML

Star

List of Machine Learning, AI, NLP solutions for iOS. The most recent version of this article can be found on my blog.

swift machine-learning natural-language-processing computer-vision deep-learning neural-network artificial-intelligence speech-recognition gpgpu awesome-list

Updated Jul 30, 2018

kalliope-project / kalliope

Star

Kalliope is a framework that will help you to create your own personal assistant.

linux bot home-automation speech-synthesis speech-recognition personal-assistant bot-creation raspberry speech-to-text jarvis

Updated Dec 15, 2020
Python

Delta-ML / delta

Star

DELTA is a deep learning based natural language and speech processing platform.

Updated Dec 18, 2020
Python

NVIDIA / OpenSeq2Seq

Star

Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP

text-to-speech deep-learning tensorflow multi-node speech-synthesis speech-recognition seq2seq speech-to-text neural-machine-translation sequence-to-sequence language-model multi-gpu float16 mixed-precision

Updated Oct 19, 2020
Python

audier / DeepSpeechRecognition

Star

A Chinese Deep Speech Recognition System 包括基于深度学习的声学模型和基于深度学习的语言模型

speech-recognition asr speechrecognition chinese-speech-recognition deep-speech

Updated Mar 26, 2019
Python

yanshengjia / ml-road

Sponsor Star

Machine Learning Resources, Practice and Research

nlp machine-learning computer-vision deep-learning tensorflow pytorch speech-recognition

Updated Nov 6, 2020
Python

julius-speech / julius

Star

Open-Source Large Vocabulary Continuous Speech Recognition Engine

recognition speech speech-recognition audio-processing

Updated Sep 24, 2020
C

nobody132 / masr

Star

Open

一个非常方便的python录音程序，专门为MASR量身定做

2

deepxuexi commented Jun 25, 2019

一个非常方便的python录音程序，专门为MASR量身定做：
按回车开始录音，说完话后再按Enter结束录音并显示识别结果，录音文件会以识别的文本命名保存，方便后期统计识别率。

代码地址:
https://github.com/deepxuexi/ARFASR

如果觉得好用，给我点个star，谢谢！

基于anaconda3安装MASR及测试结果

6

PaddlePaddle / DeepSpeech

Star

A PaddlePaddle implementation of DeepSpeech2 architecture for ASR.

speech speech-recognition speech-to-text deep-speech

Updated Nov 11, 2020
Python

DragonComputer / Dragonfire

Star

the open-source virtual assistant for Ubuntu based Linux distributions

nlp linux machine-learning text-to-speech ubuntu chatbot artificial-intelligence spacy speech-recognition personal-assistant speech-to-text kaldi virtual-assistant

Updated Sep 25, 2020
Python

alphacep / vosk-api

Star

Open

Compress symbol table

3

nshmyrev commented Aug 4, 2020

One can use https://github.com/s-yata/marisa-trie to save a lot of space for symbols.

sdkcarlos / artyom.js

Star

A voice control - voice commands - speech recognition and speech synthesis javascript library. Create your own siri,google now or cortana with Google Chrome within your website.

recognition voice-commands speech-synthesis speech-recognition speech-to-text

Updated Dec 2, 2020
JavaScript

react-native-voice / voice

Star

🎤 React Native Voice Recognition library for iOS and Android (Online and Offline Support)

android ios react-native voice-recognition speech-recognition

Updated Dec 11, 2020
Objective-C

alumae / kaldi-gstreamer-server

Star

Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framwork.

speech-recognition

Updated Oct 13, 2020
Python

Nov	DEC	Jan
	19
2019	2020	2021

speech-recognition

Here are 1,837 public repositories matching this topic...

mozilla / DeepSpeech

kaldi-asr / kaldi

leon-ai / leon

Add a tutorial to install on a Raspberry Pi

Documentation Is:

E

Time module

Hardware / Scaling requirements

TalAter / annyang

facebookresearch / wav2letter

Uberi / speech_recognition

nl8590687 / ASRT_SpeechRecognition

espnet / espnet

cmusphinx / pocketsphinx

zzw922cn / Automatic_Speech_Recognition

NVIDIA / NeMo

tensorflow / lingvo

pannous / tensorflow-speech-recognition

mravanelli / pytorch-kaldi

astorfi / lip-reading-deeplearning

bjoernkarmann / project_alias

alexsosn / iOS_ML

kalliope-project / kalliope

Delta-ML / delta

NVIDIA / OpenSeq2Seq

audier / DeepSpeechRecognition

yanshengjia / ml-road

julius-speech / julius

nobody132 / masr

一个非常方便的python录音程序，专门为MASR量身定做

基于anaconda3安装MASR及测试结果

PaddlePaddle / DeepSpeech

DragonComputer / Dragonfire

alphacep / vosk-api

Compress symbol table

sdkcarlos / artyom.js

react-native-voice / voice

alumae / kaldi-gstreamer-server

Improve this page

Add this topic to your repo