speech-recognition
Here are 2,867 public repositories matching this topic...
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
Updated Jun 21, 2022 - C++
kaldi-asr/kaldi is the official location of the Kaldi project.
Updated Jun 21, 2022 - Shell
Drench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures!!
Updated Apr 10, 2022 - HTML
Fedora & apt-get
Specs
- Leon version: latest
- OS (or browser) version: Fedora 30
- Node.js version: 10.16.3
- Complete "npm run check" output:
➡ Here is the diagnosis about your current setup
✔ Run
✔ Run modules
✔ Reply you by texting
❗ Amazon Polly text-to-speech
❗ Google Cloud text-to-speech
❗ Watson text-to-speech
❗ Offline text-to-speech
❗ Google Cloud speech-to-text
❗ Watson spee
Speech recognition module for Python, supporting several engines and APIs, online and offline.
Updated Jun 22, 2022 - Python
Updated Mar 26, 2021 - JavaScript
Facebook AI Research's Automatic Speech Recognition Toolkit
Updated May 29, 2022 - C++
A Deep-Learning-Based Chinese Speech Recognition System
Updated Jun 22, 2022 - Python
End-to-End Speech Processing Toolkit
Updated Jun 22, 2022 - Python
NeMo: a toolkit for conversational AI
Updated Jun 22, 2022 - Jupyter Notebook
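Polyphonic characters are currently handled with pypinyin or g2pM, whose accuracy is limited. I would like to build a polyphone prediction model based on BERT (or ERNIE). Put simply, suppose a language has 100 polyphonic characters, each with at most 3 pronunciations. We can then attach 100 three-way classifiers (simple fully connected layers are enough) on top of BERT, and at prediction time pick the classifier that belongs to the character in question and classify with it.
Reference paper:
tencent_polyphone.pdf
Training data can come from the dataset provided at https://github.com/kakaobrain/g2pM
Advanced: a multi-task BERT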
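The routing idea in this proposal (one small classifier head per polyphonic character, selected at prediction time) can be illustrated with a minimal standard-library sketch. Everything here is an assumption made for illustration: `fake_encoder` stands in for BERT and returns random vectors, the three-character `POLYPHONES` inventory and its pinyin labels are a toy subset, and the heads are untrained, so the chosen readings carry no meaning. It only shows how a per-character head would be looked up and applied.

```python
import math
import random

HIDDEN = 8          # hypothetical encoder width (a real BERT-base uses 768)
MAX_READINGS = 3    # the proposal assumes at most 3 pronunciations per character

# Toy inventory (assumption): polyphonic character -> candidate readings.
POLYPHONES = {
    "行": ["xing2", "hang2"],
    "长": ["chang2", "zhang3"],
    "和": ["he2", "he4", "huo2"],
}


def fake_encoder(sentence, seed=0):
    """Stand-in for BERT: one deterministic HIDDEN-dim vector per character."""
    rng = random.Random(seed)
    return [[rng.uniform(-1, 1) for _ in range(HIDDEN)] for _ in sentence]


def make_head(rng):
    """One fully connected layer: MAX_READINGS weight rows plus a zero bias."""
    weights = [[rng.uniform(-1, 1) for _ in range(HIDDEN)]
               for _ in range(MAX_READINGS)]
    bias = [0.0] * MAX_READINGS
    return weights, bias


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


# One untrained head per polyphonic character.
_rng = random.Random(42)
HEADS = {char: make_head(_rng) for char in POLYPHONES}


def predict_readings(sentence):
    """Return (position, character, reading, probabilities) per polyphone."""
    hidden = fake_encoder(sentence)
    results = []
    for pos, char in enumerate(sentence):
        if char not in HEADS:
            continue  # not a polyphone, so no classifier is needed
        weights, bias = HEADS[char]
        logits = [sum(w * h for w, h in zip(row, hidden[pos])) + b
                  for row, b in zip(weights, bias)]
        # Characters with fewer than MAX_READINGS readings ignore spare slots.
        n = len(POLYPHONES[char])
        probs = softmax(logits[:n])
        best = max(range(n), key=probs.__getitem__)
        results.append((pos, char, POLYPHONES[char][best], probs))
    return results


if __name__ == "__main__":
    for pos, char, reading, probs in predict_readings("银行很长"):
        print(pos, char, reading, [round(p, 3) for p in probs])
```

In a real model the heads would be trained jointly with the encoder, and the per-character loss would be computed only at positions that hold a polyphonic character.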
, Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
Updated Jun 16, 2022
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
Updated Mar 14, 2022 - Python
Production First and Production Ready End-to-End Speech Recognition Toolkit
Updated Jun 21, 2022 - C++
Updated Nov 20, 2018 - Python
Machine Learning Resources, Practice and Research
Updated Oct 24, 2021 - Python
Attempt at tracking the state of the art and recent results (bibliography) on speech recognition.
Updated Dec 30, 2021
Updated Mar 3, 2020 - Python
Alias is a teachable “parasite” designed to give users more control over their smart assistants, in terms of both customisation and privacy. Through a simple app, the user can train Alias to react to a custom wake-word or sound. Once trained, Alias can take control of your home assistant by activating it for you.
Updated Apr 5, 2020 - Python
Kalliope is a framework that will help you to create your own personal assistant.
Updated Mar 6, 2022 - Python
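A very convenient Python recording program, tailor-made for MASR:
Press Enter to start recording. When you finish speaking, press Enter again to stop recording and display the recognition result. The recording is saved under a file name taken from the recognized text, which makes it easy to compute recognition accuracy later.
Code:
https://github.com/deepxuexi/ARFASR
If you find it useful, please give me a star. Thanks!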
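The "save the recording under the recognized text" step can be sketched with the standard library alone. This is not the code from the linked repository: `safe_filename` and `save_recording` are hypothetical helpers, the audio is assumed to be 16-bit mono PCM at 16 kHz, and microphone capture and recognition are left out entirely.

```python
import re
import tempfile
import wave
from pathlib import Path


def safe_filename(text, max_len=50):
    """Turn a transcript into a filesystem-safe file stem."""
    # Replace path separators, reserved characters, and whitespace runs.
    stem = re.sub(r'[\\/:*?"<>|\s]+', "_", text.strip()).strip("_")
    return stem[:max_len] or "untitled"


def save_recording(pcm_bytes, transcript, out_dir, sample_rate=16000):
    """Write 16-bit mono PCM to <out_dir>/<transcript>.wav and return the path."""
    path = Path(out_dir) / f"{safe_filename(transcript)}.wav"
    with wave.open(str(path), "wb") as wav:
        wav.setnchannels(1)       # mono
        wav.setsampwidth(2)       # 2 bytes per sample = 16-bit audio
        wav.setframerate(sample_rate)
        wav.writeframes(pcm_bytes)
    return path


if __name__ == "__main__":
    one_second_of_silence = b"\x00\x00" * 16000
    with tempfile.TemporaryDirectory() as tmp:
        saved = save_recording(one_second_of_silence, "hello world", tmp)
        print(saved.name)
```

Because each file name is the recognizer's output, comparing the names against what was actually spoken gives a quick way to tally recognition accuracy afterwards.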
DELTA is a deep learning based natural language and speech processing platform.
Updated May 26, 2022 - Python
Open-Source Large Vocabulary Continuous Speech Recognition Engine
Updated Apr 15, 2022 - C
Feature request
Is the addition of the 'OPTforSequenceClassification' class scheduled?
Is someone handling it?
When adding these functions, I wonder if it is possible to PR one by one, or if I have to PR all classes supported by other models.
Motivation
This would add functionality to the OPT class, which has been actively discussed recently.
Your contribution
I personally use the forSequenceCla