speech-recognition

Feature request

Is the addition of the 'OPTforSequenceClassification' class scheduled?
Is someone handling it?
When adding these functions, I wonder if it is possible to PR one by one, or if I have to PR all classes supported by other models.

Motivation

Added function of OPT class, which is being actively discussed recently

Your contribution

I personally use the forSequenceCla

Specs

Leon version: latest
OS (or browser) version: Fedora 30
Node.js version: 10.16.3
Complete "npm run check" output:

➡ Here is the diagnosis about your current setup
✔ Run
✔ Run modules
✔ Reply you by texting
❗ Amazon Polly text-to-speech
❗ Google Cloud text-to-speech
❗ Watson text-to-speech
❗ Offline text-to-speech
❗ Google Cloud speech-to-text
❗ Watson spee

目前的多音字使用 pypinyin 或者 g2pM，精度有限，想做一个基于 BERT (或者 ERNIE) 多音字预测模型，简单来说就是假设某语言有 100 个多音字，每个多音字最多有 3 个发音，那么可以在 BERT 后面接 100 个 3 分类器（简单的 fc 层即可），在预测时，找到对应的分类器进行分类即可。
参考论文：
tencent_polyphone.pdf

数据可以用 https://github.com/kakaobrain/g2pM 提供的数据

进阶：多任务的 BERT
![image](https://user-images.githubusercontent.com/24568452

As implemented in Python in

alphacep/vosk-api@5e46825

一个非常方便的python录音程序，专门为MASR量身定做：
按回车开始录音，说完话后再按Enter结束录音并显示识别结果，录音文件会以识别的文本命名保存，方便后期统计识别率。

代码地址:
https://github.com/deepxuexi/ARFASR

如果觉得好用，给我点个star，谢谢！

May	JUN	Jul
	22
2021	2022	2023

speech-recognition

Here are 2,867 public repositories matching this topic...

huggingface / transformers

Feature request

Motivation

Your contribution

mozilla / DeepSpeech

kaldi-asr / kaldi

kmario23 / deep-learning-drizzle

leon-ai / leon

Specs

Uberi / speech_recognition

TalAter / annyang

flashlight / wav2letter

nl8590687 / ASRT_SpeechRecognition

espnet / espnet

NVIDIA / NeMo

PaddlePaddle / PaddleSpeech

speechbrain / speechbrain

alphacep / vosk-api

cmusphinx / pocketsphinx

zzw922cn / Automatic_Speech_Recognition

snakers4 / silero-models

tensorflow / lingvo

zzw922cn / awesome-speech-recognition-speech-synthesis-papers

mravanelli / pytorch-kaldi

wenet-e2e / wenet

pannous / tensorflow-speech-recognition

yanshengjia / ml-road

syhw / wer_are_we

astorfi / lip-reading-deeplearning

bjoernkarmann / project_alias

kalliope-project / kalliope

nobody132 / masr

Delta-ML / delta

julius-speech / julius

Improve this page

Add this topic to your repo