Real-time microphone noise suppression on Linux.
Updated Aug 11, 2022 - Go
Automagically synchronize subtitles with video.
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, and speaker embedding
Command-line utility to transcribe/translate from video/audio/subtitles to subtitles
Silero VAD: pre-trained enterprise-grade Voice Activity Detector, Language Classifier and Spoken Number Detector
Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
An audio/acoustic activity detection and audio segmentation tool
CNN-based audio segmentation toolkit. Detects speech, music and speaker gender. Designed for large-scale gender equality studies based on speech time per gender.
Python AI assistant
Voice Activity Detection based on Deep Learning & TensorFlow
Automatically synchronize and translate subtitles with pretrained deep neural networks, forced alignments and transformers. https://subaligner.readthedocs.io/
A statistical model-based voice activity detector
Voice Activity Detection (VAD) using deep learning.
Repository for our Interspeech 2020 paper on general-purpose voice activity detection (GPVAD)
PyTorch implementation of Self-Attentive VAD (ICASSP 2021)
Python library for rVAD, an unsupervised, fast method for robust voice activity detection, as described in the paper "rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection Method".
On-device voice activity detection (VAD) powered by deep learning.