This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.
SincNet is a neural architecture for efficiently processing raw audio samples.
Speaker diarization with uis-rnn and speaker embeddings from vgg-speaker-recognition
Speech recognition based on MFCC and Gaussian mixture models (GMM)
An Android ChatBot powered by Watson Services - Assistant, Speech-to-Text and Text-to-Speech on IBM Cloud.
Keras implementation of "Deep Speaker: an End-to-End Neural Speaker Embedding System" (speaker recognition)
Angular penalty loss functions in PyTorch (ArcFace, SphereFace, Additive Margin, CosFace)
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
Identifying people from small audio fragments
Speaker Identification System (up to 100% accuracy), built using Python 2.7 and the python_speech_features library
In defence of metric learning for speaker recognition
A collection of recent speaker recognition papers and their implementations.
This repository contains audio samples and supplementary materials accompanying publications related to the speaker-id team at Google.
A program for automatic speaker identification using deep learning techniques.
Time delay neural network (TDNN) implementation in PyTorch using the unfold method
Official implementation of Mockingjay in PyTorch
Speaker identification with VGGVox network
An Android ChatBot powered by IBM Watson Services (Assistant V1, Text-to-Speech, and Speech-to-Text with Speaker Recognition) on IBM Cloud.
Simple d-vector based speaker recognition (verification and identification) using PyTorch
A complete voiceprint (speaker) recognition project.
Keras implementation of DeepMind's WaveNet for supervised learning tasks
Speaker recognition library based on MARF for Raspberry Pi and other SBCs.
This repo contains my attempt to create a Speaker Recognition and Verification system using SideKit-1.3.1
PyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and Kaldi
Deep speaker embeddings in PyTorch, including x-vectors
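A lightweight neural speaker embedding extractor based on Kaldi and PyTorch.
A Flask-based web demo system for Chinese automatic speech recognition, including speech recognition, speech synthesis, and voiceprint (speaker) recognition.
Mirror of the VoxCeleb dataset, a large-scale speaker identification dataset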