Clone a voice in 5 seconds to generate arbitrary speech in real-time
Updated Mar 13, 2022 - Python
A Python/PyTorch app for easily synthesising human voices
An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.
PAddle PARAllel text-to-speech toolKIT (supporting Tacotron2, Transformer TTS, FastSpeech2/FastPitch, SpeedySpeech, WaveFlow and Parallel WaveGAN)
This repository has an implementation of "Neural Voice Cloning With Few Samples"
Phoneme multilingual (Russian-English) voice cloning based on
Voice Conversion by CycleGAN (voice cloning / voice conversion): CycleGAN-VC2
Implementation of the Baidu research paper "Neural Voice Cloning with Few Samples"
Tacotron 2 - PyTorch implementation with faster-than-realtime inference, modified to enable cross-lingual voice cloning.
A guide to cloning anyone's voice and using it as a text-to-speech voice on Android
This repository is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) with a vocoder that works in real-time. SV2TTS is a three-stage deep learning framework that creates a numerical representation of a voice from a few seconds of audio and uses it to condition a text-to-speech model trained to generalize to new voices.
Voice Conversion by CycleGAN (voice cloning / voice conversion): CycleGAN-VC3
TensorFlow implementation of VQ-VAE with WaveNet decoder, based on https://arxiv.org/abs/1711.00937 and https://arxiv.org/abs/1901.08810
The TensorFlow version of multi-speaker TTS training with a feedback constraint
One-shot TTS with Improved Unseen Speaker and Style Transfer
This is sample code for an Alexa skill that uses realistic voice cloning powered by Resemble AI's text-to-speech API and OpenAI's GPT-3 AI engine.
Framework for a one-shot multispeaker system based on deep learning
Real Time Foreign Accent Conversion
Korean TTS using Coqui TTS (Glow-TTS and Multi-band MelGAN)
Finally, some decent sample sentences
A Sandbox to learn about the applications of Deep Learning and the Mathematics behind it
Audio samples from "Zero-Shot Long-Form Voice Cloning with Dynamic Convolution Attention"
A complete end-to-end deep learning system to generate high-quality, human-like speech in English for Korean drama (WIP)
Large publicly available speech datasets
Place for my articles, research, etc.
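
The SV2TTS description above outlines three stages: a speaker encoder that turns a few seconds of audio into a fixed-size vector, a synthesizer conditioned on that vector, and a vocoder that produces the waveform. The sketch below only illustrates how those stages connect. Every function here (`embed_speaker`, `synthesize`, `vocode`, `clone_voice`) is a hypothetical toy stand-in written for this example, not the API of any repository listed on this page, and the arithmetic inside each stage is a placeholder for a trained neural network.

```python
import math
from typing import List


def embed_speaker(waveform: List[float], dim: int = 4) -> List[float]:
    """Stage 1 (toy): map audio of any length to a fixed-size unit vector."""
    chunk = max(1, len(waveform) // dim)
    feats = []
    for i in range(dim):
        seg = waveform[i * chunk:(i + 1) * chunk]
        # Mean absolute amplitude of each chunk stands in for learned features.
        feats.append(sum(abs(x) for x in seg) / len(seg) if seg else 0.0)
    norm = math.sqrt(sum(f * f for f in feats)) or 1.0
    return [f / norm for f in feats]


def synthesize(text: str, embedding: List[float]) -> List[List[float]]:
    """Stage 2 (toy): one pseudo-spectrogram frame per character,
    shifted by the speaker embedding to mimic conditioning."""
    frames = []
    for ch in text:
        base = (ord(ch) % 32) / 32.0
        frames.append([base + e for e in embedding])
    return frames


def vocode(frames: List[List[float]], samples_per_frame: int = 8) -> List[float]:
    """Stage 3 (toy): render each frame as one sine cycle scaled by its mean."""
    wave = []
    for frame in frames:
        amp = sum(frame) / len(frame)
        for n in range(samples_per_frame):
            wave.append(amp * math.sin(2 * math.pi * n / samples_per_frame))
    return wave


def clone_voice(reference_audio: List[float], text: str) -> List[float]:
    """Chain the three stages: encode the speaker, synthesize, vocode."""
    embedding = embed_speaker(reference_audio)
    return vocode(synthesize(text, embedding))


# Example: a synthetic "reference recording" and a short sentence.
reference = [math.sin(0.01 * n) for n in range(16000)]
audio = clone_voice(reference, "hello")
print(len(audio), "samples generated")
```

The point of the structure is that the embedding is computed once from the reference audio and then reused for any text, which is why the speaker encoder can be trained separately from the synthesizer and vocoder.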