vc

This is an extension of standard VITS-based voice conversion (VC).

The codebase is based on QuickVC with several modifications:

  1. TPRLS GAN loss (from StyleTTS2); see the sketch after this list
  2. Multi-resolution spectral GAN discriminator (as in UnivNet/Vocos/StyleTTS2)
  3. ContentVec features instead of HuBERT
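
For reference, a minimal sketch of the TPRLS (truncated pointwise relativistic least-squares) loss, following the StyleTTS2 implementation; the threshold tau=0.04 and the median-based truncation are assumptions carried over from that codebase:

    import torch
    import torch.nn.functional as F

    def discriminator_tprls_loss(dr, dg, tau=0.04):
        # dr: discriminator outputs on real audio, dg: on generated audio.
        # Relativistic LSGAN term computed pointwise, truncated at tau so
        # points the discriminator already separates well stop contributing.
        m_dg = torch.median(dr - dg)
        l_rel = torch.mean(((dr - dg - m_dg) ** 2)[dr < dg + m_dg])
        return tau - F.relu(tau - l_rel)

    def generator_tprls_loss(dr, dg, tau=0.04):
        # Same truncated term with the roles of real and fake swapped.
        return discriminator_tprls_loss(dg, dr, tau=tau)

In StyleTTS2 these terms are added on top of the usual adversarial objectives rather than replacing them.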

Pretrained model

A pretrained model is available on Hugging Face:

https://huggingface.co/alphacep/vosk-vc-ru
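
You can fetch the checkpoint with the huggingface_hub client; only the repo id below comes from this README, and the files you get are whatever the release contains:

    from huggingface_hub import snapshot_download

    # Download all files from the model repo into the local cache
    # and return the path to the downloaded snapshot.
    local_dir = snapshot_download("alphacep/vosk-vc-ru")
    print(local_dir)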

Results

On a Russian dataset we measure speaker similarity with Resemblyzer:

Model                                  Average similarity  Min similarity
Our
  Original QuickVC (trained on VCTK)   0.667               0.477
  Trained on Russian data              0.836               0.692
  With ContentVec                      0.880               0.712
Others
  OpenVoice EN                         0.800               0.653
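
Similarity here is presumably the cosine similarity between Resemblyzer speaker embeddings, which is the standard way that library is used. A minimal sketch of how one such number can be computed (file names are placeholders; how the reported figures are averaged is not specified here):

    import numpy as np
    from resemblyzer import VoiceEncoder, preprocess_wav

    encoder = VoiceEncoder()

    # Embed the target speaker's reference audio and the converted output.
    target = encoder.embed_utterance(preprocess_wav("target_reference.wav"))
    converted = encoder.embed_utterance(preprocess_wav("converted_output.wav"))

    # Resemblyzer embeddings are L2-normalized, so the dot product
    # is the cosine similarity.
    print(float(np.dot(target, converted)))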

TODO

  • Test other VC methods (XTTS, GPT-SoVITS, RVC, UnitSpeech)
  • Collect a wideband dataset (the current one is 16 kHz)
  • Add a better speaker and style encoder (3D-Speaker, OpenVoice)

Inference with the pretrained model

python convert.py

Edit convert.txt to select the source utterances and the target speaker.
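
The exact line format is defined by the sample convert.txt shipped with the repo. As a purely hypothetical illustration, FreeVC-derived codebases typically use pipe-separated triples:

    # hypothetical layout, assumed from FreeVC-style convert.txt files:
    # <output_name>|<source_utterance>|<target_speaker_reference>
    sample1|dataset/src/utt001.wav|dataset/ref/speaker2.wav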

Preprocess

python encode.py dataset/VCTK-16K dataset/VCTK-16K
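
The pipeline expects 16 kHz audio (both the directory name and the TODO above point to this). If your recordings use a different sample rate, a minimal torchaudio resampling sketch (the paths are placeholders):

    import torchaudio

    # Load an utterance and resample it to the 16 kHz expected by encode.py.
    wav, sr = torchaudio.load("input.wav")
    if sr != 16000:
        wav = torchaudio.functional.resample(wav, orig_freq=sr, new_freq=16000)
    torchaudio.save("dataset/VCTK-16K/input.wav", wav, 16000)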

Train

python train.py

References

  • Initial approach: QuickVC
  • Better content/speaker decomposition: ContentVec
  • Fast MB-iSTFT decoder for VITS: MS-iSTFT-VITS
  • HuBERT-soft: Soft-VC
  • Data augmentation (not implemented here): FreeVC
  • TPRLS GAN loss: StyleTTS2 (paper)
  • Multi-resolution spectral discriminator: UnivNet