Check out the pretrain branch and follow the steps below.
Follow https://github.com/facebookresearch/fairseq/tree/main and install fairseq (version 0.12.0).
git clone -b pretrain https://github.com/jingru-lin/selective_hubert.git
Move criterion/shubert_criterion under fairseq/criterions, or create a symbolic link to it there.
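The symlink step above can be sketched as follows. The paths here are temporary stand-ins created only for illustration; substitute your actual fairseq and selective_hubert checkouts.

```python
import os
import tempfile

# Stand-in directories; replace these with your real fairseq and
# selective_hubert checkouts.
root = tempfile.mkdtemp()
fairseq_criterions = os.path.join(root, "fairseq", "fairseq", "criterions")
shubert_criterion = os.path.join(
    root, "selective_hubert", "criterion", "shubert_criterion.py"
)
os.makedirs(fairseq_criterions)
os.makedirs(os.path.dirname(shubert_criterion))
open(shubert_criterion, "w").close()  # placeholder for the real criterion file

# Symlink the criterion into fairseq's criterions package so the
# criterion registry can import it at training time.
link = os.path.join(fairseq_criterions, "shubert_criterion.py")
os.symlink(shubert_criterion, link)
```

A plain copy works too; a symlink just keeps the file tracked in the selective_hubert checkout.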
Follow fairseq hubert documentation to produce hubert tsv files and kmeans labels: https://github.com/facebookresearch/fairseq/tree/main/examples/hubert
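For reference, the HuBERT manifests are plain TSV files: the first line is the audio root directory, and each subsequent line is a relative audio path and its sample count; the matching .km label file has one space-separated k-means unit sequence per manifest row. A minimal sketch (file names, paths, and unit values are illustrative; real manifests come from the fairseq scripts linked above):

```python
from pathlib import Path

def write_manifest(tsv_path, audio_root, utterances):
    """Write a fairseq-style TSV manifest.
    utterances: iterable of (relative_audio_path, num_samples)."""
    lines = [str(audio_root)]
    lines += [f"{path}\t{num_samples}" for path, num_samples in utterances]
    Path(tsv_path).write_text("\n".join(lines) + "\n")

# Illustrative entries only.
write_manifest(
    "train.tsv",
    "/data/LibriSpeech",
    [
        ("103/1240/103-1240-0000.flac", 225360),
        ("103/1240/103-1240-0001.flac", 255120),
    ],
)

# One k-means unit sequence per utterance, in manifest order
# (dummy units here; real ones come from the fairseq k-means step).
Path("train.km").write_text("71 71 12 12 5 5\n5 5 88 88 3 3\n")
```

task.data should point at the directory holding the .tsv files and task.label_dir at the directory holding the .km files.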
Follow https://github.com/alibaba-damo-academy/3D-Speaker to extract speaker embeddings. Use model_id=iic/speech_campplus_sv_zh-cn_16k-common.
If you are using LibriSpeech, you can use the provided speaker embeddings.
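The exact on-disk layout expected by task.spk_embedding_dir is not documented here; one plausible sketch, assuming one fixed-dimension vector per utterance saved as utterance_id.npy (the dimension 192 below is a placeholder, and the naming convention should be checked against the repo's dataset code), would be:

```python
import numpy as np
from pathlib import Path

def save_speaker_embedding(out_dir, utt_id, embedding):
    """Save one embedding per utterance as <utt_id>.npy.
    This layout and naming are assumptions; verify against the
    selective_hubert dataset code before extracting at scale."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    np.save(out / f"{utt_id}.npy", np.asarray(embedding, dtype=np.float32))

# Dummy vector standing in for a CAM++ embedding from 3D-Speaker;
# 192 is a placeholder dimension, not a confirmed value.
save_speaker_embedding("cam_embeddings", "103-1240-0000", np.zeros(192))
```

Point task.spk_embedding_dir at the resulting directory (cam_embeddings above).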
To pretrain the model, use the provided config, which contains the default hyperparameters. Change the following paths in pretrain.sh and run it:
--config-dir /dir/to/selective_hubert/config
--config-name shubert_base_librispeech.yaml
task.data=/dir/to/data
task.label_dir=/dir/to/label
common.user_dir=/dir/to/selective_hubert
model.pretrained_ckpt_path=/dir/to/pretrained/hubert_base_ls960.pt
task.spk_embedding_dir=/dir/to/extracted/cam_embeddings
Download the pre-trained models here. Refer to inference.py for how to extract representations.