Use xvector as segmentation model's input? #885

xIaott-s · 2022-02-09T10:35:11Z

xIaott-s
Feb 9, 2022

I was wondering is that possible to use xvector instead of raw audio as the segmentation model's input?
This may make the model more robust?

hbredin · 2022-02-11T08:46:01Z

hbredin
Feb 11, 2022
Maintainer

This would need a bit of work on your side but that is possible, yes.

Create a new model architecture in lieu of PyanNet that, for instance, replaces its SincNet layer by a pretrained XVector extractor
Train this new model
Profit!

1 reply

xIaott-s Feb 14, 2022
Author

Thanks，I will try and let you know the result.

xIaott-s · 2022-04-29T03:25:38Z

xIaott-s
Apr 29, 2022
Author

I use a frame level xvetor embedding instead of SincNet. In my speaker change detection testset, it works better.
But I have a question that why do you choose the training sample from audios randomly in every epoch?

1 reply

hbredin May 11, 2022
Maintainer

If possible, would you mind sharing the code of this new architecture? It could be useful for other segmentation tasks as well.

Regarding random sampling, what is the alternative? Using a sliding window over the whole training set? That would always generate the same exact chunks (start and end times) over and over again.

The idea of random sampling is to maximize variability of training samples -- you may see this as a kind of (weak) data augmentation.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use xvector as segmentation model's input? #885

{{title}}

Replies: 2 comments 2 replies

{{title}}

{{editor}}'s edit

{{editor}}'s edit

{{title}}

{{title}}

{{title}}

{{editor}}'s edit

{{editor}}'s edit

Select a reply

Use xvector as segmentation model's input? #885

xIaott-s Feb 9, 2022

Replies: 2 comments · 2 replies

hbredin Feb 11, 2022 Maintainer

xIaott-s Feb 14, 2022 Author

xIaott-s Apr 29, 2022 Author

hbredin May 11, 2022 Maintainer

xIaott-s
Feb 9, 2022

Replies: 2 comments 2 replies

hbredin
Feb 11, 2022
Maintainer

xIaott-s Feb 14, 2022
Author

xIaott-s
Apr 29, 2022
Author

hbredin May 11, 2022
Maintainer