Feat/joint diarization and embedding with prepared data #1583

clement-pages · 2023-12-08T13:11:33Z

No description provided.

- fixes the dimension error between the file ID and probability arrays
- changes how chunks for the embedding task are sampled
- creates two functions to draw chunks, one for each subtask

Tests are required to ensure that there are no bugs.

hbredin · 2024-07-08T19:51:39Z

I just pushed a (possibly buggy) pipeline that seems to work with a joint model:

from pyannote.audio.pipelines.speaker_diarization import SpeakerDiarizationV2
import torch

device = torch.device('cuda')
pipeline = SpeakerDiarizationV2('/path/to/joint.ckpt', batch_size=1, step=0.2).to(device)

# parameters obviously need to be optimized
pipeline.instantiate({'clustering': {'threshold': 0.75, 'method': 'centroid', 'min_cluster_size': 1}})

diarization = pipeline('/path/to/audio.wav')

chai3 and others added 30 commits June 8, 2023 08:42

fix: raise TypeError on wrong device type in Pipeline.to and Inferenc…

0551070

…e.to Fixes 1397

feat(task): add support for multi-task models (pyannote#1374)

30ddb0b

BREAKING(model): get rid of (flaky) `Model.introspection`

fix(inference): fix multi-task inference

4eb7190

feat: update FAQtory default answer

dcdfc15

add draft version of the joint diarization and embedding tasks

87f49f9

Merge branch 'develop' of github.com:clement-pages/pyannote-audio int…

6025a80

…o feat/joint-diarization-and-embedding

fix StopIteration error

04de82f

add missing collate methods

d8cb598

For now this is a copy-paste of methods from the segmentation task.

remove support for non-powerset mode

d2d6e14

remove computing of vad loss

e58943b

as computing this loss probably does not make sense in powerset mode, because the first class (empty set of labels) already does exactly this
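To illustrate the reasoning in this commit message: in powerset mode, each class is a set of simultaneously active speakers, and the first class is the empty set (no speech). The probability of speech can therefore be read directly from the first class, which is presumably why a separate VAD loss is redundant. The following is a minimal sketch and not the pyannote.audio implementation; `powerset_classes` and `speech_probability` are hypothetical helpers, and the example probabilities are made up.

```python
from itertools import combinations


def powerset_classes(num_speakers, max_simultaneous):
    """Enumerate powerset classes, starting with the empty set (non-speech)."""
    classes = []
    for size in range(max_simultaneous + 1):
        classes.extend(combinations(range(num_speakers), size))
    return classes


def speech_probability(class_probs):
    """Probability that at least one speaker is active in a frame.

    Assumes class_probs[0] is the probability of the empty set.
    """
    return 1.0 - class_probs[0]


# 3 speakers, at most 2 active at once: 1 + 3 + 3 = 7 classes
classes = powerset_classes(num_speakers=3, max_simultaneous=2)

# Made-up per-class probabilities for a single frame (they sum to 1)
frame_probs = [0.7, 0.1, 0.1, 0.05, 0.02, 0.02, 0.01]
p_speech = speech_probability(frame_probs)
```

Here `classes[0]` is the empty tuple, so `p_speech` is simply one minus the probability of that class.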

remove unused imports

bc989cd

fix probabilities do not sum to 1 error

b4d0a78

attempt to fix file duration error

78718b1

attempt to fix negative start_time in embedding part

dfdd8f3

add end-to-end diarization and embedding model

1888360

update end-to-end model

6216d1f

clean multi-task source code

b42cc33

remove support for SegmentationProtocol in the multi-tasks

3d295dd

improve(test): use pyannote.database.registry (pyannote#1413)

3363be6

Set alpha coefficient as attribute

99a7762

remove diarization_database_files attribute

f2a4e34

as this instance attribute was not used

feat(pipeline): add return_embeddings option to `SpeakerDiarization…

017c910

…` pipeline Co-authored-by: Hervé BREDIN <[email protected]>

fix: fix missed speech at the very beginning/end

cf0e3b3

add losses computation in training_step method

f48b74f

doc: add note to self regarding cluster reassignment (pyannote#1419)

f393546

remove for loops in embedding loss computation

5718593

as these loops could break gradient flow, and to optimize the code

add validation part into the multi-task

8036572

remove subtask parameter from prepare_chunk

aa36d7b

fix bugs in validation part

6617c9c

for now do the trick only for the diarization subtask

clement-pages added 4 commits December 8, 2023 13:47

update: change name of attribute database_ratio to dia_task_rate

e7da160

wip: attempt to fix issues encountered during training

77ac89f

update: use all of the pretrained PyanNet model

ea6d06d

fix: fix diarization loss calculation condition in training_step

185798d

hbredin mentioned this pull request Jan 23, 2024

wip: support for joint diarization and embedding #1409

Closed

clement-pages and others added 15 commits May 14, 2024 09:10

Merge branch 'develop' into feat/joint-diarization-and-embedding-with…

3fef4f5

…-prepared-data

update joint task with last modifications on data preparation

9d13697

update the way batches are generated in the joint task

6c67fc6

Now, the first `num_dia_samples` samples in a batch are dedicated to the diarization subtask, and the remaining samples are for the embedding subtask
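The batch layout described in this commit message can be sketched as follows. This is only an illustration of the slicing convention, not the code from the PR; `split_batch` is a hypothetical helper and the chunk names are placeholders.

```python
def split_batch(samples, num_dia_samples):
    """Split a batch into (diarization samples, embedding samples).

    The first `num_dia_samples` entries go to the diarization subtask,
    the rest to the embedding subtask.
    """
    if not 0 <= num_dia_samples <= len(samples):
        raise ValueError("num_dia_samples must be between 0 and the batch size")
    return samples[:num_dia_samples], samples[num_dia_samples:]


# Placeholder batch of six chunks
batch = ["chunk0", "chunk1", "chunk2", "chunk3", "chunk4", "chunk5"]
dia_samples, emb_samples = split_batch(batch, num_dia_samples=4)
```

With a fixed split point, each subtask loss can be computed on a contiguous slice of the batch without per-sample bookkeeping.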

fix random generators

519db89

delete remaining call to example_output

106bfc5

update joint task training_step

d3326b1

... and fix some bugs

fix(task): fix wrong call to receptive_field in prepare_chunk

a36420d

Merge branch 'develop' into feat/joint-diarization-and-embedding-with…

101f1d3

…-prepared-data

update(joint task): filter out inactive speaker embeddings from loss …

62fad78

…computation
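One way to picture the filtering mentioned in this commit title is a per-speaker mask applied before the embedding loss. The sketch below assumes that "inactive" means a speaker with no active frame in the chunk; the actual criterion used in the PR is not shown on this page. `active_speaker_mask` and `filter_embeddings` are hypothetical names.

```python
def active_speaker_mask(activity):
    """Return one boolean per speaker: True if active in at least one frame.

    activity[s][t] is 1 when speaker s is active at frame t, else 0.
    """
    return [any(frames) for frames in activity]


def filter_embeddings(embeddings, activity):
    """Keep only the embeddings of speakers that are active in the chunk."""
    mask = active_speaker_mask(activity)
    return [emb for emb, keep in zip(embeddings, mask) if keep]


# Three speaker slots, four frames; the second speaker never speaks
activity = [
    [1, 1, 0, 0],
    [0, 0, 0, 0],
    [0, 0, 1, 1],
]
embeddings = [[0.1, 0.2], [0.0, 0.0], [0.3, 0.4]]
kept = filter_embeddings(embeddings, activity)
```

Only the embeddings in `kept` would then contribute to the embedding loss.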

allow to only compute mean or std in StatsPool

8349818
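As a rough illustration of what this commit title describes, statistics pooling collapses a sequence of frame-level features into per-feature statistics, and either statistic can be skipped. The sketch below is plain Python rather than the `StatsPool` module itself; the function name `stats_pool` and the flags `compute_mean` and `compute_std` are assumptions made for this example.

```python
from statistics import mean, stdev


def stats_pool(frames, compute_mean=True, compute_std=True):
    """Pool frame-level features into per-feature mean and/or std.

    frames is a list of feature vectors (one per frame). The result is the
    concatenation of the requested statistics, mean first.
    """
    if not (compute_mean or compute_std):
        raise ValueError("at least one of compute_mean / compute_std must be True")
    features = list(zip(*frames))  # one tuple of values per feature dimension
    pooled = []
    if compute_mean:
        pooled.extend(mean(values) for values in features)
    if compute_std:
        # sample standard deviation; needs at least two frames
        pooled.extend(stdev(values) for values in features)
    return pooled


frames = [[1.0, 2.0], [3.0, 6.0]]
mean_only = stats_pool(frames, compute_std=False)
both = stats_pool(frames)
```

Requesting a single statistic halves the size of the pooled vector, which is the practical effect of such an option.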

update diarization + embeddings joint task

0858227

wip: update joint model

ad9e435

Merge branch 'develop' into feat/joint-diarization-and-embedding-with…

aeb147f

…-prepared-data

Merge branch 'pyannote:develop' into feat/joint-diarization-and-embed…

f484033

…ding-with-prepared-data

wip: add pipeline working with joint model

8608a1c

clement-pages and others added 9 commits October 18, 2024 09:01

Merge branch 'develop' into feat/joint-diarization-and-embedding-with…

1132cfc

…-prepared-data

Merge branch 'develop' into feat/joint-diarization-and-embedding-with…

446c17c

…-prepared-data

Merge branch 'feat/joint-diarization-and-embedding-with-prepared-data…

e6a00b9

…' of https://github.com/clement-pages/pyannote-audio into feat/joint-diarization-and-embedding-with-prepared-data

wip: add validation pipeline

b91df8c

clean validation pipeline code

5e54108

handle overlapped segmentation chunks corner case

9b8e509

add some comments

7708935

replace pyannote.metrics DER by pyannote.audio.torchmetrics one

2edebc4

This is done to use the same metrics as the other pyannote tasks, and to benefit from the advantages of Lightning (parallelization, etc.)
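For reference, the diarization error rate (DER) is the sum of false alarm, missed detection, and speaker confusion durations, divided by the total duration of reference speech. The sketch below only spells out that formula; it is neither the pyannote.metrics nor the pyannote.audio.torchmetrics implementation, and the durations are made-up values in seconds.

```python
def diarization_error_rate(false_alarm, missed_detection, speaker_confusion, total):
    """Compute DER from its components (all durations in the same unit)."""
    if total <= 0:
        raise ValueError("total reference speech duration must be positive")
    return (false_alarm + missed_detection + speaker_confusion) / total


# Made-up component durations, in seconds
der = diarization_error_rate(
    false_alarm=2.0,
    missed_detection=3.0,
    speaker_confusion=5.0,
    total=100.0,
)
```

Because the components add up, they can be accumulated separately across batches or devices and combined at the end, which fits the parallelization benefit mentioned in the commit message.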

update joint pipeline

76d4ec9
