Replies: 1 comment 3 replies
-
Did you read this? This should answer most of your questions about this model. |
Beta Was this translation helpful? Give feedback.
3 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
pyannote for speaker diarization based on the following segmentation model:
End-to-end speaker segmentation for overlap-aware resegmentation
In the above paper they wrote, under the Implementation details:
[i.e: 5s audio chunks with a sampling rate of 16kHz]
K max -dimensional speaker activations between 0 and 1 every 16ms.
How to conclude if we have a new segment or number of segments in each output ? number of speaker in output ? (example will be very helpful)
Beta Was this translation helpful? Give feedback.
All reactions