Can pyannote-audio be set to distinguish the number of people？ #1742

Erwen222 · 2024-07-19T06:53:34Z

Tested versions

3.1

System information

Ubuntu 16

Issue description

Sometimes there will be over-segmentation, two people's audio is divided into five people, the longer the audio, the more likely this phenomenon will occur.

Minimal reproduction example (MRE)

wu

FrenchKrab · 2024-07-19T07:34:15Z

The pyannote-audio pipeline automatically estimates the number of speakers
You can tune the clustering threshold of the pipeline so that new clusters (= speaker identities) are not created as easily.
If this only appear on long audio files the clustering must be at fault.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Can pyannote-audio be set to distinguish the number of people？ #1742

Can pyannote-audio be set to distinguish the number of people？ #1742

Erwen222 commented Jul 19, 2024

FrenchKrab commented Jul 19, 2024

Can pyannote-audio be set to distinguish the number of people？ #1742

Can pyannote-audio be set to distinguish the number of people？ #1742

Comments

Erwen222 commented Jul 19, 2024

Tested versions

System information

Issue description

Minimal reproduction example (MRE)

FrenchKrab commented Jul 19, 2024