fix: peak-normalize separated sources #1730

joonaskalda · 2024-06-21T15:25:30Z

Since the separation loss used in PixIT is scale-invariant, the separated sources might be scaled up/down massively. This PR adds peak-normalization to the speech separation pipeline to deal with this problem.

Fixes #1729

AllanMisasa

This helped unveil peaks of the separated audio.

clement-pages

After testing, LGTM

joonaskalda and others added 2 commits June 21, 2024 11:18

fix: peak-normalize separated sources

a766f2a

Merge branch 'develop' into pixit-normalize-sources

9972a91

simonSlamka approved these changes Jul 27, 2024

View reviewed changes

AllanMisasa approved these changes Jul 27, 2024

View reviewed changes

clement-pages approved these changes Oct 7, 2024

View reviewed changes

clement-pages mentioned this pull request Oct 7, 2024

Speech Separation cracking the volume too high #1770

Closed

hbredin added 2 commits October 7, 2024 15:13

Merge branch 'develop' into pixit-normalize-sources

6726b2b

doc: update changelog

0583adb

hbredin merged commit bd62a89 into pyannote:develop Oct 8, 2024
4 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: peak-normalize separated sources #1730

fix: peak-normalize separated sources #1730

joonaskalda commented Jun 21, 2024

AllanMisasa left a comment

clement-pages left a comment

fix: peak-normalize separated sources #1730

fix: peak-normalize separated sources #1730

Conversation

joonaskalda commented Jun 21, 2024

AllanMisasa left a comment

Choose a reason for hiding this comment

clement-pages left a comment

Choose a reason for hiding this comment