[src] Multichannel models #427

mpariente · 2021-02-02T20:50:18Z

Started implementing what I suggested in #420.

I'll highlight the places where I doubt in the PR.

Funny things I found along the way:

It's hard to make a model that doesn't take the sample rate as argument

class Model(BaseModel):
    def __init__(self):
        super().__init__(sample_rate=44100)
        
    def forward(self, x):
        return x

    def get_model_args(self):
        return {}

This can be serialized, but not loaded because sample_rate is needed in the model conf to load the model.

OTOH:

class Model(BaseModel):
    def __init__(self):
        super().__init__(sample_rate=44100)

    def forward(self, x):
        return x

    def get_model_args(self):
        return {"sample_rate": self.sample_rate}

This fails at loading, because it tries to pass sample_rate as kwargs, but cannot.
Should we do something about this?

mpariente · 2021-02-02T21:05:50Z

asteroid/models/base_models.py

        super().__init__()
        self.__sample_rate = sample_rate
+        self.n_channels = n_channels


I don't think this needs to be a property as the sample_rate but we could do it.

why not ? some models are tied to the number of channels and actually to the array topology

For sample_rate, we made it like that because the model holds reference to the sample_rate, but the filterbank as well, so we wanted the raise the warning when setting it.

But for the number of channels, for now nothing holds reference to it.
If we see it's a limitation in the future we can always write a setter/getter.

Asteroid, as Python, is for consenting adults 😉

mpariente · 2021-02-02T21:08:22Z

asteroid/separate.py

+    if model.n_channels is not None and wav.shape[-2] != model.n_channels:
+        raise RuntimeError(
+            f"Model supports {model.n_channels}-channel inputs but found audio with {wav.shape[-2]} channels."
+            f"Please match the number of channels."
+        )


Should we add a flag to ignore that, if passed, make us ignore that and take the first channels or something?
Something that would be passed from the CLI to here (--ignore-channels-check).
I'm not sure it's useful.

Not sure...

@popcornell what's your opinion on that?
Maybe we can start by not having it. And if we find it useful later, or there is a user demand, we can change that.

asteroid/separate.py

mpariente · 2021-02-02T21:10:38Z

tests/models/models_test.py

@@ -32,6 +32,24 @@ def test_set_sample_rate_raises_warning():
        model.sample_rate = 16000.0


+def test_multichannel_model_loading():
+    class MCModel(BaseModel):
+        def __init__(self, sample_rate=8000.0, n_channels=2):


As said in the PR. We must have the sample_rate as argument.

If we want fixed number of channels, we don't need it in the __init__, we just super(n_channels=2) and that works.

mpariente · 2021-02-02T21:14:55Z

Another note: there will be several scenarios for LambdaOverlapAdd:

Multichannel input, single output (should work fine)
Multichannel input, multichannel output (not sure it works).

We should add some tests and if the second case doesn't work, raise a useful error.

jonashaag · 2021-02-03T11:40:58Z

This can be serialized, but not loaded because sample_rate is needed in the model conf to load the model.

How about moving the check for missing sample rate to after the model object has been constructed, and then checking using hasattr(model, "sample_rate")? That way you are free to set the sample_rate property however you like as long as it's present.

mpariente · 2021-02-03T12:15:45Z

This can be serialized, but not loaded because sample_rate is needed in the model conf to load the model.

How about moving the check for missing sample rate to after the model object has been constructed, and then checking using hasattr(model, "sample_rate")? That way you are free to set the sample_rate property however you like as long as it's present.

I thought about that.
That seems fine to me, let's do that.

mpariente · 2021-02-03T20:36:36Z

How about moving the check for missing sample rate to after the model object has been constructed, and then checking using hasattr(model, "sample_rate")? That way you are free to set the sample_rate property however you like as long as it's present.

sample_rate has a default value, so actually this won't work because the sample_rate property will be there.

jonashaag · 2021-02-05T12:49:19Z

How about we drop it? It's backwards incompatible, but loading models will still work, and it's easy to fix for people. Maybe it's time we don't default to 8 kHz anymore now that people are using Asteroid for other things than traditional 8 kHz speech separation

mpariente · 2021-02-05T13:06:40Z

Well, I'd be ok with that!

mpariente · 2021-02-06T21:09:41Z

This is enough for this PR.
After merging this, I'll create a PR to remove the default on the sample rate.

mpariente added 8 commits February 2, 2021 21:41

Add n_channels to BaseModel

9cd5fe5

Add commented problematic places

6c43a9a

Load/save test for multichannel models

2c2c356

Add n_channels to Separatable

7d6c923

Remove the comments on loading

c1aae1b

Multichannel in separate

49179cd

Edit BaseModel docstrings + add Optional on n_channels

a9659fc

Optional on n_channels in separate.py

f3f4d46

mpariente commented Feb 2, 2021

View reviewed changes

mpariente added 2 commits February 2, 2021 22:10

Update asteroid/separate.py

46cb293

Add n_channels to LambdaOverlapAdd

035c234

mpariente mentioned this pull request Feb 3, 2021

[Design] Multichannel model support #420

Closed

mpariente merged commit 40bba0d into master Feb 6, 2021

mpariente deleted the multichannel branch February 6, 2021 21:10

mpariente mentioned this pull request Feb 6, 2021

[src] Make sample rate positional #431

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[src] Multichannel models #427

[src] Multichannel models #427

mpariente commented Feb 2, 2021

mpariente Feb 2, 2021

popcornell Feb 2, 2021

mpariente Feb 3, 2021

mpariente Feb 2, 2021

jonashaag Feb 3, 2021

mpariente Feb 3, 2021

mpariente Feb 2, 2021

mpariente commented Feb 2, 2021

jonashaag commented Feb 3, 2021 •

edited

Loading

mpariente commented Feb 3, 2021

mpariente commented Feb 3, 2021

jonashaag commented Feb 5, 2021 •

edited

Loading

mpariente commented Feb 5, 2021

mpariente commented Feb 6, 2021

[src] Multichannel models #427

[src] Multichannel models #427

Conversation

mpariente commented Feb 2, 2021

mpariente Feb 2, 2021

Choose a reason for hiding this comment

popcornell Feb 2, 2021

Choose a reason for hiding this comment

mpariente Feb 3, 2021

Choose a reason for hiding this comment

mpariente Feb 2, 2021

Choose a reason for hiding this comment

jonashaag Feb 3, 2021

Choose a reason for hiding this comment

mpariente Feb 3, 2021

Choose a reason for hiding this comment

mpariente Feb 2, 2021

Choose a reason for hiding this comment

mpariente commented Feb 2, 2021

jonashaag commented Feb 3, 2021 • edited Loading

mpariente commented Feb 3, 2021

mpariente commented Feb 3, 2021

jonashaag commented Feb 5, 2021 • edited Loading

mpariente commented Feb 5, 2021

mpariente commented Feb 6, 2021

jonashaag commented Feb 3, 2021 •

edited

Loading

jonashaag commented Feb 5, 2021 •

edited

Loading