Add `MelSpectrogram` layer #17717

awsaf49 · 2023-03-24T01:25:33Z

This PR adds the `MelSpectrogram` layer as an `audio_preprocessing` layer, as mentioned in keras-team/tf-keras#55. I have added the backbone of this layer. The layer converts raw audio signals to Mel spectrograms and is compatible with both GPU and TPU.

Need to add some tests to ensure everything is okay.

Todo

- [x] unbatched audio test
- [x] batched audio test
- [x] zero values audio test
- [ ] serialize callable `ref`

cc: @fchollet , @mattdangerw
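
For readers unfamiliar with the transform, the sketch below shows one common way a Mel spectrogram is computed (framing, Hann window, power spectrum, triangular mel filterbank) and exercises the three test cases from the Todo list. It is a pure-Python illustration, not the layer in this PR: the function names, the `fft_length` and `num_mel_bins` parameters, the default values, and the HTK-style mel formula are all assumptions chosen for the example.

```python
import cmath
import math


def hz_to_mel(freq_hz):
    # HTK-style mel scale (an assumption; other variants exist).
    return 2595.0 * math.log10(1.0 + freq_hz / 700.0)


def mel_to_hz(mel):
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)


def frame_signal(samples, fft_length, fft_stride):
    # Split a 1D signal into overlapping frames, without padding.
    last_start = len(samples) - fft_length
    return [samples[s:s + fft_length] for s in range(0, last_start + 1, fft_stride)]


def power_spectrum(frame):
    # Hann window followed by a naive O(n^2) DFT; fine for tiny inputs only.
    n = len(frame)
    windowed = [x * (0.5 - 0.5 * math.cos(2 * math.pi * i / n)) for i, x in enumerate(frame)]
    bins = []
    for k in range(n // 2 + 1):
        acc = sum(x * cmath.exp(-2j * math.pi * k * i / n) for i, x in enumerate(windowed))
        bins.append(abs(acc) ** 2)
    return bins


def mel_filterbank(num_mel_bins, fft_length, sampling_rate, min_freq, max_freq):
    # Triangular filters whose edges are evenly spaced on the mel scale.
    mel_min, mel_max = hz_to_mel(min_freq), hz_to_mel(max_freq)
    step = (mel_max - mel_min) / (num_mel_bins + 1)
    edges_hz = [mel_to_hz(mel_min + i * step) for i in range(num_mel_bins + 2)]
    bank = []
    for m in range(1, num_mel_bins + 1):
        lo, center, hi = edges_hz[m - 1], edges_hz[m], edges_hz[m + 1]
        row = []
        for k in range(fft_length // 2 + 1):
            freq = k * sampling_rate / fft_length
            rising = (freq - lo) / (center - lo)
            falling = (hi - freq) / (hi - center)
            row.append(max(0.0, min(rising, falling)))
        bank.append(row)
    return bank


def mel_spectrogram(samples, sampling_rate=16000, fft_length=64, fft_stride=32, num_mel_bins=8):
    # Returns a list of frames, each a list of num_mel_bins energies.
    bank = mel_filterbank(num_mel_bins, fft_length, sampling_rate, 0.0, sampling_rate / 2)
    output = []
    for frame in frame_signal(samples, fft_length, fft_stride):
        spectrum = power_spectrum(frame)
        output.append([sum(w * p for w, p in zip(row, spectrum)) for row in bank])
    return output


# The three Todo cases: unbatched input, batched input, all-zero input.
signal = [math.sin(2 * math.pi * 440.0 * i / 16000) for i in range(256)]
unbatched = mel_spectrogram(signal)
batched = [mel_spectrogram(s) for s in [signal, signal]]
silent = mel_spectrogram([0.0] * 256)
print(len(unbatched), len(unbatched[0]))  # 7 frames, 8 mel bins
```

A real layer would use a vectorized FFT and operate on tensors; the point here is only the shape contract (frames by mel bins) and that silent input maps to an all-zero spectrogram.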

awsaf49 · 2023-03-24T01:47:09Z

It turns out linting is failing due to a conflict between flake8 and black. Black reformats some lines so that they exceed the width flake8 allows, and if the code is reformatted by hand to fit that width, black flags it instead.
This is resolved by setting --line-length 80 in black.

fchollet

Thanks for the PR! This is good work. There's a lot of API design work to be done here to make the API as intuitive as possible.

keras/layers/preprocessing/audio_preprocessing.py

awsaf49 · 2023-03-25T00:22:45Z

@fchollet Thanks for your valuable feedback. While I understand your concern about the potential confusion caused by non-intuitive names, I would like to point out that the current argument names are widely used and recognized in the community, as demonstrated by popular libraries such as librosa, torchaudio, and nnAudio. For example,

for librosa

librosa.feature.melspectrogram(y=None, sr=22050, n_fft=2048, n_mels=128, hop_length=512, win_length=None, 
                               window='hann', power=2.0, fmin=0.0, fmax=None)

for torchaudio

torchaudio.transforms.MelSpectrogram(sample_rate=16000, n_fft=400, win_length=None, hop_length=None,
                                     f_min=0.0, f_max=None, n_mels=128, window_fn=<fn>, power=2.0)

for nnaudio,

nnAudio.Spectrogram.MelSpectrogram(sr=22050, n_fft=2048, n_mels=128, hop_length=512,
                                   window='hann', power=2.0, fmin=0.0, fmax=None)

While I am open to alternative names, I believe that changing them could create confusion for users who are already familiar with these naming conventions. Therefore, it seems like a trade-off between renowned names and more intuitive names. Any thoughts on what we should do?

fchollet · 2023-03-26T19:09:30Z

> Therefore, it seems like a trade-off between renowned names and more intuitive names. Any thoughts on what we should do?

We should use more intuitive names.

- There are more people who will use these APIs in the future than there are people using these APIs today. We're doing them a service by adopting better naming conventions.
- If the names are intuitive, then they will be intuitive / easy to understand for people already familiar with the current APIs.
- Keras APIs must be consistent with the Keras API. It would be annoying and surprising if something called "sampling_rate" in several other places of the API was called "sample_rate" here.

awsaf49 · 2023-03-27T08:47:54Z

@fchollet I've updated the names according to your suggestions and replaced the rest with more intuitive names. Let me know if they meet the requirements.

By the way, I just noticed that PR keras-team/keras-hub/pull/847 in Keras-NLP uses sample_rate instead of sampling_rate and stride instead of fft_stride.

fchollet

Thanks for the update -- the API is looking good (just one comment)! Please add unit tests.

awsaf49 · 2023-05-19T11:49:09Z

@fchollet could you please check?

awsaf49 · 2023-05-27T01:10:32Z

@gbaned any update?

awsaf49 · 2023-07-01T03:32:48Z

@gbaned could you approve the workflow for the unit tests?

awsaf49 · 2023-07-07T17:45:14Z

@mihirparadkar Hi, I just noticed that the keras-team-review-pending label has been removed. Any update on this?

fchollet

LGTM

@fchollet

Imported from GitHub PR #17717

This PR will add the `MelSpectrogram` layer as an `audio_preprocessing` layer as mentioned in #17657. I have added the backbone of this layer. This layer will convert raw audio signals to Mel spectrograms. This layer is compatible with both GPU & TPU.

Need to add some tests to ensure everything is okay.

## Todo

- [x] unbatched audio test
- [x] batched audio test
- [x] zero values audio test
- [ ] serialize callable `ref`

cc: @fchollet , @mattdangerw

Copybara import of the project:

-- d1d8175 by Awsaf: Add `MelSpectrogram` layer
-- afa9e88 by Awsaf: Fix for isort
   Imports are incorrectly sorted and/or formatted.
-- ae6d109 by Awsaf: reorder `super.__init__`
-- d4a8daf by Awsaf: Fix output_shape for 1D input
-- 914e75d by Awsaf: Export to only `experimental` layers
-- ba1f18e by Awsaf: Make inline
-- 0fda055 by Awsaf: Remove outline
-- afdf73c by Awsaf: Reformat with black
   --line-length 80
-- adb9477 by Awsaf: Update: docstring
   1. add reference link 2. explanation 3. use case
-- f3e0fe9 by Awsaf: Example added
-- 6181518 by Awsaf: Update arg names
-- 4273fae by Awsaf: test added
-- 865af24 by Awsaf: melspec test added

Merging this change closes #17717

FUTURE_COPYBARA_INTEGRATE_REVIEW=#17717 from awsaf49:melspec 865af24
PiperOrigin-RevId: 547546687

@fchollet

Imported from GitHub PR #17717

This PR will add the `MelSpectrogram` layer as an `audio_preprocessing` layer as mentioned in #17657. I have added the backbone of this layer. This layer will convert raw audio signals to Mel spectrograms. This layer is compatible with both GPU & TPU.

Need to add some tests to ensure everything is okay.

## Todo

- [x] unbatched audio test
- [x] batched audio test
- [x] zero values audio test
- [ ] serialize callable `ref`

cc: @fchollet , @mattdangerw

Copybara import of the project:

-- d1d8175 by Awsaf: Add `MelSpectrogram` layer
-- afa9e88 by Awsaf: Fix for isort
   Imports are incorrectly sorted and/or formatted.
-- ae6d109 by Awsaf: reorder `super.__init__`
-- d4a8daf by Awsaf: Fix output_shape for 1D input
-- 914e75d by Awsaf: Export to only `experimental` layers
-- ba1f18e by Awsaf: Make inline
-- 0fda055 by Awsaf: Remove outline
-- afdf73c by Awsaf: Reformat with black
   --line-length 80
-- adb9477 by Awsaf: Update: docstring
   1. add reference link 2. explanation 3. use case
-- f3e0fe9 by Awsaf: Example added
-- 6181518 by Awsaf: Update arg names
-- 4273fae by Awsaf: test added
-- 865af24 by Awsaf: melspec test added

Merging this change closes #17717

FUTURE_COPYBARA_INTEGRATE_REVIEW=#17717 from awsaf49:melspec 865af24
PiperOrigin-RevId: 548211937

awsaf49 · 2023-09-15T17:43:38Z

Currently the PR is on hold due to the following error:

/keras/distribute/ctl_correctness_test.runfiles/org_keras/keras/metrics/confusion_metrics.py", line 22, in <module>
    from keras import activations
  File "/home/kbuilder/.cache/bazel/_bazel_kbuilder/31d6f47147b75c35404d734345be7323/execroot/org_keras/bazel-out/k8-opt/bin/keras/distribute/ctl_correctness_test.runfiles/org_keras/keras/activations.py", line 22, in <module>
    import keras.layers.activation as activation_layers
ImportError: cannot import name 'layers' from partially initialized module 'keras' (most likely due to a circular import) (/home/kbuilder/.cache/bazel/_bazel_kbuilder/31d6f47147b75c35404d734345be7323/execroot/org_keras/bazel-out/k8-opt/bin/keras/distribute/ctl_correctness_test.runfiles/org_keras/keras/__init__.py)

I can't seem to find a fix for it. It relates to a circular import in keras, but the code is written with a similar structure to the image_processing layer, which works just fine. Any suggestions would be really helpful.
@fchollet @mattdangerw
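
The error above is Python's generic symptom of a circular import: a module is asked for a name before its own top-level code has finished running. The sketch below reproduces that symptom with a throwaway package and shows one common workaround, deferring the import into the function that needs it. The package and module names (`cyclic_demo`, `lazy_demo`, `activations`, `layers`, `apply_relu`) are hypothetical and do not mirror the real Keras module graph, so this illustrates the mechanism rather than diagnosing this PR's failure.

```python
import importlib
import os
import sys
import tempfile

# activations.py imports layers at the top, before relu is defined.
ACTIVATIONS = (
    "from {pkg} import layers\n"
    "\n"
    "def relu(x):\n"
    "    return max(0.0, x)\n"
)
# Cyclic variant: layers.py needs relu at import time, closing the loop.
CYCLIC_LAYERS = (
    "from {pkg}.activations import relu\n"
    "\n"
    "def apply_relu(x):\n"
    "    return relu(x)\n"
)
# Lazy variant: the import runs only when apply_relu is called.
LAZY_LAYERS = (
    "def apply_relu(x):\n"
    "    from {pkg}.activations import relu\n"
    "    return relu(x)\n"
)


def build_package(root, pkg, layers_source):
    # Write a three-file package (hypothetical layout) under root.
    pkg_dir = os.path.join(root, pkg)
    os.makedirs(pkg_dir)
    files = {"__init__.py": "", "activations.py": ACTIVATIONS, "layers.py": layers_source}
    for name, source in files.items():
        with open(os.path.join(pkg_dir, name), "w") as handle:
            handle.write(source.format(pkg=pkg))


def try_import(pkg, layers_source):
    # Import pkg.activations from a temporary directory and report the outcome.
    with tempfile.TemporaryDirectory() as root:
        build_package(root, pkg, layers_source)
        sys.path.insert(0, root)
        importlib.invalidate_caches()
        try:
            module = importlib.import_module(pkg + ".activations")
            return "ok: relu(-1.0) = {}".format(module.layers.apply_relu(-1.0))
        except ImportError as error:
            return "ImportError: {}".format(error)
        finally:
            # Leave no trace so the function can be called repeatedly.
            sys.path.remove(root)
            for name in [n for n in sys.modules if n == pkg or n.startswith(pkg + ".")]:
                del sys.modules[name]


cyclic_result = try_import("cyclic_demo", CYCLIC_LAYERS)
lazy_result = try_import("lazy_demo", LAZY_LAYERS)
print(cyclic_result)  # an ImportError naming 'relu'
print(lazy_result)    # ok: relu(-1.0) = 0.0
```

Deferring an import is only one option; reordering top-level imports or moving shared code into a module that neither side imports from can also break such a cycle.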

sachinprasadhs · 2023-09-19T16:30:11Z

Hello, Thank you for submitting a pull request.

We're currently in the process of migrating the new Keras 3 code base from keras-team/keras-core to keras-team/keras.
Consequently, merging this PR is not possible at the moment. After the migration is successfully completed, feel free to reopen this PR at keras-team/keras if you believe it remains relevant to the Keras 3 code base. If instead this PR fixes a bug or security issue in legacy tf.keras, you can instead reopen the PR at keras-team/tf-keras, which hosts the TensorFlow-only, legacy version of Keras.

awsaf49 added 2 commits March 24, 2023 07:14

Add MelSpectrogram layer

d1d8175

Merge branch 'keras-team:master' into melspec

826ae7f

google-ml-butler bot added the size:M label Mar 24, 2023

google-ml-butler bot assigned gbaned Mar 24, 2023

awsaf49 added 3 commits March 24, 2023 07:31

Fix for isort

afa9e88

Imports are incorrectly sorted and/or formatted.

Merge remote-tracking branch 'origin/melspec' into melspec

36527b0

reorder super.__init__

ae6d109

Fix output_shape for 1D input

d4a8daf

fchollet reviewed Mar 24, 2023

awsaf49 added 6 commits March 25, 2023 19:47

Export to only experimental layers

914e75d

Make inline

ba1f18e

Remove outline

0fda055

Reformat with black

afdf73c

--line-length 80

Update: docstring

adb9477

1. add reference link 2. explanation 3. use case

Example added

f3e0fe9

awsaf49 requested a review from fchollet March 26, 2023 09:23

google-ml-butler bot added the keras-team-review-pending Pending review by a Keras team member. label Mar 26, 2023

Update arg names

6181518

gbaned removed the keras-team-review-pending Pending review by a Keras team member. label Mar 28, 2023

test added

4273fae

fchollet reviewed Apr 18, 2023

melspec test added

865af24

awsaf49 requested a review from fchollet May 3, 2023 04:00

awsaf49 marked this pull request as ready for review May 31, 2023 03:42

gbaned added the keras-team-review-pending Pending review by a Keras team member. label Jul 4, 2023

mihirparadkar removed the keras-team-review-pending Pending review by a Keras team member. label Jul 6, 2023

fchollet approved these changes Jul 12, 2023

google-ml-butler bot added kokoro:force-run ready to pull Ready to be merged into the codebase labels Jul 12, 2023

kokoro-team removed the kokoro:force-run label Jul 12, 2023

copybara-service bot mentioned this pull request Jul 12, 2023

PR #17717: Add MelSpectrogram layer #18287

Closed

4 tasks

Merge branch 'keras-team:master' into melspec

ad05c7a

google-ml-butler bot removed the ready to pull Ready to be merged into the codebase label Jul 13, 2023

gbaned requested a review from fchollet July 14, 2023 07:39

copybara-service bot mentioned this pull request Jul 14, 2023

PR #17717: Add MelSpectrogram layer #18289

Closed

4 tasks

gbaned requested review from fchollet and removed request for fchollet July 17, 2023 07:54

Merge branch 'keras-team:master' into melspec

c78ccca

awsaf49 mentioned this pull request Sep 22, 2023

Add MelSpectrogram layer #18405

Closed

awsaf49 added 6 commits September 15, 2023 22:30

update: output shape test

a507504

update: all zero input test

d429a75

add: numeric test

40c2e1e

add: keras import

b677110

remove: unused imports

5c3e5be

update: format for flake8

3cf0f23

sachinprasadhs closed this Sep 19, 2023

google-ml-butler bot removed the awaiting review label Sep 19, 2023

awsaf49 mentioned this pull request Feb 18, 2024

Add: MelSpectrogram layer #19194

Merged
