LID: several random samples for long file #6853

karpnv · 2023-06-12T15:58:59Z

What does this PR do ?

Use several random samples for long files

Collection: ASR

Changelog

Added parameters:

segment_duration (float): random sample duration in seconds
num_segments (int): number of segments of file to use for majority vote

Usage

lang_model = nemo_asr.models.EncDecSpeakerLabelModel.from_pretrained(model_name="langid_ambernet")
lang = lang_model.get_label(filename, segment_duration = np.inf, num_segments = 1, random_seed = None)

PR Type:

[V] New Feature

Who can review?

@fayejf

Signed-off-by: Nikolay Karpov <[email protected]>

…ation_limit

for more information, see https://pre-commit.ci

Signed-off-by: Nikolay Karpov <[email protected]>

…ation_limit

github-actions · 2023-06-29T02:03:56Z

This PR is stale because it has been open for 14 days with no activity. Remove stale label or comment or update or this will be closed in 7 days.

github-actions · 2023-07-06T02:07:28Z

This PR was closed because it has been inactive for 7 days since being marked as stale.

Signed-off-by: Nikolay Karpov <[email protected]>

…ation_limit

Signed-off-by: Nikolay Karpov <[email protected]>

for more information, see https://pre-commit.ci

Signed-off-by: Nikolay Karpov <[email protected]>

…into karpnv/duration_limit

…ation_limit

Signed-off-by: Nikolay Karpov <[email protected]>

for more information, see https://pre-commit.ci

Signed-off-by: Nikolay Karpov <[email protected]>

…into karpnv/duration_limit

Signed-off-by: Nikolay Karpov <[email protected]>

nithinraok · 2023-10-25T00:16:23Z

nemo/collections/asr/models/label_models.py


        Returns:
            label: label corresponding to the trained model
        """
-        _, logits = self.infer_file(path2audio_file=path2audio_file)
+        audio, sr = librosa.load(path2audio_file, sr=None)


can you replace this with sf.read it is much faster

I know it might not have been originally designed to support reading mp3 or multi-channel (stereo) wav files, but it was able to do so in the past with librosa.load, however, it may result in errors after switching to sf.read. Should we consider adding support for more formats or stick to using librosa.load for consistency?

nithinraok · 2023-10-25T00:17:59Z

nemo/collections/asr/models/label_models.py

-            path2audio_file: path to audio wav file
+            path2audio_file (str): path to audio wav file
+            segment_duration (float): random sample duration in seconds
+            num_segments (int): number of segments of file to use for majority vote


instead of num_segments, just do non-overlap segments from start to end based on 5 sec audio samples? Have you done ablation study on what is best?

I personally didn't, but it was suggested by Fai. This is for very long audio (several hours). We take several segments and get result by majority vote

…ation_limit

Signed-off-by: Nikolay Karpov <[email protected]>

for more information, see https://pre-commit.ci

Signed-off-by: Nikolay Karpov <[email protected]>

…into karpnv/duration_limit

nithinraok

LGTM, Please add a random seed for selection of random segments.

Signed-off-by: Nikolay Karpov <[email protected]>

…ation_limit

for more information, see https://pre-commit.ci

karpnv · 2023-10-26T10:36:01Z

added random_seed parameter

nithinraok

Thanks, LGTM

Signed-off-by: Nikolay Karpov <[email protected]>

karpnv · 2023-10-30T17:12:26Z

jenkins

Signed-off-by: Nikolay Karpov <[email protected]>

…ation_limit

nithinraok · 2023-11-03T16:03:21Z

jenkins

Signed-off-by: Nikolay Karpov <[email protected]>

nithinraok · 2023-11-03T19:59:54Z

jenkins

* add random samlpes (num_segments) with segment_duration for get_label(filename, segment_duration = 60*6, num_segments) --------- Signed-off-by: Nikolay Karpov <[email protected]> Signed-off-by: Piotr Żelasko <[email protected]>

* add random samlpes (num_segments) with segment_duration for get_label(filename, segment_duration = 60*6, num_segments) --------- Signed-off-by: Nikolay Karpov <[email protected]>

add duration_limit

44fbcd8

Signed-off-by: Nikolay Karpov <[email protected]>

github-actions bot added the ASR label Jun 12, 2023

karpnv and others added 4 commits June 12, 2023 08:59

Merge branch 'main' of https://github.com/NVIDIA/NeMo into karpnv/dur…

3a2303a

…ation_limit

[pre-commit.ci] auto fixes from pre-commit.com hooks

034bb3a

for more information, see https://pre-commit.ci

target_sr

ebd9f1b

Signed-off-by: Nikolay Karpov <[email protected]>

Merge branch 'main' of https://github.com/NVIDIA/NeMo into karpnv/dur…

30cee72

…ation_limit

github-actions bot added the stale label Jun 29, 2023

github-actions bot closed this Jul 6, 2023

karpnv and others added 10 commits July 6, 2023 05:56

limit first

5c1034d

Signed-off-by: Nikolay Karpov <[email protected]>

Merge branch 'main' of https://github.com/NVIDIA/NeMo into karpnv/dur…

facb08d

…ation_limit

soundfile

c6eae3e

Signed-off-by: Nikolay Karpov <[email protected]>

[pre-commit.ci] auto fixes from pre-commit.com hooks

d936bc6

for more information, see https://pre-commit.ci

rm soudfile

54da5d8

Signed-off-by: Nikolay Karpov <[email protected]>

Merge branch 'karpnv/duration_limit' of https://github.com/NVIDIA/NeMo …

06ff20e

…into karpnv/duration_limit

Merge branch 'main' of https://github.com/NVIDIA/NeMo into karpnv/dur…

541ae3a

…ation_limit

Merge branch 'main' of https://github.com/NVIDIA/NeMo into karpnv/dur…

4125955

…ation_limit

Merge branch 'main' of https://github.com/NVIDIA/NeMo into karpnv/dur…

20e67ab

…ation_limit

infer_segment

514f66d

Signed-off-by: Nikolay Karpov <[email protected]>

karpnv reopened this Oct 24, 2023

pre-commit-ci bot and others added 4 commits October 24, 2023 17:44

[pre-commit.ci] auto fixes from pre-commit.com hooks

270789f

for more information, see https://pre-commit.ci

soundfile

68bd103

Signed-off-by: Nikolay Karpov <[email protected]>

Merge branch 'karpnv/duration_limit' of https://github.com/NVIDIA/NeMo …

c41b09f

…into karpnv/duration_limit

docstring

4a0acc6

Signed-off-by: Nikolay Karpov <[email protected]>

karpnv marked this pull request as ready for review October 24, 2023 17:52

karpnv changed the title ~~duration limit~~ LID: several random samples for long file Oct 24, 2023

karpnv requested a review from nithinraok October 24, 2023 17:56

nithinraok reviewed Oct 25, 2023

View reviewed changes

github-actions bot removed the stale label Oct 25, 2023

karpnv and others added 3 commits October 25, 2023 00:13

Merge branch 'main' of https://github.com/NVIDIA/NeMo into karpnv/dur…

961893d

…ation_limit

soundfile

360fcf4

Signed-off-by: Nikolay Karpov <[email protected]>

[pre-commit.ci] auto fixes from pre-commit.com hooks

9b15855

for more information, see https://pre-commit.ci

karpnv requested a review from nithinraok October 25, 2023 07:30

karpnv added 2 commits October 25, 2023 03:08

type float

26bcfa7

Signed-off-by: Nikolay Karpov <[email protected]>

Merge branch 'karpnv/duration_limit' of https://github.com/NVIDIA/NeMo …

93ecb72

…into karpnv/duration_limit

nithinraok previously approved these changes Oct 25, 2023

View reviewed changes

karpnv added 2 commits October 26, 2023 03:31

random_seed

783121a

Signed-off-by: Nikolay Karpov <[email protected]>

Merge branch 'main' of https://github.com/NVIDIA/NeMo into karpnv/dur…

5000160

…ation_limit

karpnv dismissed nithinraok’s stale review via 5000160 October 26, 2023 10:32

[pre-commit.ci] auto fixes from pre-commit.com hooks

476dec3

for more information, see https://pre-commit.ci

karpnv requested a review from nithinraok October 26, 2023 10:36

nithinraok previously approved these changes Oct 26, 2023

View reviewed changes

Merge branch 'main' into karpnv/duration_limit

6824fce

Signed-off-by: Nikolay Karpov <[email protected]>

karpnv added 3 commits October 31, 2023 10:36

Merge branch 'main' into karpnv/duration_limit

58ffd07

Signed-off-by: Nikolay Karpov <[email protected]>

Merge branch 'main' of https://github.com/NVIDIA/NeMo into karpnv/dur…

7619746

…ation_limit

Merge branch 'main' of https://github.com/NVIDIA/NeMo into karpnv/dur…

bae985e

…ation_limit

to float

3df2df1

Signed-off-by: Nikolay Karpov <[email protected]>

karpnv dismissed nithinraok’s stale review via 3df2df1 November 3, 2023 19:57

karpnv requested a review from nithinraok November 3, 2023 19:59

nithinraok approved these changes Nov 3, 2023

View reviewed changes

karpnv merged commit 286e84e into main Nov 6, 2023
15 checks passed

karpnv deleted the karpnv/duration_limit branch November 6, 2023 07:51

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

LID: several random samples for long file #6853

LID: several random samples for long file #6853

karpnv commented Jun 12, 2023 •

edited

Loading

github-actions bot commented Jun 29, 2023

github-actions bot commented Jul 6, 2023

nithinraok Oct 25, 2023

karpnv Oct 25, 2023

liuspencersjtu Mar 10, 2024

nithinraok Oct 25, 2023

karpnv Oct 25, 2023

nithinraok left a comment

karpnv commented Oct 26, 2023

nithinraok left a comment

karpnv commented Oct 30, 2023

nithinraok commented Nov 3, 2023

nithinraok commented Nov 3, 2023

LID: several random samples for long file #6853

LID: several random samples for long file #6853

Conversation

karpnv commented Jun 12, 2023 • edited Loading

What does this PR do ?

Changelog

Usage

Who can review?

github-actions bot commented Jun 29, 2023

github-actions bot commented Jul 6, 2023

nithinraok Oct 25, 2023

Choose a reason for hiding this comment

karpnv Oct 25, 2023

Choose a reason for hiding this comment

liuspencersjtu Mar 10, 2024

Choose a reason for hiding this comment

nithinraok Oct 25, 2023

Choose a reason for hiding this comment

karpnv Oct 25, 2023

Choose a reason for hiding this comment

nithinraok left a comment

Choose a reason for hiding this comment

karpnv commented Oct 26, 2023

nithinraok left a comment

Choose a reason for hiding this comment

karpnv commented Oct 30, 2023

nithinraok commented Nov 3, 2023

nithinraok commented Nov 3, 2023

karpnv commented Jun 12, 2023 •

edited

Loading