Training data does not improve initial Custom Speech model compared to the baseline model #58

KatieProchilo · 2020-06-03T16:22:10Z

Using data provided by the Custom Speech team, the first Custom Speech model that users train will not improve compared to the baseline model when we train it using data in the training folder.

This results in the pipeline rightfully failing when the word error rate does not improve, but this means no releases will be created, which is a huge loss for users who want to learn about that.

Solution

The training data at that link should improve recognition against the evaluation test data (audio + human-labeled transcripts) in the testing folder at that link.

KatieProchilo · 2020-06-17T23:44:56Z

This has been filed as CSE feedback.

KatieProchilo added the P1 Good to fix. label Jun 3, 2020

KatieProchilo self-assigned this Jun 3, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Training data does not improve initial Custom Speech model compared to the baseline model #58

Training data does not improve initial Custom Speech model compared to the baseline model #58

KatieProchilo commented Jun 3, 2020 •

edited

Loading

KatieProchilo commented Jun 17, 2020

Training data does not improve initial Custom Speech model compared to the baseline model #58

Training data does not improve initial Custom Speech model compared to the baseline model #58

Comments

KatieProchilo commented Jun 3, 2020 • edited Loading

Solution

KatieProchilo commented Jun 17, 2020

KatieProchilo commented Jun 3, 2020 •

edited

Loading