Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

make a standard training / test / verification data set #13

Open
Tracked by #2
alexgarel opened this issue Mar 21, 2022 · 0 comments
Open
Tracked by #2

make a standard training / test / verification data set #13

alexgarel opened this issue Mar 21, 2022 · 0 comments
Assignees

Comments

@alexgarel
Copy link
Member

alexgarel commented Mar 21, 2022

If we want to be able to verify our progress we need to have standardized set of products.

Build a list of products (list of barcode):

  • A first list should be used to be used for training / test sets (we can have a standard split, but this should be also used for cross training with different train / test split)
  • A second list should be used for validation (it's always the same)

However the sets should be built accounting for the different features present in data. So there should be some balancing in training / test / validation set for data having the different modalities : title / ingredients / nutritional data / images / OCR.

depends on

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
Status: To discuss and validate
Development

No branches or pull requests

2 participants