Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Improve duplicate image detection #8445

Open
Tracked by #1627
teolemon opened this issue May 23, 2023 · 4 comments
Open
Tracked by #1627

Improve duplicate image detection #8445

teolemon opened this issue May 23, 2023 · 4 comments
Labels
🖼️ Images infrastructure https://wiki.openfoodfacts.org/Infrastructure 🎯 P1 ⏰ Stale This issue hasn't seen activity in a while. You can try documenting more to unblock it.

Comments

@teolemon
Copy link
Member

teolemon commented May 23, 2023

What

  • Improve duplicate image detection to detect duplicates with byte size difference that are not perceivable.
  • We should have a tolerance threshold to be defined (1%, 5%)
  • We should also backprocess the 2 last years of edits by yuka

Why

  • We lack disk space, and since Yuka (mostly) has had this behaviour for at least 2 years, there are sizeable savings to be made.
image

https://world.openfoodfacts.org/product/3173289105708/quadro
https://world.openfoodfacts.org/images/products/317/328/910/5708/2.jpg
https://world.openfoodfacts.org/images/products/317/328/910/5708/4.jpg

Who for

Part of

@teolemon teolemon added ♞ Epic An epic groups several tasks/issues. It should have a meaning for users. 🖼️ Images infrastructure https://wiki.openfoodfacts.org/Infrastructure labels May 23, 2023
@teolemon
Copy link
Member Author

@github-actions
Copy link
Contributor

This issue has been open 90 days with no activity. Can you give it a little love by linking it to a parent issue, adding relevant labels and projets, creating a mockup if applicable, adding code pointers from https://github.com/openfoodfacts/openfoodfacts-server/blob/main/.github/labeler.yml, giving it a priority, editing the original issue to have a more comprehensive description… Thank you very much for your contribution to 🍊 Open Food Facts

@github-actions github-actions bot added the ⏰ Stale This issue hasn't seen activity in a while. You can try documenting more to unblock it. label Aug 22, 2023
Copy link
Contributor

This issue has been open 90 days with no activity. Can you give it a little love by linking it to a parent issue, adding relevant labels and projets, creating a mockup if applicable, adding code pointers from https://github.com/openfoodfacts/openfoodfacts-server/blob/main/.github/labeler.yml, giving it a priority, editing the original issue to have a more comprehensive description… Thank you very much for your contribution to 🍊 Open Food Facts

@teolemon
Copy link
Member Author

Potentially

@teolemon teolemon removed the ♞ Epic An epic groups several tasks/issues. It should have a meaning for users. label Oct 19, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
🖼️ Images infrastructure https://wiki.openfoodfacts.org/Infrastructure 🎯 P1 ⏰ Stale This issue hasn't seen activity in a while. You can try documenting more to unblock it.
Projects
Status: No status
Status: To discuss and validate
Development

No branches or pull requests

1 participant