Generate and import missing image embeddings #1177

raphael0202 · 2023-08-24T09:03:44Z

The new product categorizer was deployed in March 2023, and since then it categorizes new uploaded products. However, we still don't have predictions for the rest of the database.
It uses the 10 most recent images of the product, using image embedding as input (see https://openfoodfacts.github.io/robotoff/explanations/category-prediction/ for more information about the model, section "ML prediction").
To predict categories on the full dataset, we need to generate and import image embeddings for all missing images, to be able to launch category detection.
The model that is used to generate the embeddings is stored here: https://github.com/openfoodfacts/robotoff-models/releases/tag/clip-vit-base-patch32. See Robotoff codebase for preprocessing code.

Here is a list of all the missing image paths: source_images.txt.gz

Here is a tutorial on how to download images on Open Food Facts: https://openfoodfacts.github.io/openfoodfacts-server/api/how-to-download-images/

alexgarel · 2023-10-10T10:42:48Z

As asked by Christelle, here are some embedings from production.

I generated it by using this code

corresponding images are at:

301/762/042/2003/<image_id>.jpg for 3017620422003_embeddings.json
327/408/000/5003/<image_id>.jpg for 3274080005003_embeddings.json

raphael0202 mentioned this issue Aug 29, 2023

Category prediction (tracker) #379

Closed

teolemon added ✅ Task logos & labels labels Jul 30, 2024

teolemon removed the ✅ Task label Oct 18, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Generate and import missing image embeddings #1177

Generate and import missing image embeddings #1177

raphael0202 commented Aug 24, 2023

alexgarel commented Oct 10, 2023

Generate and import missing image embeddings #1177

Generate and import missing image embeddings #1177

Comments

raphael0202 commented Aug 24, 2023

alexgarel commented Oct 10, 2023