Input data format #25464

amyeroberts · 2023-08-11T14:49:51Z

What does this PR do?

Adds the input_data_format argument to all of the image processor methods.

This allows for passing in of images with an unusual number of channels, or ones where it's difficult to infer because of ambiguity e.g size (3, 3, 3).

This is an alternative to #24577

Fixes issues like:

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline,
Pull Request section?
Was this discussed/approved via a Github issue or the forum? Please add a link
to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the
documentation guidelines, and
here are tips on formatting docstrings.
Did you write any new necessary tests?

HuggingFaceDocBuilderDev · 2023-08-11T15:08:11Z

The documentation is not available anymore as the PR was closed or merged.

sgugger

Thanks a lot for your work on this!

sgugger · 2023-08-16T15:27:37Z

src/transformers/models/conditional_detr/image_processing_conditional_detr.py

        **kwargs,
    ) -> np.ndarray:
        """
        Resize the image to the given size. Size can be `min_size` (scalar) or `(height, width)` tuple. If size is an
        int, smaller edge of the image will be matched to this number.
+
+        Args:


Nice to add this here!

sgugger · 2023-08-16T15:30:00Z

src/transformers/models/mask2former/image_processing_mask2former.py

+                Image to resize.
+            size (`Dict[str, int]`):
+                The size of the output image.
+            size_divisor (`int`, *optional*, defaults to `0`):


Nit:

Suggested change

size_divisor (`int`, *optional*, defaults to `0`):

size_divisor (`int`, *optional*, defaults to 0):

sgugger · 2023-08-16T15:30:48Z

src/transformers/models/maskformer/image_processing_maskformer.py

+                Image to resize.
+            size (`Dict[str, int]`):
+                The size of the output image.
+            size_divisor (`int`, *optional*, defaults to `0`):


Suggested change

size_divisor (`int`, *optional*, defaults to `0`):

size_divisor (`int`, *optional*, defaults to 0):

( may have missed some so worth doing a quick search!)

* Add copied from statements for image processors * Move out rescale and normalize to base image processor * Remove rescale and normalize from vit (post rebase) * Update docstrings and tidy up * PR comments * Add input_data_format as preprocess argument * Resolve tests and tidy up * Remove num_channels argument * Update doc strings -> default ints not in code formatting

amyeroberts force-pushed the input-data-format branch from c950341 to 8aef2a0 Compare August 15, 2023 10:46

amyeroberts added 5 commits August 16, 2023 14:16

Add copied from statements for image processors

8952859

Move out rescale and normalize to base image processor

0e28ec0

Remove rescale and normalize from vit (post rebase)

57179a5

Update docstrings and tidy up

8857882

PR comments

c4c6125

amyeroberts force-pushed the input-data-format branch from 276cce5 to e113320 Compare August 16, 2023 14:36

amyeroberts added 3 commits August 16, 2023 14:39

Add input_data_format as preprocess argument

e85cf63

Resolve tests and tidy up

207019c

Remove num_channels argument

3c5a39d

amyeroberts force-pushed the input-data-format branch from e113320 to 3c5a39d Compare August 16, 2023 14:39

amyeroberts requested a review from sgugger August 16, 2023 14:54

sgugger approved these changes Aug 16, 2023

View reviewed changes

Update doc strings -> default ints not in code formatting

24c9bd8

amyeroberts merged commit 6bca43b into huggingface:main Aug 16, 2023
3 checks passed

amyeroberts deleted the input-data-format branch August 16, 2023 16:45

This was referenced Aug 16, 2023

Add imageArray #24577

Closed

YOLOS - reset default return_pixel_mask value #25559

Merged

rafaelpadilla mentioned this pull request Aug 22, 2023

removing unnecesssary extra parameter #25643

Merged

5 tasks

ArthurZucker mentioned this pull request Aug 23, 2023

Unable to follow Object Detection Task Example due to ImageProcessor error #25666

Closed

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Input data format #25464

Input data format #25464

amyeroberts commented Aug 11, 2023 •

edited

Loading

HuggingFaceDocBuilderDev commented Aug 11, 2023 •

edited

Loading

sgugger left a comment

sgugger Aug 16, 2023

sgugger Aug 16, 2023

sgugger Aug 16, 2023

	size_divisor (`int`, optional, defaults to `0`):
	size_divisor (`int`, optional, defaults to 0):

Input data format #25464

Input data format #25464

Conversation

amyeroberts commented Aug 11, 2023 • edited Loading

What does this PR do?

Before submitting

HuggingFaceDocBuilderDev commented Aug 11, 2023 • edited Loading

sgugger left a comment

Choose a reason for hiding this comment

sgugger Aug 16, 2023

Choose a reason for hiding this comment

sgugger Aug 16, 2023

Choose a reason for hiding this comment

sgugger Aug 16, 2023

Choose a reason for hiding this comment

amyeroberts commented Aug 11, 2023 •

edited

Loading

HuggingFaceDocBuilderDev commented Aug 11, 2023 •

edited

Loading