Add ZoeDepth #30136

NielsRogge · 2024-04-09T07:44:42Z

What does this PR do?

This PR adds ZoeDepth as introduced in ZoeDepth: Zero-shot Transfer by Combining Relative and Metric Depth.

To do:

double check image processor (not sure we can support the same resize). Update image processor accordingly
remove testing scripts
verify relative position bias table/index when loading a beit model from the hub
add slow integration test
add image processor tests
should we add backbone_hidden_size?
make doc tests pass

HuggingFaceDocBuilderDev · 2024-04-09T08:04:11Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

NielsRogge · 2024-06-03T11:21:36Z

@amyeroberts feel free to approve the PR as all comments have been addressed

amyeroberts · 2024-06-04T10:16:11Z

@NielsRogge Have you tested for the issues related to weight loading / DPT?

NielsRogge · 2024-06-04T16:07:40Z

Yes, can confirm:

>>> from transformers import ZoeDepthForDepthEstimation
>>> model = ZoeDepthForDepthEstimation.from_pretrained("Intel/zoedepth-nyu-kitti")
config.json: 100%|██████████| 2.23k/2.23k [00:00<00:00, 1.41MB/s]
model.safetensors: 100%|██████████| 1.38G/1.38G [00:29<00:00, 47.2MB/s]
>>> model = ZoeDepthForDepthEstimation.from_pretrained("Intel/zoedepth-nyu")
config.json: 100%|██████████| 2.13k/2.13k [00:00<00:00, 3.27MB/s]
model.safetensors: 100%|██████████| 1.38G/1.38G [00:29<00:00, 46.4MB/s]
>>> model = ZoeDepthForDepthEstimation.from_pretrained("Intel/zoedepth-kitti")
config.json: 100%|██████████| 2.12k/2.12k [00:00<00:00, 1.52MB/s]
model.safetensors: 100%|██████████| 1.38G/1.38G [00:29<00:00, 46.0MB/s]
>>>

amyeroberts

Thanks for running all the tests to confirm the model behaviours!

There are still some comments which weren't resolved. In particular, the behaviour of the image processor's keep_aspect_ratio and ensure_multiple_of arguments needs to be thoroughly tested.

src/transformers/models/zoedepth/image_processing_zoedepth.py

amyeroberts · 2024-06-06T11:40:23Z

tests/models/zoedepth/test_image_processing_zoedepth.py

+
+    def test_keep_aspect_ratio(self):
+        size = {"height": 512, "width": 512}
+        image_processor = ZoeDepthImageProcessor(size=size, keep_aspect_ratio=True, ensure_multiple_of=32)


ensure_multiple_of should also be tested with keep_aspect_ratio=False
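
A minimal sketch of what such a test could look like, assuming a DPT-style rule in which, with keep_aspect_ratio=False, the requested height and width are used directly and each side is then rounded to the nearest multiple on its own. The helpers `round_to_multiple` and `target_size_without_aspect_ratio` are hypothetical and written only for illustration; they are not the functions in image_processing_zoedepth.py, whose exact rounding may differ.

```python
def round_to_multiple(value: float, multiple: int) -> int:
    """Round `value` to the nearest multiple of `multiple`, never below one multiple."""
    return max(multiple, round(value / multiple) * multiple)


def target_size_without_aspect_ratio(size: dict, ensure_multiple_of: int) -> tuple:
    """Hypothetical helper (assumption, not the library code): with
    keep_aspect_ratio=False the requested height and width are taken as-is,
    then each side is snapped to the multiple independently."""
    return (
        round_to_multiple(size["height"], ensure_multiple_of),
        round_to_multiple(size["width"], ensure_multiple_of),
    )


def test_ensure_multiple_of_without_keep_aspect_ratio():
    # 500 and 610 are deliberately not multiples of 32, so the effect of
    # ensure_multiple_of is visible in the result.
    height, width = target_size_without_aspect_ratio({"height": 500, "width": 610}, 32)
    assert height % 32 == 0 and width % 32 == 0
    assert (height, width) == (512, 608)


test_ensure_multiple_of_without_keep_aspect_ratio()
```

Choosing a requested size that is not already a multiple of 32 matters here: with size 512x512 the test would pass whether or not ensure_multiple_of did anything.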

NielsRogge · 2024-06-10T13:16:14Z

@amyeroberts addressed your comment, failing CI is unrelated.

amyeroberts

Thanks for the continued work on this!

There's still a bit of work to do on the testing and documentation of keep_aspect_ratio and ensure_multiple_of, otherwise the PR looks good!

I realise it might seem like I'm being picky here, but there are two reasons why this is important:

Users should be able to read the docstring and know how to use the objects. At the moment, the descriptions mean that someone still has to go look at the code.
There have been quite a few fixes to resizing logic for models such as yolos. The bugs weren't caught because the behaviour wasn't properly tested. It's important we test thoroughly and correctly now, as it's harder to fix post-merge.

In particular, when testing, it's fine if some of the logic seems repetitive. Tests should be DAMP rather than DRY. They serve not just as a safety net, but also documentation. As such, it's important we isolate so we're testing a single idea at a time, and the behaviour being tested is obvious (e.g. having the same output with keep_aspect_ratio as True and False doesn't tell us about what keep_aspect_ratio does).
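
To illustrate the point about isolating one idea per test, here is a small sketch in which keep_aspect_ratio=True and keep_aspect_ratio=False produce different outputs for the same input, so each test documents what the flag does. The function `output_size` is a hypothetical stand-in that assumes a DPT-style rule (apply the single scale factor closest to 1 to both sides); it is not the ZoeDepth image processor itself.

```python
def output_size(input_size, target_size, keep_aspect_ratio):
    """Hypothetical resize rule (assumption for illustration only).

    Sizes are (height, width) tuples. When keeping the aspect ratio, the scale
    factor that changes the image least is applied to both sides.
    """
    in_h, in_w = input_size
    tgt_h, tgt_w = target_size
    scale_h, scale_w = tgt_h / in_h, tgt_w / in_w
    if keep_aspect_ratio:
        if abs(1 - scale_w) < abs(1 - scale_h):
            scale_h = scale_w
        else:
            scale_w = scale_h
    return round(scale_h * in_h), round(scale_w * in_w)


def test_keep_aspect_ratio_false_stretches_to_target():
    # A 400x800 input is stretched to exactly the requested square.
    assert output_size((400, 800), (512, 512), keep_aspect_ratio=False) == (512, 512)


def test_keep_aspect_ratio_true_preserves_ratio():
    # The same input keeps its 1:2 ratio, so the result is not the requested square.
    assert output_size((400, 800), (512, 512), keep_aspect_ratio=True) == (512, 1024)


test_keep_aspect_ratio_false_stretches_to_target()
test_keep_aspect_ratio_true_preserves_ratio()
```

The two tests repeat the same setup on purpose. A square input would give (512, 512) in both cases and would say nothing about the flag, which is the failure mode described above.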

src/transformers/models/zoedepth/image_processing_zoedepth.py

tests/models/zoedepth/test_image_processing_zoedepth.py

tests/models/zoedepth/test_modeling_zoedepth.py

amyeroberts

Thanks for all the work adding this model!

NielsRogge · 2024-07-08T09:05:28Z

Slow tests passing locally, CI failing tests are unrelated, merging.

amyeroberts · 2024-07-08T09:46:06Z

@NielsRogge Slow tests have to pass on the CI runs before merging, not only locally. Please do not forcibly merge like this again. The running environment and hardware introduce numerical differences, which means integration values may not match. You'll need to follow this up and make sure the slow tests pass on our CI images.
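
As a rough illustration of why an integration check can pass on one machine and fail on another, the sketch below compares a slice of outputs against stored expected values with an absolute tolerance. The helper `assert_close` and every number in it are made-up placeholders, not real ZoeDepth outputs; actual slow tests would more likely use a tensor comparison with a tolerance, and the stdlib version here is only meant to stay dependency-free.

```python
import math


def assert_close(actual, expected, atol=1e-4):
    """Hypothetical helper: element-wise comparison with an absolute tolerance."""
    assert len(actual) == len(expected), "length mismatch"
    for index, (a, e) in enumerate(zip(actual, expected)):
        assert math.isclose(a, e, rel_tol=0.0, abs_tol=atol), (
            f"element {index}: {a} differs from {e} by more than {atol}"
        )


# Placeholder values only: "expected" stands for numbers recorded on one machine,
# "observed" for the same computation on different hardware.
expected = [1.0, 2.0, 3.0]
observed = [1.00002, 1.99997, 3.00001]

assert_close(observed, expected, atol=1e-4)  # passes: drift is within tolerance

try:
    assert_close(observed, expected, atol=1e-6)  # too strict for this drift
    strict_check_passed = True
except AssertionError:
    strict_check_passed = False
```

Expected values recorded locally encode the local environment, so a tolerance that looks safe there still has to be validated on the CI images.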

NielsRogge added 27 commits January 29, 2024 12:15

First draft

c074ce8

Fix merge

ca4d141

Add docs

420397f

Merge remote-tracking branch 'upstream/main' into add_zoedepth

02d775a

Clean up code

de9d51e

Convert model

705f4c6

Add image processor

8080b35

Convert Zoe_K

7e511c2

More improvements

ceb079b

Improve variable names and docstrings

331b48d

Improve variable names

be57cc6

Improve variable names

27c013c

Replace nn.sequential

090bb82

Merge remote-tracking branch 'upstream/main' into add_zoedepth

74088b3

More improvements

712b483

Convert ZoeD_NK

a1f9520

Fix most tests

04cd658

Verify pixel values

73bd15e

Verify pixel values

2198070

Add squeeze

470856b

Update beit to support arbitrary window sizes

ad188e5

Improve image processor

f422f24

Improve docstring

8c611c3

Improve beit

69b3593

Improve model outputs

35f86df

Add figure

0146011

Fix beit

a8d7739

NielsRogge mentioned this pull request Apr 11, 2024

Make sure HF download metrics work lpiccinelli-eth/UniDepth#7

Merged

Update checkpoint

46a6479

amyeroberts reviewed Jun 6, 2024

NielsRogge added 3 commits June 6, 2024 16:37

Improve docstrings, add test

525869e

Fix merge

5aad6c7

Fix interpolate_pos_encoding

bcd7ae1

NielsRogge force-pushed the add_zoedepth branch from 72fb159 to bcd7ae1 on June 7, 2024 11:46

Fix slow tests

39e2ca3

NielsRogge assigned amyeroberts Jun 7, 2024

Add docstring

bde8dda

amyeroberts reviewed Jun 12, 2024

NielsRogge and others added 3 commits June 16, 2024 18:32

Update src/transformers/models/zoedepth/image_processing_zoedepth.py

80bb3ed

Co-authored-by: amyeroberts <[email protected]>

Update src/transformers/models/zoedepth/image_processing_zoedepth.py

6137387

Co-authored-by: amyeroberts <[email protected]>

Improve tests and docstrings

e6d8aac

amyeroberts mentioned this pull request Jun 18, 2024

ValueError: The checkpoint you are trying to load has model type zoedepth but Transformers does not recognize this architecture #31477

Closed

4 tasks

amyeroberts reviewed Jun 18, 2024

tests/models/zoedepth/test_modeling_zoedepth.py (outdated, resolved)

NielsRogge added 7 commits June 28, 2024 15:44

Fix merge

dffbfea

Use run_common_tests

34a1abb

Improve docstrings

1bcd19f

Improve docstrings

c6e5d6f

Improve tests

8ac163f

Improve tests

6cb3c56

Remove print statements

b2534b2

amyeroberts approved these changes Jul 2, 2024

Merge remote-tracking branch 'upstream/main' into add_zoedepth

617487d

NielsRogge merged commit 06fd797 into huggingface:main Jul 8, 2024
25 checks passed
