Cache usdz archive traversal for subsequent lookup/queries on external references. #1578

marsupial · 2021-07-29T17:48:22Z

Description of Change(s)

While linearly traversing the usdz archive, cache the path and header information so that subsequent lookups can be accelerated.

Fixes Issue(s)

#1577, #1579

This brings down load time of a usdz file that was taking ~3 minutes to ~16 seconds, and can often wind up performing better than parsing the original usdc file that generated the usdz (likely because IO is avoided on the references as the files are in memory from the archive).

The test case is an example that takes more than 15 seconds to load currently, but with caching goes down to 2 seconds. It has 25000 entries, chosen to demonstrate the time difference in a way that can be tested against regression. The amount of entries is also good to test against any race conditions filling in the cache (though per the comments in code, a relatively simplistic filling is used to as it performs well enough and keeps the code simple)

jilliene · 2021-08-06T17:48:30Z

Filed as internal issue #USD-6817

sunyab · 2021-11-13T01:37:30Z

Just wanted to mention I'm looking at merging this PR in now. Thanks for your patience!

sunyab · 2021-12-15T23:35:43Z

Thanks for your patience. I ended up reworking some of these changes and separating the fix for #1577 and #1579 into different commits. I also pulled the performance test into an internal test suite instead of committing them, as I'm hesitant to add a semi-large-ish file to the repo. We could certainly revisit that if necessary.

I'm going to close this PR out in favor of those separate commits, which will land with the next dev push. Thanks again!

@marsupial

…l references. The idea for this change and the initial implementation were contributed by @marsupial in PR #1578. I did some further simplifications, cleanup to conform to our standard style, and added a C++ unit test to exercise iterator behavior. From the original submission: While linearly traversing the **usdz** archive, cache the path and header information so that subsequent lookups can be accelerated. This brings down load time of a **usdz** file that was taking ~3 minutes to ~16 seconds, and can often wind up performing better than parsing the original **usdc** file that generated the **usdz** (likely because IO is avoided on the references as the files are _in memory_ from the archive). Fixes #1577 (Internal change: 2205950)

@marsupial

where default-constructed structs may incorrectly be detected as valid if their signature field happened to be initialized with a value that matched the expected signature. This initial fix for this issue was provided by @marsupial in PR #1578. Fixes #1579 (Internal change: 2206828)

@marsupial

…l references. The idea for this change and the initial implementation were contributed by @marsupial in PR PixarAnimationStudios#1578. I did some further simplifications, cleanup to conform to our standard style, and added a C++ unit test to exercise iterator behavior. From the original submission: While linearly traversing the **usdz** archive, cache the path and header information so that subsequent lookups can be accelerated. This brings down load time of a **usdz** file that was taking ~3 minutes to ~16 seconds, and can often wind up performing better than parsing the original **usdc** file that generated the **usdz** (likely because IO is avoided on the references as the files are _in memory_ from the archive). Fixes PixarAnimationStudios#1577 (Internal change: 2205950)

@marsupial

where default-constructed structs may incorrectly be detected as valid if their signature field happened to be initialized with a value that matched the expected signature. This initial fix for this issue was provided by @marsupial in PR PixarAnimationStudios#1578. Fixes PixarAnimationStudios#1579 (Internal change: 2206828)

marsupial changed the base branch from release to dev July 29, 2021 17:48

marsupial added 2 commits July 29, 2021 14:28

[usd] Fix usdz O(n^2) performance for heavily referenced files/archives.

48b98a4

[usd] Add test for usdz O(n^2) performance.

2d30f0e

marsupial force-pushed the PR/1577-usdz-cache-linear-traversal branch from 8ef1053 to 2d30f0e Compare July 29, 2021 18:29

sunyab closed this Dec 15, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Cache usdz archive traversal for subsequent lookup/queries on external references. #1578

Cache usdz archive traversal for subsequent lookup/queries on external references. #1578

marsupial commented Jul 29, 2021 •

edited

Loading

jilliene commented Aug 6, 2021

sunyab commented Nov 13, 2021

sunyab commented Dec 15, 2021

Cache usdz archive traversal for subsequent lookup/queries on external references. #1578

Cache usdz archive traversal for subsequent lookup/queries on external references. #1578

Conversation

marsupial commented Jul 29, 2021 • edited Loading

Description of Change(s)

Fixes Issue(s)

jilliene commented Aug 6, 2021

sunyab commented Nov 13, 2021

sunyab commented Dec 15, 2021

marsupial commented Jul 29, 2021 •

edited

Loading