
CI: try to make caching more reliable #25259

Merged 11 commits into facebook:main on Sep 14, 2022

Conversation

@kassens (Member) commented Sep 13, 2022

  • ~/.yarn/cache is now restored from a hierarchical cache key; if no precise match is found, we fall back to less precise ones (see the restore sketch after this list).
  • The yarn install in fixtures/dom is also cached. Notably, it utilizes the cache from root, but stores into its own, more precise key.
  • Steps running in root no longer have a yarn install and rely on the cache from the setup step.
  • Retry yarn install once on failure.
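
A minimal sketch of what such a hierarchical restore can look like in CircleCI config; the key names below are illustrative, not the PR's literal keys:

```yaml
- restore_cache:
    name: Restore yarn cache
    keys:
      # Most precise first: exact lockfile match.
      - v2-yarn-{{ arch }}-{{ checksum "yarn.lock" }}
      # Fallbacks: CircleCI restores the most recently created cache
      # whose key starts with the first prefix that matches.
      - v2-yarn-{{ arch }}-
      - v2-yarn-
```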

```yaml
# This cache key is per branch, a yarn install in setup is required;
# all jobs run on this branch with the same lockfile.
name: Save node_modules cache
key: v2-node-{{ arch }}-{{ .Branch }}-{{ checksum "yarn.lock" }}-{{ checksum "workspace_info.txt" }}-node-modules
```
@kassens (Member, Author):

I don't know why we would include the branch name in the cache key. It seems to just needlessly split the cache keys (wasting cache storage and causing more cache misses).

Member:

What if you add a new dependency in a branch? Doesn't the cache need to break so you install the dependency?

Member:

Answer: In that case the yarn.lock would change for that branch.

@kassens (Member, Author):

But that doesn't have anything to do with the branch (name?), right? We might install a new dependency on the main branch, or re-use a branch name across multiple PRs.

My thinking is that yarn.lock + workspace_info should capture everything relevant.

If we're feeling unsure about re-using the node_modules cache across PRs, we could also use the git revision for the key (I saw that's available in the docs).
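
For context, the workspace_info.txt checksummed above has to be generated somewhere during setup; one plausible way to produce it with Yarn 1 workspaces is sketched below (the exact command is an assumption, not confirmed by this PR):

```yaml
- run:
    name: Record yarn workspace layout
    # `yarn workspaces info` prints each workspace's location and
    # dependencies as JSON; `head -n -1` drops yarn's trailing
    # "Done in Xs." line, which would otherwise change the checksum
    # on every run.
    command: yarn workspaces info | head -n -1 > workspace_info.txt
```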

```yaml
name: Save node_modules cache
key: v1-node_modules-{{ arch }}-{{ checksum "yarn.lock" }}-{{ checksum "workspace_info.txt" }}
paths:
  - node_modules
```
@kassens (Member, Author):

Just node_modules is actually insufficient, as yarn workspaces create node_modules directories in each package.

This feels tricky to get stable. Maybe we should keep the yarn install, but restore the yarn cache so the yarn install can make use of that.
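
A hedged sketch of what caching every workspace's node_modules could look like; the package paths listed are illustrative examples, not the repository's full workspace list:

```yaml
- save_cache:
    name: Save node_modules cache
    key: v1-node_modules-{{ arch }}-{{ checksum "yarn.lock" }}-{{ checksum "workspace_info.txt" }}
    paths:
      # The root install plus the per-workspace directories that yarn
      # workspaces create; every package has to be listed explicitly,
      # which is the fragile part discussed above.
      - node_modules
      - packages/react/node_modules
      - packages/react-dom/node_modules
      - fixtures/dom/node_modules
```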

Member:

I thought that was what we do?

@kassens (Member, Author) commented Sep 13, 2022

So something is still missing here. I'll keep investigating.

@kassens force-pushed the ci-caching branch 2 times, most recently from 91c2884 to 41333e2 on September 13, 2022 at 22:41
@kassens (Member, Author) commented Sep 14, 2022

I have 2 working configurations now:

  • yarn install everywhere, no more caching of node_modules. This simplifies the config and makes it more robust. (sample run)
  • Cache node_modules, including the node_modules in each package. This avoids the yarn install, which takes about 30s per step, but it's a bit fragile as it lists out all the node_modules directories for each package. I also think there's a possibility of cache injection by a malicious PR, so I included the epoch in the key to give every build its own cache (see the sketch after this list). (sample run)
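
A minimal sketch of the per-build-key idea. CircleCI does offer a built-in {{ epoch }} template, but it is evaluated when each step runs, so save and restore steps would compute different keys; writing the value to a file once keeps the key stable within a build. The file name here is hypothetical:

```yaml
# In the setup job (shared with later jobs, e.g. via persist_to_workspace):
- run:
    name: Record build epoch
    # Checksumming this file gives every build its own cache key, so
    # a malicious PR cannot poison a cache other builds would restore.
    command: date +%s > cache_epoch
- save_cache:
    key: v1-node_modules-{{ arch }}-{{ checksum "cache_epoch" }}
    paths:
      - node_modules
```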

@rickhanlonii (Member):

I'd prefer option 2, as 30s increases the feedback loop by 50% for most of the tests.

@poteto (Member) commented Sep 14, 2022

@kassens An additional thing we could try is to retry the yarn install command if we see one of those 503 errors, which could still happen despite caching.

We could also check whether caching is working by running the subsequent yarn install commands with the --offline flag, which I think will error if it has to reach out to the network (sketch below). If there are errors, then maybe we need to make the initial setup job actually install all the packages.
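
A quick sketch of that check; --offline is a real Yarn 1 flag that fails instead of fetching when a package is missing from the local cache:

```yaml
- run:
    name: Verify install resolves from cache only
    # Errors out if any dependency would require a network fetch,
    # which would tell us the restored cache was incomplete.
    command: yarn install --offline
```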

@kassens (Member, Author) commented Sep 14, 2022

@poteto if we remove the yarn installs from all subsequent steps and rely on node_modules caching, I don't think we need the offline flag. I do like the idea of adding a retry though for the yarn install at the very beginning. I'll add that here too.
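
A minimal sketch of that retry, assuming a plain shell one-liner in the run step (the --frozen-lockfile flag is an assumption, not something this PR confirms):

```yaml
- run:
    name: yarn install (retry once on failure)
    # Registry 503s are transient; a single retry papers over them
    # without hiding persistent failures.
    command: yarn install --frozen-lockfile || yarn install --frozen-lockfile
```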

@kassens (Member, Author) commented Sep 14, 2022

I think this is finally ready for review!

@kassens requested a review from @poteto on September 14, 2022 16:59
@kassens merged commit c327b91 into facebook:main on Sep 14, 2022
@kassens deleted the ci-caching branch on September 14, 2022 17:22
rickhanlonii pushed a commit that referenced this pull request Oct 5, 2022