jobs/garbage-collection: add containers #1029
Force-pushed from d0a47ba to 94a7d95.
I guess the next step after this is actually defining GC policies for containers.
Just ran this as a temporary test job in the pipeline:
Force-pushed from 4cae8b1 to ff5e453.
Force-pushed from ff5e453 to a72aea8.
Add container tags to the garbage collection job. These can run in parallel without issues.
Pruning dev streams after 2 weeks and stable streams after 2 months, as decided in coreos/fedora-coreos-tracker#1367 (comment).
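To make those windows concrete, here is a minimal sketch of how the decided policy could be expressed and checked. The stream classification, the list of dev streams, and the helper names below are assumptions for illustration, not the actual pipeline code.

```groovy
// Sketch only: the agreed retention windows (2 weeks for dev streams,
// 2 months for stable streams), applied to a container tag's build time.
import java.time.Duration
import java.time.Instant

Duration retentionFor(String stream) {
    // illustrative classification; the real list of dev streams may differ
    def devStreams = ['next-devel', 'testing-devel', 'rawhide', 'branched']
    return devStreams.contains(stream) ? Duration.ofDays(14) : Duration.ofDays(60)
}

boolean tagIsPrunable(String stream, Instant tagCreated) {
    def cutoff = Instant.now().minus(retentionFor(stream))
    return tagCreated.isBefore(cutoff)
}

// example: a three-week-old tag on a dev stream is past its window
assert tagIsPrunable('testing-devel', Instant.now().minus(Duration.ofDays(21)))
```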
Force-pushed from a72aea8 to 270fcc9.
I'm wondering if we should incorporate the container-prune and cloud-prune code together and also update the builds.json when we've pruned container images. This is more heavyweight, but it brings the GC into alignment (i.e. operating on each build). The OSTree pruning that we do today, which I set up a while back, will probably go away eventually, so I don't think we need to worry about making that process align.
```groovy
lock(resource: "builds-json-${params.STREAM}") {
    stage('Upload Builds JSON') {
        // containers tags and cloud artifacts can be GCed in parallel
        def parallelruns = [:]
```
I'm with @jlebon here. I think we should run things serially, since the job is 100% not time critical. If things fail in either stage of this pipeline, I'm hoping it will be very clear what went wrong, when it went wrong, and what did and did not happen.
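For illustration, a serial arrangement in the same scripted-pipeline style as the snippet above might look roughly like the sketch below. The stage names, the cosa subcommands, and their flags are placeholders, not the exact code in this PR; the only point is that each pruning step becomes its own stage, so a failure stops the job at an obvious place.

```groovy
// Sketch of the serial alternative to the parallelruns map.
lock(resource: "builds-json-${params.STREAM}") {
    stage('Prune Cloud Artifacts') {
        // placeholder invocation; the real command and flags may differ
        sh "cosa cloud-prune --stream ${params.STREAM}"
    }
    stage('Prune Container Tags') {
        // runs only if the cloud pruning above succeeded
        sh "cosa container-prune --stream ${params.STREAM}"
    }
    stage('Upload Builds JSON') {
        // persist the updated build metadata only after both steps succeed
        echo "upload the updated builds.json here"
    }
}
```

The only trade-off versus the parallel version is wall-clock time, which, as noted above, is not critical for this job.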
We should probably match the policy used for our OSTree repo for now.
For our production streams we don't prune at all (maybe we should, but this would affect our extended upgrade testing):
For our non-production streams we prune at 90 days:
> For our production streams we don't prune at all (maybe we should, but this would affect our extended upgrade testing):

Ahh, I see you have special handling in the code for barrier releases.
Some backstory on this in coreos/coreos-assembler#3798 (comment). I think for merging to make sense, we would want to flip things around so that the source of truth for pruning container images is also the build metadata, rather than the container repos. Otherwise, they're quite disparate currently. Though actually, at a higher level what we probably want is both. E.g. taking AMIs as an example:
This could be applied to any remote resource, including container registries and other clouds. It ensures that we also eventually GC things we've pushed but which, because of pipeline failures, never got registered in any build metadata. This is the difference between what …
Yes, I agree. We'd basically consult the meta.json and builds.json and prune based on that, rather than using skopeo to gather all the tags in the repo.
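To make the two directions concrete, here is a toy sketch of the distinction between pruning from build metadata and enumerating the registry. Everything in it is invented for illustration: the tag values, both collections, and the idea that a skopeo-based listing is what would fill the registry side.

```groovy
// Invented data: what the registry reports vs. what builds.json/meta.json
// reference. Neither collection comes from a real API call here.
def tagsInRegistry        = ['41.20240101.1.0', '41.20240108.2.0', 'stable'] as Set
def tagsKnownToBuildsJson = ['41.20240108.2.0', 'stable'] as Set

// Metadata-driven pruning works from tagsKnownToBuildsJson (the direction
// described above); "universe" pruning additionally catches tags that were
// pushed but never registered because the pipeline failed partway through.
def orphaned = tagsInRegistry - tagsKnownToBuildsJson
assert orphaned == ['41.20240101.1.0'] as Set
```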
To do it that way we'd have to build a list of AMIs from all streams each time we ran, which would be pretty heavyweight. And we'd also have to consider that we aren't the only ones that create and want to retain AMIs in the account.
Yeah. It's good to periodically try to clean up things that fell through the cracks, but I do think it would make things more complicated.
Quoting @jlebon's message from the original PR to add some context:
I don't really understand why I didn't question this, as …
So @dustymabe recommends that we follow the same policies that are used for all the other images, but different rules were decided in a community meeting (albeit a long time ago): coreos/fedora-coreos-tracker#1367 (comment)
I ended up implementing it anyway, so I think this point is no longer relevant.
Alright, I will go and look at cosa and see what I can come up with.
:) - maybe hold off on making any changes for now. I think we are still discussing what the ideal outcome is. And of course there is "ideal" and there is "already works". Sometimes "already works" wins too.
I think we already do that in the cloud-prune code: https://github.com/coreos/coreos-assembler/blob/main/src/cmd-cloud-prune#L123C31-L123C37
No. If the build doesn't need to be pruned, we break. We don't need to calculate every AMI that exists before we can prune any. I think there are two cases we are talking about here: …
Right now …
For GC, I don't think we should be worried too much about how much time it takes given that it's not time critical; we could also easily lower the cadence if it's a problem?
Yes, definitely. This is relevant for Fedora where we do this. I think it's to everyone's benefit to have resources properly tagged anyway per team/group (which I think we've started doing for FCOS at least recently-ish?). And that's helpful too for querying since most APIs have ways to limit/filter to make it less expensive. Anyway, to be clear I'm not suggesting we pivot to this right away. I think both approaches are "kinda right" and that the ideal IMO is "both" (which I guess means... yeah I think eventually we should probably merge because we'd do this for other resources too). But it's not worth blocking getting GC set up for it.
I also think any restructuring should wait until we look at RHCOS GC requirements too, which are going to be very different from FCOS. (E.g. any AMI that was once listed as a bootimage in the installer cannot be pruned; this is similar in a way to how we cross-check with barrier releases. Also, listing tags in the prod OCP repo is expensive because we use a monorepo approach, so starting from build metadata might be better. But actually... it's very possible that GC in that repo is being taken care of at a higher OCP level anyway and we shouldn't touch it at all.)
It's not necessarily time for me, but waste. If we change it to the "calculate the universe" method, then we literally have to grab every meta.json, for every architecture, for every build in the build history before we can do any pruning. That's pretty wasteful IMO. Now, there are some "cloud resources" that make this much easier (for example, in the … case). Either way, I think what I'm saying is I like the more methodical approach, where we don't have to calculate the universe first.
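As a rough illustration of that methodical direction, the early-exit idea is that once the walk reaches a build still inside the retention window, every newer build is too, so nothing else needs to be fetched. The build list, timestamps, and ordering below are made up; this is not the actual cmd-cloud-prune code.

```groovy
import java.time.Duration
import java.time.Instant

def retention = Duration.ofDays(60)
def cutoff = Instant.now().minus(retention)

// toy data standing in for build metadata, newest first (assumed ordering)
def builds = [
    [id: '41.20250110.2.0', created: Instant.now().minus(Duration.ofDays(5))],
    [id: '41.20241001.2.0', created: Instant.now().minus(Duration.ofDays(120))],
    [id: '40.20240601.2.0', created: Instant.now().minus(Duration.ofDays(240))],
]

for (build in builds.reverse()) {   // walk oldest to newest
    if (build.created.isAfter(cutoff)) {
        // reached a build still inside the window; all newer builds are too,
        // so stop here instead of enumerating the whole universe
        break
    }
    println "would prune remote resources for ${build.id}"
}
```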
Yes, we do tag now, but we don't have that for RHCOS, so we'd have to support one strategy for one and another for the other.
Ehh, "both" is just more work IMO. The only problem with not blocking is that we do do accounting to keep track of what has been pruned, so it would be nice to have that be correct, but I suspect we could just make the code do the right thing if the remote resource doesn't exist, and it wouldn't be that big of a deal.
Yeah, that does make me think: for FCOS we should probably consider barrier releases. Currently we don't prune any production streams, but if we had the ability to skip barrier releases in the pruning, then I think my stance on that would change quickly. For RHCOS and FCOS we could have an exception mechanism built into the GC to ignore specific versions (barriers for FCOS, bootimages in the installer for RHCOS).
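A hypothetical shape for that exception mechanism could be a per-distro deny-list consulted before anything belonging to a build is pruned. Nothing below exists in the pipeline today; the version strings are examples only, not an authoritative list of barriers or installer bootimages.

```groovy
// Illustrative deny-list: versions that GC must never touch, regardless of age.
def neverPrune = [
    fcos : ['36.20220505.3.2', '38.20230609.3.0'] as Set,   // e.g. barrier releases
    rhcos: ['414.92.202310210434-0'] as Set,                 // e.g. installer bootimage
]

def prunable = { String distro, String version ->
    !(neverPrune[distro]?.contains(version) ?: false)
}

assert !prunable('fcos', '36.20220505.3.2')   // protected
assert prunable('fcos', '39.20231101.3.0')    // eligible for GC
```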
True, though we can easily cache this.
Right, I think how we approached this is the right call for a first pass (both for the cloud case and the container case). I'm saying that ideally we extend this in the future.
We could/should add it. But anyway, for RHCOS AMIs specifically, I'm pretty sure we already do some ad-hoc pruning currently, and AFAIK that operation doesn't go through build metadata but through OCP releases. I think there it basically does "(the universe) - (AMIs in OCP releases)" (@mike-nguyen would know more).
Right yeah, all the pruning code on the cloud-prune side should already handle the ENOENT case. So if we ever pivot container-prune, we'd want to retain that too.
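A tiny sketch of that tolerate-already-gone behavior is below; the deleteTag closure is a stub standing in for whatever actually removes a tag, not a real cosa or skopeo call. The idea is simply to treat not-found as already pruned so re-runs stay idempotent.

```groovy
// Stub that pretends the tag is already gone, so we can show the handling.
def deleteTag = { String repo, String tag ->
    throw new FileNotFoundException("${repo}:${tag}")
}

def repo = 'quay.io/fedora/fedora-coreos'   // example repo
def tag  = '38.20230101.1.0'                // example tag

try {
    deleteTag(repo, tag)
} catch (FileNotFoundException e) {
    // the resource was already removed (the ENOENT case): count it as pruned
    // rather than failing the job
    println "tag ${tag} already gone from ${repo}, nothing to do"
}
```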
Hmm, in my mind the goal is to eventually roll out the cloud pruning to production streams, but indeed, looking at coreos/fedora-coreos-tracker#99, we never agreed on any time window for those. Are you saying that even then, we'd want to retain the bootimages from barrier releases? At the risk of stating the obvious, it's only really needed for OSTree commits (hard/awkward) and container images (easy).