Eliminate potential duplicate files (layer) in cache and possibly apply consistent layout #1982

chanseokoh · 2019-09-12T18:21:27Z

For base image layers (downloaded from registries), the layout is

<cache dir>/layers/<(1) SHA of compressed layer blob>/<(2) SHA of uncompressed layer blob>

That is, (1) is a directory name and (2) is a file name. (1) is directly visible in the manifest JSON. Different registries may use different compression levels or methods (we actually considered using a different compression level for layers from docker save), so I think it is hypothetically possible that we store the same (uncompressed) layer more than once. For example,

<cache dir>/layers/<SHA from compression level 1>/<SHA of same contents>
<cache dir>/layers/<SHA from compression level 2>/<SHA of same contents>
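
To make the concern concrete, here is a minimal Java sketch showing that identical uncompressed bytes, gzipped at two different levels, produce different compressed-blob SHAs while the SHA of the contents stays the same. This is not Jib code: the class name LayerDigestDemo and its helpers are hypothetical, and levels 0 and 9 are used only because they reliably give different output for compressible input.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;
import java.util.zip.Deflater;
import java.util.zip.GZIPOutputStream;

// Hypothetical demo class, not part of Jib.
class LayerDigestDemo {

  /** Returns the lowercase hex SHA-256 of the given bytes. */
  static String sha256Hex(byte[] data) {
    try {
      MessageDigest digest = MessageDigest.getInstance("SHA-256");
      StringBuilder hex = new StringBuilder();
      for (byte b : digest.digest(data)) {
        hex.append(String.format("%02x", b));
      }
      return hex.toString();
    } catch (NoSuchAlgorithmException e) {
      throw new IllegalStateException(e);
    }
  }

  /** Gzips the data at the given Deflater level (0 to 9). */
  static byte[] gzip(byte[] data, final int level) {
    ByteArrayOutputStream out = new ByteArrayOutputStream();
    // GZIPOutputStream has no level parameter, so set it on the
    // protected Deflater field before any data is written.
    try (GZIPOutputStream gz = new GZIPOutputStream(out) {
      {
        def.setLevel(level);
      }
    }) {
      gz.write(data);
    } catch (IOException e) {
      throw new UncheckedIOException(e);
    }
    return out.toByteArray();
  }

  public static void main(String[] args) {
    byte[] layer = new byte[20000]; // stand-in for an uncompressed layer tarball
    byte[] blobA = gzip(layer, Deflater.NO_COMPRESSION);
    byte[] blobB = gzip(layer, Deflater.BEST_COMPRESSION);

    // Same contents SHA, two different compressed-blob SHAs:
    // under the layers/ layout these land in two separate directories.
    System.out.println("contents: " + sha256Hex(layer));
    System.out.println("blob A:   " + sha256Hex(blobA));
    System.out.println("blob B:   " + sha256Hex(blobB));
  }
}
```

With the layers/ layout above, blob A and blob B would each get their own directory even though the file name inside both is the same contents SHA.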

Now, #1957 implements caching local layers, and the layout is in reverse:

<cache dir>/local/<SHA of uncompressed layer>/<SHA of compressed layer blob>

The reason is that we need to be able to query the cache using the SHA of an uncompressed layer first.
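
The two layouts and the lookup constraint can be sketched as follows. This is only an illustration under assumptions: CacheLayoutSketch and its method names are hypothetical and are not Jib's actual classes. The point is that a lookup is cheap only when the key you already know is the directory name.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.Optional;
import java.util.stream.Stream;

// Hypothetical sketch of the two cache layouts, not Jib's real code.
class CacheLayoutSketch {

  /** <cache dir>/layers/<compressed SHA>/<uncompressed SHA> */
  static Path baseImageLayerFile(Path cacheDir, String compressedSha, String uncompressedSha) {
    return cacheDir.resolve("layers").resolve(compressedSha).resolve(uncompressedSha);
  }

  /** <cache dir>/local/<uncompressed SHA>/<compressed SHA> */
  static Path localLayerFile(Path cacheDir, String uncompressedSha, String compressedSha) {
    return cacheDir.resolve("local").resolve(uncompressedSha).resolve(compressedSha);
  }

  /**
   * Finds a cached file when only the directory-level key is known.
   * For "layers" that key is the compressed SHA (known from the manifest);
   * for "local" it is the uncompressed SHA.
   */
  static Optional<Path> findByDirectoryKey(Path cacheDir, String area, String key) {
    Path dir = cacheDir.resolve(area).resolve(key);
    if (!Files.isDirectory(dir)) {
      return Optional.empty();
    }
    try (Stream<Path> files = Files.list(dir)) {
      return files.findFirst();
    } catch (IOException e) {
      throw new UncheckedIOException(e);
    }
  }
}
```

Querying the layers/ area by uncompressed SHA would instead require scanning every directory, which is why the local cache flips the order.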

It would be nice to have a consistent layout across the board. This could also remove the almost-identical duplicate code in #1957 by letting both cases follow one execution path.


chanseokoh added the area/jib-core label Sep 12, 2019

chanseokoh mentioned this issue Sep 12, 2019

Cleaning Jib's local base image cache #1956

Closed

mpeddada1 added the cleanup label Dec 30, 2020

meltsufin added the priority: p4 label Dec 2, 2022
