-
Notifications
You must be signed in to change notification settings - Fork 35
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
The same image seems to be pulled in parallel causing disk exhaustion #141
Comments
I noticed this was just implemented in the |
@aartur I am not able to reproduce this. I tried to launch 10 jobs with same image Before I launched the jobs (~55 GB of disk space)
$ nomad job status
NOTE: the pending ones are because the memory is exhausted on my VM and nomad is not able to place those allocations. After the jobs are running, disk space is still ~55 GB
Also, I checked using
|
I'm able to reproduce it by submitting 100 jobs with the following bash script: #!/bin/bash
for i in $(seq 1 100); do
cat << EOT > job.nomad
job "bash_loop_$i" {
datacenters = ["mydatacenter"]
type = "service"
group "main" {
task "main" {
driver = "containerd-driver"
config {
image = "archlinux"
command = "/bin/bash"
args = ["-c", "while [ 1 ]; do sleep 1; done"]
}
resources {
cpu = 100
memory = 30
}
}
}
}
EOT
echo "Running job $i"
nomad job run -detach job.nomad
done (
|
We have about 100 parameterized job definitions that use the same
image
config:The problem is that disk space is exhausted on Nomad clients and it looks like the reason is that the image is being pulled individually for each job, despite specifying the same exact image with the same tag. When using
docker
Nomad driver this didn't happen and all jobs made use of a single image that was pulled and extracted once.I might be wrong on the explanation but this is what I get from multiple (hundreds) of error messages like:
I.e. it looks like each allocation has it's own
extraction snapshot
? Is it possible to configure the driver (orcontainerd
) so that all jobs will share a single image snapshot?The text was updated successfully, but these errors were encountered: