2.0 regression: large overhead of `libsolv`'s `solver_unifyrules` when multichannels are used #3393

Hind-M · 2024-08-05T10:08:42Z

From @ndevenish in the QS lobby on gitter:
"
Are there any known issues with current micromamba about resource usage, possibly related to CentOS/RHEL? I've had two separate people come to me this week with issues with:
a) using micromamba in a container build dying because it filled their entire temp disk (when installing very few packages).
b) what looked like being OOM-killed after taking >60% of their memory. Both are tasks which have worked before.

The out-of-disk-space instance was running:
micromamba create -y -c conda-forge gnuplot python numpy pymca workflows>=1.7 xraylib zocalo
and it took at least 4GB of scratch disk space (the smallest of the possible locations that podman was using for container work on their system).

The other instance didn't get past resolving (an admittedly rather large requirement) but was using >9GB of ram on a 16GB machine the last time I checked before it died.
"

Hind-M · 2024-08-05T10:10:40Z

Used micromamba version: 2.0.0rc0

jjerphan · 2024-08-05T12:14:27Z

I cannot reproduce the errors which you report using conda-forge/label/micromamba_dev/linux-64/micromamba-2.0.0{rc0-1,rc1-2}.

On my machine, installing those packages takes around 1.5GiB of storage in the $CONDA_PREFIX, while using less than 1GiB of RAM.

@ndevenish: Could you provide the difference of your instances' resource usage when using micromamba<2.0.0rc0 and micromamba>=2.0.0rc0?

ndevenish · 2024-09-26T08:07:44Z

When this ticket was made it was a while since I had seen it happen to people.

Now that 2.0.0 is out, I am seeing this happen on CI.

ndevenish · 2024-09-26T08:09:01Z

On this environment file

ndevenish · 2024-09-26T08:10:02Z

This is exactly 700 GB btw

ndevenish · 2024-09-28T21:45:45Z

RHEL8, 16GB memory machine:

curl -JLO https://raw.githubusercontent.com/dials/dials/refs/heads/main/.conda-envs/linux.txt
curl -Ls https://micro.mamba.pm/api/micromamba/linux-64/latest | tar -xvj bin/micromamba
psrecord --plot out.png "bin/micromamba create -yp ENV/ -c conda-forge --file linux.txt"

% bin/micromamba --version
2.0.0

ndevenish · 2024-09-29T11:17:32Z

Possibly because it seems to be in a package-cache-fetching loop?
https://github.com/user-attachments/assets/cf71deec-db90-4735-93b1-b8e6365f2fe7

jjerphan · 2024-10-01T10:01:08Z

The repodata.json is reparsed for each package (since conda-forge:: is specified for every one of them), causing major resource usage.

This is a regression of micromamba 2.0.0.

jjerphan · 2024-10-01T13:33:01Z

From bisecting, e874e7e from #2986 is the culprit.
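The thread does not show the commands used for the bisection. As a rough illustration of the technique only, here is a minimal sketch on a throwaway repository: the repository, the `value.txt` file, and the pass/fail check are all hypothetical stand-ins. For the real regression the check would be something like a time- or memory-limited micromamba run.

```shell
set -eu

# Toy repository standing in for the real one (hypothetical content).
repo=$(mktemp -d)
cd "$repo"
git init -q
git config user.email "demo@example.com"
git config user.name "demo"

# Five commits; the simulated "regression" is any value of 4 or more.
for i in 1 2 3 4 5; do
  echo "$i" > value.txt
  git add value.txt
  git commit -q -m "commit $i"
done

# Mark HEAD as bad and the first commit as good, then let git drive the search.
git bisect start HEAD HEAD~4 > /dev/null
# The check exits 0 for a good commit and non-zero for a bad one.
git bisect run sh -c 'test "$(cat value.txt)" -lt 4' > /dev/null

# refs/bisect/bad now points at the first bad commit.
first_bad=$(git log -1 --format=%s refs/bisect/bad)
git bisect reset > /dev/null 2>&1

echo "first bad commit: $first_bad"
```

`git bisect run` needs only a command whose exit status separates good from bad, so the same loop applies to any reproducible regression check.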

ndevenish · 2024-10-01T14:28:22Z

Ah, excellent detective work. Removing the conda-forge:: prefix sounds like it should give us a way to solve the problem before a more widespread fix. From recollection, we started doing that in order to prevent pulling in from other places, but I think the only way that we generate installations now avoids that completely, so it shouldn't be needed any more.
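A minimal sketch of that workaround, assuming the spec file lists one package per line with the `conda-forge::` prefix at the start of the line. The sample file contents and the `linux-noprefix.txt` name are made up for illustration.

```shell
set -eu

# Hypothetical sample spec file; the real linux.txt may be laid out differently.
workdir=$(mktemp -d)
cat > "$workdir/linux.txt" <<'EOF'
conda-forge::python>=3.10
conda-forge::numpy
# comment line
pip
EOF

# Drop the leading channel prefix; lines without it are left untouched.
sed -e 's/^conda-forge:://' "$workdir/linux.txt" > "$workdir/linux-noprefix.txt"

cat "$workdir/linux-noprefix.txt"
```

The stripped file could then be passed to the same command as in the reproduction above, e.g. `bin/micromamba create -yp ENV/ -c conda-forge --file linux-noprefix.txt`, so that the channel is named once via `-c` rather than once per spec.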

jjerphan · 2024-10-01T14:42:34Z

Yes, we must only parse the subdirectory once.

conda-forge:: prefix on package specification was causing redownload and reparsing for every dependency. See mamba-org/mamba#3393

jjerphan · 2024-10-10T12:28:54Z

jjerphan:mamba:fix/parsing-subdir is a WIP branch to resolve this issue; it is currently blocked by jbeder/yaml-cpp#1322.

jjerphan · 2024-10-10T15:06:46Z

Actually, the channel duplication is not the only cause: even after it is corrected, most of the runtime is spent in a costly quicksort execution in libsolv's solver_unifyrules.

Using samply:

samply record $HOME/dev/mamba/build/micromamba/micromamba create -yp /tmp/5ENV/ -c conda-forge --file /tmp/linux.txt

With the conda-forge:: prefix:

Without the conda-forge:: prefix:

I guess this might be due to the comparison function used for package solvables when the resolution is run.

jjerphan · 2024-10-17T14:12:25Z

Bisecting indicates that the regression was first introduced by e874e7e, the merge commit of #2986.

dagewa added a commit to dials/dials that referenced this issue Sep 28, 2024

Revert to an older release of micromamba to get builds working again

b491c22

See mamba-org/mamba#3393

jjerphan self-assigned this Oct 1, 2024

ndevenish added a commit to cctbx/dxtbx that referenced this issue Oct 2, 2024

MNT: Pin old micromamba to avoid issue

9cbe8d5

mamba-org/mamba#3393

ndevenish added a commit to ndevenish/dials-fork that referenced this issue Oct 10, 2024

Work around micromamba 2.0 memory issue

562d951

See mamba-org/mamba#3393

ndevenish mentioned this issue Oct 10, 2024

Work around micromamba 2.0 memory issue dials/dials#2768

Merged

ndevenish added a commit to dials/dials that referenced this issue Oct 10, 2024

Work around micromamba 2.0 memory issue (#2768)

b0c8ae1

conda-forge:: prefix on package specification was causing redownload and reparsing for every dependency. See mamba-org/mamba#3393

dagewa added a commit to dials/dials that referenced this issue Oct 10, 2024

Revert to an older release of micromamba to get builds working again

f7407aa

See mamba-org/mamba#3393

dagewa pushed a commit to dials/dials that referenced this issue Oct 10, 2024

Work around micromamba 2.0 memory issue (#2768)

8a32309

conda-forge:: prefix on package specification was causing redownload and reparsing for every dependency. See mamba-org/mamba#3393

jjerphan mentioned this issue Oct 10, 2024

fix: Only register channels in the context once #3521

Merged

NewUserHa mentioned this issue Oct 14, 2024

mamba high memory usage on windows #3534

Open

3 tasks

jjerphan mentioned this issue Oct 17, 2024

micromamba 2.0.0: environment creation process killed when using several channels #3482

Open

3 tasks

jjerphan changed the title ~~Resource usage with micromamba~~ 2.0 regression: large overhead of libsolv's solver_unifyrules when multichannels are used Oct 17, 2024

jjerphan closed this as completed in #3521 Oct 23, 2024
