Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

{ai,lib}[foss/2023a,gfbf/2023a] RAPIDS v24.4, CUDA-Python v12.1.0 w/ CUDA 12.1.1 #21058

Merged
merged 4 commits into from
Sep 25, 2024

Conversation

lexming
Copy link
Contributor

@lexming lexming commented Jul 25, 2024

(created using eb --new-pr)

Depends on:

⚠️ this needs an actual Nvidia GPU device to pass the sanity checks, and it has to be Volta or newer

@lexming lexming added the new label Jul 25, 2024
@lexming
Copy link
Contributor Author

lexming commented Jul 26, 2024

Test report by @lexming
SUCCESS
Build succeeded for 4 out of 4 (2 easyconfigs in total)
node403.hydra.os - Linux Rocky Linux 8.10, x86_64, AMD EPYC 7282 16-Core Processor (zen2), 1 x NVIDIA NVIDIA A100-PCIE-40GB, 550.90.07, Python 3.6.8
See https://gist.github.com/lexming/878bc1b22d4964b9e5203298c8a56c9e for a full test report.

@boegel boegel mentioned this pull request Aug 6, 2024
2 tasks
@VRehnberg
Copy link
Contributor

Test report by @VRehnberg
SUCCESS
Build succeeded for 1 out of 1 (1 easyconfigs in total)
alvis1-06 - Linux Rocky Linux 8.9, x86_64, Intel(R) Xeon(R) Gold 6244 CPU @ 3.60GHz, 1 x NVIDIA Tesla V100-SXM2-32GB, 550.54.14, Python 3.6.8
See https://gist.github.com/VRehnberg/9e66063671585fa966cda7d1ca88bec0 for a full test report.

@VRehnberg
Copy link
Contributor

Test report by @VRehnberg
SUCCESS
Build succeeded for 1 out of 1 (1 easyconfigs in total)
alvis1-06 - Linux Rocky Linux 8.9, x86_64, Intel(R) Xeon(R) Gold 6244 CPU @ 3.60GHz, 1 x NVIDIA Tesla V100-SXM2-32GB, 550.54.14, Python 3.6.8
See https://gist.github.com/VRehnberg/bedbdd2d519a1fcbb26a6ed8f3d39619 for a full test report.

@VRehnberg
Copy link
Contributor

Test report by @VRehnberg
SUCCESS
Build succeeded for 1 out of 1 (1 easyconfigs in total)
alvis1-06 - Linux Rocky Linux 8.9, x86_64, Intel(R) Xeon(R) Gold 6244 CPU @ 3.60GHz, 1 x NVIDIA Tesla V100-SXM2-32GB, 550.54.14, Python 3.6.8
See https://gist.github.com/VRehnberg/8ab750090d92370d20ffdc05953e02c0 for a full test report.

@VRehnberg
Copy link
Contributor

Test report by @VRehnberg
SUCCESS
Build succeeded for 1 out of 1 (1 easyconfigs in total)
alvis1-06 - Linux Rocky Linux 8.9, x86_64, Intel(R) Xeon(R) Gold 6244 CPU @ 3.60GHz, 1 x NVIDIA Tesla V100-SXM2-32GB, 550.54.14, Python 3.6.8
See https://gist.github.com/VRehnberg/b7f37707afc9014a518dfc625e78fe29 for a full test report.

@VRehnberg
Copy link
Contributor

Test report by @VRehnberg
FAILED
Build succeeded for 0 out of 1 (1 easyconfigs in total)
alvis1-06 - Linux Rocky Linux 8.9, x86_64, Intel(R) Xeon(R) Gold 6244 CPU @ 3.60GHz, 1 x NVIDIA Tesla V100-SXM2-32GB, 550.54.14, Python 3.6.8
See https://gist.github.com/VRehnberg/16b3ace385760c9341362e0bd71279a5 for a full test report.

@boegel boegel added this to the 4.x milestone Sep 25, 2024
@boegel
Copy link
Member

boegel commented Sep 25, 2024

@VRehnberg Last attempt failed with "ModuleNotFoundError: No module named 'pyarrow._orc'" because changes in #21056 were not taken into account?

@VRehnberg
Copy link
Contributor

Probably. Will try to remember to do a rerun tomorrow.

@lexming
Copy link
Contributor Author

lexming commented Sep 25, 2024

Synced with current develop branch, which will help with those issues.

@boegel
Copy link
Member

boegel commented Sep 25, 2024

@boegelbot please test @ jsc-zen3-a100

@boegelbot
Copy link
Collaborator

@boegel: Request for testing this PR well received on jsczen3l1.int.jsc-zen3.fz-juelich.de

PR test command 'if [[ develop != 'develop' ]]; then EB_BRANCH=develop ./easybuild_develop.sh 2> /dev/null 1>&2; EB_PREFIX=/home/boegelbot/easybuild/develop source init_env_easybuild_develop.sh; fi; EB_PR=21058 EB_ARGS= EB_CONTAINER= EB_REPO=easybuild-easyconfigs EB_BRANCH=develop /opt/software/slurm/bin/sbatch --job-name test_PR_21058 --ntasks=8 --partition=jsczen3g --gres=gpu:1 ~/boegelbot/eb_from_pr_upload_jsc-zen3.sh' executed!

  • exit code: 0
  • output:
Submitted batch job 4940

Test results coming soon (I hope)...

- notification for comment with ID 2374554898 processed

Message to humans: this is just bookkeeping information for me,
it is of no use to you (unless you think I have a bug, which I don't).

@boegelbot
Copy link
Collaborator

Test report by @boegelbot
SUCCESS
Build succeeded for 2 out of 2 (2 easyconfigs in total)
jsczen3g1.int.jsc-zen3.fz-juelich.de - Linux Rocky Linux 9.4, x86_64, AMD EPYC-Milan Processor (zen3), 1 x NVIDIA NVIDIA A100 80GB PCIe, 555.42.06, Python 3.9.18
See https://gist.github.com/boegelbot/9c6d0159a887875074b02e5cf66f0c7e for a full test report.

@boegel
Copy link
Member

boegel commented Sep 25, 2024

Test report by @boegel
SUCCESS
Build succeeded for 2 out of 2 (2 easyconfigs in total)
node3306.joltik.os - Linux RHEL 8.8, x86_64, Intel(R) Xeon(R) Gold 6242 CPU @ 2.80GHz, 1 x NVIDIA Tesla V100-SXM2-32GB, 545.23.08, Python 3.6.8
See https://gist.github.com/boegel/5306bfd109aa242df97cf6d5e0e11a11 for a full test report.

Copy link
Member

@boegel boegel left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm

@boegel
Copy link
Member

boegel commented Sep 25, 2024

Going in, thanks @lexming!

@boegel boegel modified the milestones: 4.x, release after 4.9.4 Sep 25, 2024
@boegel
Copy link
Member

boegel commented Sep 25, 2024

Going in, thanks @lexming!

@boegel boegel merged commit 0bcdf16 into easybuilders:develop Sep 25, 2024
9 checks passed
@lexming lexming deleted the 20240725192235_new_pr_RAPIDS244 branch September 26, 2024 07:42
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants