
set specific GPU architectures for builds #160

Merged 7 commits on Oct 10, 2024

Conversation

jameslamb
Member

@jameslamb jameslamb commented Oct 8, 2024

Contributes to #115

While inspecting the size of the conda packages, I was surprised to see that they were tiny (2.5 MB for the GPU variant on Python 3.12). I investigated and realized that in CI builds, legate-boost is not currently targeting a specific set of GPU architectures.

It's falling back to native. Since builds are done on runners without a GPU, the packages end up built for whatever single default target architecture the version of CMake being pulled in chooses (see the CMake docs on that).

This PR proposes matching legate and cunumeric's behavior, setting CUDAARCHS="all-major". @RAMitchell @trivialfis if you'd prefer to hard-code a specific set of architectures like 80;86;90 or similar, let me know.
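For reference, a minimal sketch of how that setting flows into a CMake build. This is an assumed invocation for illustration, not the project's actual CI script:

```shell
# CMake >= 3.20 reads the CUDAARCHS environment variable to initialize
# CMAKE_CUDA_ARCHITECTURES. The "all-major" value (CMake >= 3.23) compiles
# for every supported major architecture, instead of "native", which targets
# only whatever GPU (or compiler default) the build machine happens to have.
export CUDAARCHS="all-major"
cmake -B build -S .

# Equivalent explicit form, without the environment variable:
cmake -B build -S . -DCMAKE_CUDA_ARCHITECTURES="all-major"
```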

Notes for Reviewers

Bad (good?) timing... as I opened this, pre-commit 4.0 came out, breaking some of the configuration here, which relied on hooks incompatible with that version. The pre-commit-related changes you see in the diff fix that.

Mainly:

@jameslamb jameslamb added improvement Improves an existing functionality non-breaking Introduces a non-breaking change labels Oct 8, 2024
@jameslamb jameslamb changed the title WIP: set specific GPU architectures for builds set specific GPU architectures for builds Oct 10, 2024
@jameslamb jameslamb marked this pull request as ready for review October 10, 2024 15:37
@RAMitchell
Contributor

Following cunumeric is perfect, although I suspect they might be compiling for architectures they don't support (@Jacobfaib, does this make sense?). The downside is large binaries; this can get pretty annoying for users when it gets beyond 300 MB.

@jameslamb
Member Author

The downside is large binaries - this can get pretty annoying for users when it gets > 300mb.

Ah yeah! I should have mentioned... with this change, legate-boost conda packages only grow to around 6 MB.

legate-boost 0.1.0 cuda12_py312_0_gpu
-------------------------------------
file name   : legate-boost-0.1.0-cuda12_py312_0_gpu.tar.bz2
name        : legate-boost
version     : 0.1.0
build       : cuda12_py312_0_gpu
build number: 0
size        : 6.2 MB

(build link)

@Jacobfaib
Contributor

Although I suspect they might be compiling for architectures they don't support

Entirely likely. I'll be honest, I don't think we thought that hard about it. Someone at some point said "you should compile for all-major" and so we did. Downside of course is that the binaries are downright hefty.

I'd be careful with big binaries, though, if you are doing ABI-versioned symlinks and shipping Python wheels. Python wheels don't support symlinks; instead, they store deep copies of everything. So what used to be

libfoo.so.1.2.3 # 200MB
libfoo.so.1 -> libfoo.so.1.2.3
libfoo.so -> libfoo.so.1.2.3

will now be

libfoo.so.1.2.3 # 200MB
libfoo.so.1 # 200MB
libfoo.so # 200MB

That's partly the reason why cunumeric is such a big install...
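The symlink-dereferencing behavior above is easy to demonstrate, since wheels are zip archives underneath. A small sketch (hypothetical file names, not legate-boost's actual packaging code):

```python
import os
import tempfile
import zipfile

# Wheels are zip archives, and the zip format (as written by Python's
# zipfile module) does not represent symlinks: archiving a symlink
# stores a full copy of the target's bytes instead.
with tempfile.TemporaryDirectory() as d:
    real = os.path.join(d, "libfoo.so.1.2.3")
    with open(real, "wb") as f:
        f.write(b"\x00" * (1024 * 1024))  # stand-in for a 1 MiB shared library

    link = os.path.join(d, "libfoo.so.1")
    os.symlink(real, link)  # on disk: a tiny link, near-zero extra space

    archive = os.path.join(d, "pkg.zip")
    with zipfile.ZipFile(archive, "w") as z:
        # ZipFile.write() opens the path, so the symlink is dereferenced
        # and a second full copy of the library lands in the archive.
        z.write(real, "libfoo.so.1.2.3")
        z.write(link, "libfoo.so.1")

    with zipfile.ZipFile(archive) as z:
        sizes = {info.filename: info.file_size for info in z.infolist()}

print(sizes)  # both entries report the full 1 MiB uncompressed size
```

Ship three ABI-versioned names that way and the wheel carries three full copies of the library, which is exactly the blow-up described above.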

@jameslamb
Copy link
Member Author

Thanks for that note @Jacobfaib , it's an important one!

For now, legate-boost is targeting publishing conda packages only, not wheels, and not worrying about exporting its shared library for use by other projects.

@jameslamb jameslamb merged commit fd04bbc into rapidsai:main Oct 10, 2024
9 checks passed
@jameslamb jameslamb deleted the ci/architectures branch October 10, 2024 16:33