Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[WIP][FEA]: Validate cuda.parallel type matching in build and execution #2429

Draft
wants to merge 1 commit into
base: main
Choose a base branch
from

Conversation

rwgk
Copy link
Contributor

@rwgk rwgk commented Sep 18, 2024

Description

Closes: #2416

WIP, starting with a brute-force experiment. (I still don't have a Linux desktop.)

These are the failing tests:

40_cuda__python__CTK12.5_nvcc_GCC____GCC13__Test_amd64__V100_.txt:

2024-09-19T02:02:59.9853501Z =========================== short test summary info ============================
2024-09-19T02:02:59.9854748Z FAILED tests/test_reduce.py::test_device_reduce[uint8] - AssertionError
2024-09-19T02:02:59.9856076Z FAILED tests/test_reduce.py::test_device_reduce[uint16] - AssertionError
2024-09-19T02:02:59.9857378Z FAILED tests/test_reduce.py::test_device_reduce[uint32] - AssertionError
2024-09-19T02:02:59.9858691Z FAILED tests/test_reduce.py::test_device_reduce[uint64] - AssertionError
2024-09-19T02:02:59.9859991Z FAILED tests/test_reduce.py::test_complex_device_reduce - AssertionError
2024-09-19T02:02:59.9861276Z FAILED tests/test_reduce_api.py::test_device_reduce - AssertionError
2024-09-19T02:02:59.9862306Z ============================== 6 failed in 15.00s ==============================

Checklist

  • New or existing tests cover these changes.
  • The documentation is up to date with these changes.

Copy link
Contributor

🟨 CI finished in 6h 01m: Pass: 97%/437 | Total: 2d 12h | Avg: 8m 18s | Max: 1h 27m | Hits: 99%/41645
  • 🟨 cub: Pass: 93%/136 | Total: 23h 20m | Avg: 10m 17s | Max: 1h 27m | Hits: 99%/4362

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  92%/128 | Total: 22h 47m | Avg: 10m 40s | Max:  1h 27m | Hits:  99%/4362  
      🟩 arm64              Pass: 100%/8   | Total: 33m 37s | Avg:  4m 12s | Max:  4m 26s
    🔍 ctk: 12.6 🔍
      🟩 11.1               Pass: 100%/15  | Total:  1h 07m | Avg:  4m 31s | Max: 16m 51s | Hits:  99%/727   
      🟩 11.8               Pass: 100%/3   | Total: 14m 18s | Avg:  4m 46s | Max:  5m 03s
      🔍 12.6               Pass:  92%/118 | Total: 21h 58m | Avg: 11m 10s | Max:  1h 27m | Hits:  99%/3635  
    🔍 cudacxx: nvcc12.6 🔍
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  7m 38s | Avg:  3m 49s | Max:  3m 54s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 07m | Avg:  4m 31s | Max: 16m 51s | Hits:  99%/727   
      🟩 nvcc11.8           Pass: 100%/3   | Total: 14m 18s | Avg:  4m 46s | Max:  5m 03s
      🔍 nvcc12.6           Pass:  92%/116 | Total: 21h 51m | Avg: 11m 18s | Max:  1h 27m | Hits:  99%/3635  
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total:  7m 38s | Avg:  3m 49s | Max:  3m 54s
      🔍 nvcc               Pass:  93%/134 | Total: 23h 13m | Avg: 10m 23s | Max:  1h 27m | Hits:  99%/4362  
    🟨 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 27m 31s | Avg:  4m 35s | Max:  5m 41s
      🟩 Clang10            Pass: 100%/3   | Total: 16m 06s | Avg:  5m 22s | Max:  5m 54s
      🟩 Clang11            Pass: 100%/4   | Total: 18m 27s | Avg:  4m 36s | Max:  4m 42s
      🟩 Clang12            Pass: 100%/4   | Total: 18m 03s | Avg:  4m 30s | Max:  4m 44s
      🟩 Clang13            Pass: 100%/4   | Total: 18m 25s | Avg:  4m 36s | Max:  4m 53s
      🟩 Clang14            Pass: 100%/4   | Total: 19m 04s | Avg:  4m 46s | Max:  5m 18s
      🟩 Clang15            Pass: 100%/4   | Total: 20m 57s | Avg:  5m 14s | Max:  5m 32s
      🟩 Clang16            Pass: 100%/4   | Total: 19m 11s | Avg:  4m 47s | Max:  5m 37s
      🟩 Clang17            Pass: 100%/4   | Total: 19m 40s | Avg:  4m 55s | Max:  5m 16s
      🟨 Clang18            Pass:  84%/26  | Total:  7h 21m | Avg: 16m 57s | Max: 42m 29s
      🟩 GCC6               Pass: 100%/2   | Total:  7m 40s | Avg:  3m 50s | Max:  4m 10s
      🟩 GCC7               Pass: 100%/6   | Total: 24m 25s | Avg:  4m 04s | Max:  5m 04s
      🟩 GCC8               Pass: 100%/6   | Total: 22m 27s | Avg:  3m 44s | Max:  4m 07s
      🟩 GCC9               Pass: 100%/6   | Total: 24m 37s | Avg:  4m 06s | Max:  4m 43s
      🟩 GCC10              Pass: 100%/4   | Total: 17m 38s | Avg:  4m 24s | Max:  4m 45s
      🟩 GCC11              Pass: 100%/7   | Total: 31m 41s | Avg:  4m 31s | Max:  5m 03s
      🟩 GCC12              Pass: 100%/4   | Total: 18m 19s | Avg:  4m 34s | Max:  4m 59s
      🟨 GCC13              Pass:  82%/29  | Total:  8h 55m | Avg: 18m 28s | Max:  1h 27m
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 16m 14s | Avg:  5m 24s | Max:  5m 33s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 16m 51s | Avg: 16m 51s | Max: 16m 51s | Hits:  99%/727   
      🟩 MSVC14.29          Pass: 100%/2   | Total: 25m 16s | Avg: 12m 38s | Max: 12m 39s | Hits:  99%/1454  
      🟩 MSVC14.39          Pass: 100%/3   | Total: 41m 14s | Avg: 13m 44s | Max: 14m 02s | Hits:  99%/2181  
    🟨 cxx_family
      🟨 Clang              Pass:  93%/63  | Total: 10h 18m | Avg:  9m 49s | Max: 42m 29s
      🟨 GCC                Pass:  92%/64  | Total: 11h 22m | Avg: 10m 40s | Max:  1h 27m
      🟩 Intel              Pass: 100%/3   | Total: 16m 14s | Avg:  5m 24s | Max:  5m 33s
      🟩 MSVC               Pass: 100%/6   | Total:  1h 23m | Avg: 13m 53s | Max: 16m 51s | Hits:  99%/4362  
    🟨 jobs
      🟩 Build              Pass: 100%/103 | Total:  8h 41m | Avg:  5m 04s | Max: 16m 51s | Hits:  99%/4362  
      🟩 DeviceLaunch       Pass: 100%/8   | Total:  4h 01m | Avg: 30m 09s | Max:  1h 27m
      🟩 GraphCapture       Pass: 100%/8   | Total:  2h 37m | Avg: 19m 40s | Max: 29m 36s
      🟩 HostLaunch         Pass: 100%/8   | Total:  2h 55m | Avg: 21m 53s | Max: 30m 19s
      🟥 SmallGMem          Pass:   0%/1   | Total: 40m 26s | Avg: 40m 26s | Max: 40m 26s
      🟥 TestGPU            Pass:   0%/8   | Total:  4h 24m | Avg: 33m 04s | Max: 49m 23s
    🟨 gpu
      🟨 v100               Pass:  93%/136 | Total: 23h 20m | Avg: 10m 17s | Max:  1h 27m | Hits:  99%/4362  
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 14m 18s | Avg:  4m 46s | Max:  5m 03s
      🟩 90a                Pass: 100%/4   | Total: 15m 07s | Avg:  3m 46s | Max:  3m 56s
    🟨 std
      🟨 11                 Pass:  94%/35  | Total:  4h 54m | Avg:  8m 25s | Max: 28m 04s
      🟨 14                 Pass:  94%/38  | Total:  6h 54m | Avg: 10m 54s | Max:  1h 27m | Hits:  99%/2181  
      🟨 17                 Pass:  92%/38  | Total:  6h 57m | Avg: 10m 59s | Max: 40m 26s | Hits:  99%/1454  
      🟨 20                 Pass:  92%/25  | Total:  4h 34m | Avg: 10m 58s | Max: 42m 29s | Hits:  99%/727   
    
  • 🟥 pycuda: Pass: 0%/1 | Total: 12m 44s | Avg: 12m 44s | Max: 12m 44s

    🟥 cpu
      🟥 amd64              Pass:   0%/1   | Total: 12m 44s | Avg: 12m 44s | Max: 12m 44s
    🟥 ctk
      🟥 12.5               Pass:   0%/1   | Total: 12m 44s | Avg: 12m 44s | Max: 12m 44s
    🟥 cudacxx
      🟥 nvcc12.5           Pass:   0%/1   | Total: 12m 44s | Avg: 12m 44s | Max: 12m 44s
    🟥 cudacxx_family
      🟥 nvcc               Pass:   0%/1   | Total: 12m 44s | Avg: 12m 44s | Max: 12m 44s
    🟥 cxx
      🟥 GCC13              Pass:   0%/1   | Total: 12m 44s | Avg: 12m 44s | Max: 12m 44s
    🟥 cxx_family
      🟥 GCC                Pass:   0%/1   | Total: 12m 44s | Avg: 12m 44s | Max: 12m 44s
    🟥 gpu
      🟥 v100               Pass:   0%/1   | Total: 12m 44s | Avg: 12m 44s | Max: 12m 44s
    🟥 jobs
      🟥 Test               Pass:   0%/1   | Total: 12m 44s | Avg: 12m 44s | Max: 12m 44s
    
  • 🟩 thrust: Pass: 100%/122 | Total: 14h 15m | Avg: 7m 00s | Max: 38m 31s | Hits: 99%/20070

    🟩 cpu
      🟩 amd64              Pass: 100%/114 | Total: 13h 38m | Avg:  7m 10s | Max: 38m 31s | Hits:  99%/20070 
      🟩 arm64              Pass: 100%/8   | Total: 37m 30s | Avg:  4m 41s | Max:  5m 12s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 19m | Avg:  5m 17s | Max: 20m 37s | Hits:  99%/2230  
      🟩 11.8               Pass: 100%/3   | Total: 15m 34s | Avg:  5m 11s | Max:  5m 53s
      🟩 12.6               Pass: 100%/104 | Total: 12h 40m | Avg:  7m 19s | Max: 38m 31s | Hits:  99%/17840 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 10m 18s | Avg:  5m 09s | Max:  5m 20s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 19m | Avg:  5m 17s | Max: 20m 37s | Hits:  99%/2230  
      🟩 nvcc11.8           Pass: 100%/3   | Total: 15m 34s | Avg:  5m 11s | Max:  5m 53s
      🟩 nvcc12.6           Pass: 100%/102 | Total: 12h 30m | Avg:  7m 21s | Max: 38m 31s | Hits:  99%/17840 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 10m 18s | Avg:  5m 09s | Max:  5m 20s
      🟩 nvcc               Pass: 100%/120 | Total: 14h 05m | Avg:  7m 02s | Max: 38m 31s | Hits:  99%/20070 
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 31m 25s | Avg:  5m 14s | Max:  6m 21s
      🟩 Clang10            Pass: 100%/3   | Total: 18m 03s | Avg:  6m 01s | Max:  6m 27s
      🟩 Clang11            Pass: 100%/4   | Total: 20m 57s | Avg:  5m 14s | Max:  5m 39s
      🟩 Clang12            Pass: 100%/4   | Total: 19m 38s | Avg:  4m 54s | Max:  5m 02s
      🟩 Clang13            Pass: 100%/4   | Total: 21m 03s | Avg:  5m 15s | Max:  5m 47s
      🟩 Clang14            Pass: 100%/4   | Total: 20m 42s | Avg:  5m 10s | Max:  5m 27s
      🟩 Clang15            Pass: 100%/4   | Total: 20m 32s | Avg:  5m 08s | Max:  5m 40s
      🟩 Clang16            Pass: 100%/4   | Total: 20m 59s | Avg:  5m 14s | Max:  5m 45s
      🟩 Clang17            Pass: 100%/4   | Total: 20m 52s | Avg:  5m 13s | Max:  5m 38s
      🟩 Clang18            Pass: 100%/18  | Total:  2h 03m | Avg:  6m 53s | Max: 13m 33s
      🟩 GCC6               Pass: 100%/2   | Total:  8m 27s | Avg:  4m 13s | Max:  4m 27s
      🟩 GCC7               Pass: 100%/6   | Total: 26m 19s | Avg:  4m 23s | Max:  4m 59s
      🟩 GCC8               Pass: 100%/6   | Total: 27m 55s | Avg:  4m 39s | Max:  5m 25s
      🟩 GCC9               Pass: 100%/6   | Total: 26m 47s | Avg:  4m 27s | Max:  5m 17s
      🟩 GCC10              Pass: 100%/4   | Total: 19m 56s | Avg:  4m 59s | Max:  5m 14s
      🟩 GCC11              Pass: 100%/7   | Total: 36m 54s | Avg:  5m 16s | Max:  5m 53s
      🟩 GCC12              Pass: 100%/4   | Total: 21m 24s | Avg:  5m 21s | Max:  5m 43s
      🟩 GCC13              Pass: 100%/20  | Total:  2h 43m | Avg:  8m 11s | Max: 38m 31s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 19m 14s | Avg:  6m 24s | Max:  6m 59s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 20m 37s | Avg: 20m 37s | Max: 20m 37s | Hits:  99%/2230  
      🟩 MSVC14.29          Pass: 100%/2   | Total: 35m 34s | Avg: 17m 47s | Max: 18m 21s | Hits:  99%/4460  
      🟩 MSVC14.39          Pass: 100%/6   | Total:  2h 10m | Avg: 21m 48s | Max: 25m 07s | Hits:  99%/13380 
    🟩 cxx_family
      🟩 Clang              Pass: 100%/55  | Total:  5h 18m | Avg:  5m 47s | Max: 13m 33s
      🟩 GCC                Pass: 100%/55  | Total:  5h 31m | Avg:  6m 01s | Max: 38m 31s
      🟩 Intel              Pass: 100%/3   | Total: 19m 14s | Avg:  6m 24s | Max:  6m 59s
      🟩 MSVC               Pass: 100%/9   | Total:  3h 07m | Avg: 20m 46s | Max: 25m 07s | Hits:  99%/20070 
    🟩 gpu
      🟩 v100               Pass: 100%/122 | Total: 14h 15m | Avg:  7m 00s | Max: 38m 31s | Hits:  99%/20070 
    🟩 jobs
      🟩 Build              Pass: 100%/103 | Total: 10h 01m | Avg:  5m 50s | Max: 21m 02s | Hits:  99%/13380 
      🟩 TestCPU            Pass: 100%/11  | Total:  2h 09m | Avg: 11m 47s | Max: 25m 07s | Hits:  99%/6690  
      🟩 TestGPU            Pass: 100%/8   | Total:  2h 04m | Avg: 15m 37s | Max: 38m 31s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 15m 34s | Avg:  5m 11s | Max:  5m 53s
      🟩 90a                Pass: 100%/4   | Total: 17m 47s | Avg:  4m 26s | Max:  4m 45s
    🟩 std
      🟩 11                 Pass: 100%/31  | Total:  2h 43m | Avg:  5m 15s | Max: 13m 00s
      🟩 14                 Pass: 100%/35  | Total:  4h 16m | Avg:  7m 20s | Max: 24m 56s | Hits:  99%/8920  
      🟩 17                 Pass: 100%/34  | Total:  4h 04m | Avg:  7m 10s | Max: 23m 18s | Hits:  99%/6690  
      🟩 20                 Pass: 100%/22  | Total:  3h 11m | Avg:  8m 43s | Max: 38m 31s | Hits:  99%/4460  
    
  • 🟩 libcudacxx: Pass: 100%/116 | Total: 19h 43m | Avg: 10m 12s | Max: 36m 32s | Hits: 99%/17005

    🟩 cpu
      🟩 amd64              Pass: 100%/108 | Total: 18h 52m | Avg: 10m 28s | Max: 36m 32s | Hits:  99%/17005 
      🟩 arm64              Pass: 100%/8   | Total: 51m 16s | Avg:  6m 24s | Max: 23m 32s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  2h 08m | Avg:  8m 34s | Max: 36m 32s | Hits:  99%/2642  
      🟩 11.8               Pass: 100%/3   | Total: 56m 56s | Avg: 18m 58s | Max: 31m 25s
      🟩 12.6               Pass: 100%/98  | Total: 16h 37m | Avg: 10m 10s | Max: 31m 18s | Hits:  99%/14363 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 38m 37s | Avg: 19m 18s | Max: 20m 37s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  2h 08m | Avg:  8m 34s | Max: 36m 32s | Hits:  99%/2642  
      🟩 nvcc11.8           Pass: 100%/3   | Total: 56m 56s | Avg: 18m 58s | Max: 31m 25s
      🟩 nvcc12.6           Pass: 100%/96  | Total: 15h 59m | Avg:  9m 59s | Max: 31m 18s | Hits:  99%/14363 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 38m 37s | Avg: 19m 18s | Max: 20m 37s
      🟩 nvcc               Pass: 100%/114 | Total: 19h 04m | Avg: 10m 02s | Max: 36m 32s | Hits:  99%/17005 
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 26m 59s | Avg:  4m 29s | Max:  5m 43s
      🟩 Clang10            Pass: 100%/3   | Total: 32m 40s | Avg: 10m 53s | Max: 21m 12s
      🟩 Clang11            Pass: 100%/4   | Total: 17m 46s | Avg:  4m 26s | Max:  4m 33s
      🟩 Clang12            Pass: 100%/4   | Total: 58m 13s | Avg: 14m 33s | Max: 28m 33s
      🟩 Clang13            Pass: 100%/4   | Total: 18m 38s | Avg:  4m 39s | Max:  5m 03s
      🟩 Clang14            Pass: 100%/4   | Total: 17m 58s | Avg:  4m 29s | Max:  4m 47s
      🟩 Clang15            Pass: 100%/4   | Total: 58m 17s | Avg: 14m 34s | Max: 28m 55s
      🟩 Clang16            Pass: 100%/4   | Total: 38m 26s | Avg:  9m 36s | Max: 25m 04s
      🟩 Clang17            Pass: 100%/4   | Total:  1h 18m | Avg: 19m 31s | Max: 31m 18s
      🟩 Clang18            Pass: 100%/14  | Total:  2h 47m | Avg: 11m 56s | Max: 28m 27s
      🟩 GCC6               Pass: 100%/2   | Total: 39m 43s | Avg: 19m 51s | Max: 36m 32s
      🟩 GCC7               Pass: 100%/6   | Total:  1h 05m | Avg: 10m 50s | Max: 27m 59s
      🟩 GCC8               Pass: 100%/6   | Total: 20m 39s | Avg:  3m 26s | Max:  4m 08s
      🟩 GCC9               Pass: 100%/6   | Total: 32m 29s | Avg:  5m 24s | Max: 13m 57s
      🟩 GCC10              Pass: 100%/4   | Total: 36m 24s | Avg:  9m 06s | Max: 23m 07s
      🟩 GCC11              Pass: 100%/7   | Total:  1h 29m | Avg: 12m 50s | Max: 31m 25s
      🟩 GCC12              Pass: 100%/4   | Total: 37m 41s | Avg:  9m 25s | Max: 25m 36s
      🟩 GCC13              Pass: 100%/21  | Total:  3h 53m | Avg: 11m 07s | Max: 29m 53s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 18m 49s | Avg:  6m 16s | Max:  6m 39s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 20m 12s | Avg: 20m 12s | Max: 20m 12s | Hits:  99%/2642  
      🟩 MSVC14.29          Pass: 100%/2   | Total: 27m 46s | Avg: 13m 53s | Max: 14m 31s | Hits:  99%/5646  
      🟩 MSVC14.39          Pass: 100%/3   | Total: 46m 54s | Avg: 15m 38s | Max: 16m 09s | Hits:  99%/8717  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/51  | Total:  8h 34m | Avg: 10m 04s | Max: 31m 18s
      🟩 GCC                Pass: 100%/56  | Total:  9h 15m | Avg:  9m 55s | Max: 36m 32s
      🟩 Intel              Pass: 100%/3   | Total: 18m 49s | Avg:  6m 16s | Max:  6m 39s
      🟩 MSVC               Pass: 100%/6   | Total:  1h 34m | Avg: 15m 48s | Max: 20m 12s | Hits:  99%/17005 
    🟩 gpu
      🟩 v100               Pass: 100%/116 | Total: 19h 43m | Avg: 10m 12s | Max: 36m 32s | Hits:  99%/17005 
    🟩 jobs
      🟩 Build              Pass: 100%/103 | Total: 15h 20m | Avg:  8m 56s | Max: 36m 32s | Hits:  99%/17005 
      🟩 NVRTC              Pass: 100%/4   | Total:  1h 44m | Avg: 26m 01s | Max: 29m 53s
      🟩 Test               Pass: 100%/8   | Total:  2h 36m | Avg: 19m 34s | Max: 28m 27s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  2m 18s | Avg:  2m 18s | Max:  2m 18s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 56m 56s | Avg: 18m 58s | Max: 31m 25s
      🟩 90a                Pass: 100%/4   | Total: 16m 51s | Avg:  4m 12s | Max:  4m 31s
    🟩 std
      🟩 11                 Pass: 100%/30  | Total:  4h 16m | Avg:  8m 33s | Max: 36m 32s
      🟩 14                 Pass: 100%/33  | Total:  4h 57m | Avg:  9m 00s | Max: 27m 21s | Hits:  99%/8128  
      🟩 17                 Pass: 100%/32  | Total:  6h 35m | Avg: 12m 22s | Max: 31m 25s | Hits:  99%/5806  
      🟩 20                 Pass: 100%/20  | Total:  3h 51m | Avg: 11m 33s | Max: 31m 18s | Hits:  99%/3071  
    
  • 🟩 cudax: Pass: 100%/58 | Total: 2h 43m | Avg: 2m 48s | Max: 11m 02s | Hits: 90%/208

    🟩 cpu
      🟩 amd64              Pass: 100%/54  | Total:  2h 34m | Avg:  2m 51s | Max: 11m 02s | Hits:  90%/208   
      🟩 arm64              Pass: 100%/4   | Total:  9m 03s | Avg:  2m 15s | Max:  2m 57s
    🟩 ctk
      🟩 12.0               Pass: 100%/23  | Total:  1h 04m | Avg:  2m 47s | Max: 11m 02s | Hits:  90%/104   
      🟩 12.6               Pass: 100%/35  | Total:  1h 38m | Avg:  2m 49s | Max: 10m 00s | Hits:  90%/104   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/23  | Total:  1h 04m | Avg:  2m 47s | Max: 11m 02s | Hits:  90%/104   
      🟩 nvcc12.6           Pass: 100%/35  | Total:  1h 38m | Avg:  2m 49s | Max: 10m 00s | Hits:  90%/104   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/58  | Total:  2h 43m | Avg:  2m 48s | Max: 11m 02s | Hits:  90%/208   
    🟩 cxx
      🟩 Clang9             Pass: 100%/2   | Total:  4m 58s | Avg:  2m 29s | Max:  3m 01s
      🟩 Clang10            Pass: 100%/2   | Total:  4m 42s | Avg:  2m 21s | Max:  2m 23s
      🟩 Clang11            Pass: 100%/4   | Total:  9m 39s | Avg:  2m 24s | Max:  2m 56s
      🟩 Clang12            Pass: 100%/4   | Total:  9m 26s | Avg:  2m 21s | Max:  2m 47s
      🟩 Clang13            Pass: 100%/4   | Total:  8m 23s | Avg:  2m 05s | Max:  2m 30s
      🟩 Clang14            Pass: 100%/6   | Total: 18m 21s | Avg:  3m 03s | Max:  4m 32s
      🟩 Clang15            Pass: 100%/2   | Total:  5m 29s | Avg:  2m 44s | Max:  2m 59s
      🟩 Clang16            Pass: 100%/4   | Total: 10m 48s | Avg:  2m 42s | Max:  3m 00s
      🟩 Clang17            Pass: 100%/2   | Total:  4m 52s | Avg:  2m 26s | Max:  2m 27s
      🟩 Clang18            Pass: 100%/4   | Total: 12m 42s | Avg:  3m 10s | Max:  4m 18s
      🟩 GCC9               Pass: 100%/2   | Total:  3m 38s | Avg:  1m 49s | Max:  1m 52s
      🟩 GCC10              Pass: 100%/4   | Total:  7m 55s | Avg:  1m 58s | Max:  2m 08s
      🟩 GCC11              Pass: 100%/4   | Total:  7m 46s | Avg:  1m 56s | Max:  2m 07s
      🟩 GCC12              Pass: 100%/9   | Total: 27m 15s | Avg:  3m 01s | Max:  4m 39s
      🟩 GCC13              Pass: 100%/3   | Total:  6m 13s | Avg:  2m 04s | Max:  2m 15s
      🟩 MSVC14.36          Pass: 100%/1   | Total: 11m 02s | Avg: 11m 02s | Max: 11m 02s | Hits:  90%/104   
      🟩 MSVC14.39          Pass: 100%/1   | Total: 10m 00s | Avg: 10m 00s | Max: 10m 00s | Hits:  90%/104   
    🟩 cxx_family
      🟩 Clang              Pass: 100%/34  | Total:  1h 29m | Avg:  2m 37s | Max:  4m 32s
      🟩 GCC                Pass: 100%/22  | Total: 52m 47s | Avg:  2m 23s | Max:  4m 39s
      🟩 MSVC               Pass: 100%/2   | Total: 21m 02s | Avg: 10m 31s | Max: 11m 02s | Hits:  90%/208   
    🟩 gpu
      🟩 v100               Pass: 100%/58  | Total:  2h 43m | Avg:  2m 48s | Max: 11m 02s | Hits:  90%/208   
    🟩 jobs
      🟩 Build              Pass: 100%/50  | Total:  2h 08m | Avg:  2m 34s | Max: 11m 02s | Hits:  90%/208   
      🟩 Test               Pass: 100%/8   | Total: 34m 38s | Avg:  4m 19s | Max:  4m 39s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  1m 49s | Avg:  1m 49s | Max:  1m 49s
      🟩 90a                Pass: 100%/1   | Total:  2m 07s | Avg:  2m 07s | Max:  2m 07s
    🟩 std
      🟩 17                 Pass: 100%/32  | Total:  1h 19m | Avg:  2m 29s | Max:  4m 39s
      🟩 20                 Pass: 100%/26  | Total:  1h 23m | Avg:  3m 12s | Max: 11m 02s | Hits:  90%/208   
    
  • 🟩 cccl: Pass: 100%/4 | Total: 16m 26s | Avg: 4m 06s | Max: 4m 37s

    🟩 cpu
      🟩 amd64              Pass: 100%/4   | Total: 16m 26s | Avg:  4m 06s | Max:  4m 37s
    🟩 ctk
      🟩 11.1               Pass: 100%/2   | Total:  7m 17s | Avg:  3m 38s | Max:  4m 01s
      🟩 12.6               Pass: 100%/2   | Total:  9m 09s | Avg:  4m 34s | Max:  4m 37s
    🟩 cudacxx
      🟩 nvcc11.1           Pass: 100%/2   | Total:  7m 17s | Avg:  3m 38s | Max:  4m 01s
      🟩 nvcc12.6           Pass: 100%/2   | Total:  9m 09s | Avg:  4m 34s | Max:  4m 37s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 16m 26s | Avg:  4m 06s | Max:  4m 37s
    🟩 cxx
      🟩 Clang9             Pass: 100%/1   | Total:  4m 01s | Avg:  4m 01s | Max:  4m 01s
      🟩 Clang18            Pass: 100%/1   | Total:  4m 37s | Avg:  4m 37s | Max:  4m 37s
      🟩 GCC6               Pass: 100%/1   | Total:  3m 16s | Avg:  3m 16s | Max:  3m 16s
      🟩 GCC13              Pass: 100%/1   | Total:  4m 32s | Avg:  4m 32s | Max:  4m 32s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/2   | Total:  8m 38s | Avg:  4m 19s | Max:  4m 37s
      🟩 GCC                Pass: 100%/2   | Total:  7m 48s | Avg:  3m 54s | Max:  4m 32s
    🟩 gpu
      🟩 v100               Pass: 100%/4   | Total: 16m 26s | Avg:  4m 06s | Max:  4m 37s
    🟩 jobs
      🟩 Infra              Pass: 100%/4   | Total: 16m 26s | Avg:  4m 06s | Max:  4m 37s
    

👃 Inspect Changes

Modifications in project?

Project
+/- CCCL Infrastructure
libcu++
CUB
Thrust
CUDA Experimental
pycuda
CUDA C Core Library

Modifications in project or dependencies?

Project
+/- CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- pycuda
+/- CUDA C Core Library

🏃‍ Runner counts (total jobs: 437)

# Runner
320 linux-amd64-cpu16
66 linux-amd64-gpu-v100-latest-1
28 linux-arm64-cpu16
23 windows-amd64-cpu16

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Status: In Progress
Development

Successfully merging this pull request may close these issues.

[FEA]: Validate cuda.parallel type matching in build and execution
1 participant