NCCL only multi-gpu multi-node training without MPI #1114
ci.yml
on: pull_request
build-cuda-windows
2m 26s
build-ubuntu20-04
2m 32s
build-cuda-fp32
1m 20s
build-cuda-bf16
1m 14s
build-cuda-fp16
1m 10s
build-cuda-kernels
1m 28s
Matrix: build-and-test-cpu