Skip to content

Actions: NVIDIA/TransformerEngine

Deploy nightly docs

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
563 workflow runs
563 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

[C/PyTorch] Add max_t support for THD (#1244)
Deploy nightly docs #695: Commit 7fb22c3 pushed by cyanguwa
October 25, 2024 20:29 1m 9s main
October 25, 2024 20:29 1m 9s
[C/PyTorch] Add THD MQA/GQA (#1266)
Deploy nightly docs #694: Commit 83f9cc0 pushed by cyanguwa
October 25, 2024 20:25 1m 10s main
October 25, 2024 20:25 1m 10s
Support building documentation in Python 3.12 (#1274)
Deploy nightly docs #693: Commit 8e4ee12 pushed by timmoon10
October 25, 2024 19:10 1m 7s main
October 25, 2024 19:10 1m 7s
[TE/JAX] Update required JAX version for FFI custom calls with cudaGr…
Deploy nightly docs #692: Commit 7cef756 pushed by phu0ngng
October 25, 2024 17:48 1m 3s main
October 25, 2024 17:48 1m 3s
[Pytorch] Check gradient in test numerics (#1229)
Deploy nightly docs #691: Commit 7b284fe pushed by pggPL
October 24, 2024 18:13 1m 4s main
October 24, 2024 18:13 1m 4s
[Paddle] Update type names for Paddle 3.0 (#1286)
Deploy nightly docs #690: Commit 7a5fd0c pushed by timmoon10
October 24, 2024 17:52 1m 6s main
October 24, 2024 17:52 1m 6s
[JAX] XLA Custom Calls with FFI for FusedAttnFwd, Quantize, Transpose…
Deploy nightly docs #689: Commit 18c2234 pushed by huanghua1994
October 24, 2024 15:27 1m 43s main
October 24, 2024 15:27 1m 43s
[JAX] Fix correctness of JAX fused attention with CP and improve nume…
Deploy nightly docs #688: Commit 20c7529 pushed by mgoldfarb-nvidia
October 24, 2024 13:51 1m 7s main
October 24, 2024 13:51 1m 7s
Add THD + GQA supports (#1260)
Deploy nightly docs #687: Commit d9b4bfb pushed by cyanguwa
October 22, 2024 22:37 58s main
October 22, 2024 22:37 58s
[JAX] Skip V100 encoder tests (#1262)
Deploy nightly docs #686: Commit 35f7d26 pushed by phu0ngng
October 22, 2024 19:44 1m 14s main
October 22, 2024 19:44 1m 14s
Fused Attention Support 64-bit Ragged Offsets for Large THD Tensors (…
Deploy nightly docs #685: Commit 7b18f23 pushed by mgoldfarb-nvidia
October 22, 2024 14:24 1m 11s main
October 22, 2024 14:24 1m 11s
[PyTorch] Reduce the number of FA versions in L3 tests (#1280)
Deploy nightly docs #684: Commit 29e3a09 pushed by timmoon10
October 21, 2024 21:43 1m 0s main
October 21, 2024 21:43 1m 0s
[PyTorch] Remove PyTorch L0 distributed test (#1273)
Deploy nightly docs #683: Commit 3ea7dd3 pushed by timmoon10
October 18, 2024 18:04 1m 9s main
October 18, 2024 18:04 1m 9s
[Paddle] Debug wheel test (#1265)
Deploy nightly docs #682: Commit 927bca7 pushed by timmoon10
October 18, 2024 17:22 1m 4s main
October 18, 2024 17:22 1m 4s
[PyTorch] Reorganize L1 tests (#1255)
Deploy nightly docs #681: Commit 41fe1e5 pushed by timmoon10
October 18, 2024 01:57 1m 4s main
October 18, 2024 01:57 1m 4s
Fix seq_dim in CP implementation (#1264)
Deploy nightly docs #680: Commit a488b8b pushed by xrennvidia
October 17, 2024 18:21 1m 33s main
October 17, 2024 18:21 1m 33s
[TE/JAX] Enabling CudaGraph for custom calls with FFI (#1228)
Deploy nightly docs #679: Commit 12f30ea pushed by phu0ngng
October 17, 2024 15:30 1m 5s main
October 17, 2024 15:30 1m 5s
[Bugfix] Fix bias for 0-dim tensors in gemm (#1246)
Deploy nightly docs #678: Commit 8e97c8d pushed by yaox12
October 17, 2024 14:48 1m 15s main
October 17, 2024 14:48 1m 15s
[PyTorch] Fix wgrads for GroupedLinear when weights don't require gra…
Deploy nightly docs #677: Commit 2d7020e pushed by yaox12
October 17, 2024 13:20 1m 7s main
October 17, 2024 13:20 1m 7s
Changed VERSION to 1.13.0.dev
Deploy nightly docs #676: Commit 9001081 pushed by ptrendx
October 16, 2024 17:16 1m 5s main
October 16, 2024 17:16 1m 5s
[PyTorch] Fix FP8 activation recompute (#1254)
Deploy nightly docs #675: Commit a518151 pushed by ksivaman
October 16, 2024 15:27 1m 40s main
October 16, 2024 15:27 1m 40s
Upgrade pylint to 3.3.1 (#1257)
Deploy nightly docs #674: Commit 6e90fcb pushed by ksivaman
October 16, 2024 15:27 1m 8s main
October 16, 2024 15:27 1m 8s
[PyTorch] Drop FA as an installation requirement (#1226)
Deploy nightly docs #673: Commit 161b1d9 pushed by cyanguwa
October 16, 2024 02:35 1m 5s main
October 16, 2024 02:35 1m 5s
fix assertion bug for SWA API in TE-JAX (#1242)
Deploy nightly docs #672: Commit 43b9e1e pushed by phu0ngng
October 16, 2024 01:00 1m 4s main
October 16, 2024 01:00 1m 4s
[PyTorch] Build custom ORT ops before running ONNX export tests (#1252)
Deploy nightly docs #671: Commit f6b766b pushed by timmoon10
October 16, 2024 00:34 1m 36s main
October 16, 2024 00:34 1m 36s