Skip to content

Actions: deepspeedai/DeepSpeed

nv-accelerate-v100

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
4,615 workflow runs
4,615 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Training multiple models
nv-accelerate-v100 #13425: Pull request #7018 synchronize by tjruwase
March 6, 2025 18:41 8m 12s olruwase/zero_multi_models
March 6, 2025 18:41 8m 12s
Training multiple models
nv-accelerate-v100 #13424: Pull request #7018 synchronize by tjruwase
March 6, 2025 18:17 8m 9s olruwase/zero_multi_models
March 6, 2025 18:17 8m 9s
Training multiple models
nv-accelerate-v100 #13423: Pull request #7018 synchronize by tjruwase
March 6, 2025 17:06 8m 18s olruwase/zero_multi_models
March 6, 2025 17:06 8m 18s
Update gaudi2 nightly,ci to latest 1.20.0 build
nv-accelerate-v100 #13421: Pull request #7093 synchronize by raza-sikander
March 6, 2025 07:43 8m 39s raza-sikander:master
March 6, 2025 07:43 8m 39s
Update gaudi2 nightly,ci to latest 1.20.0 build
nv-accelerate-v100 #13420: Pull request #7093 synchronize by raza-sikander
March 6, 2025 07:39 Action required raza-sikander:master
March 6, 2025 07:39 Action required
Update gaudi2 nightly,ci to latest 1.20.0 build
nv-accelerate-v100 #13419: Pull request #7093 synchronize by raza-sikander
March 6, 2025 07:37 Action required raza-sikander:master
March 6, 2025 07:37 Action required
Update gaudi2 nightly,ci to latest 1.20.0 build
nv-accelerate-v100 #13418: Pull request #7093 synchronize by raza-sikander
March 6, 2025 07:36 Action required raza-sikander:master
March 6, 2025 07:36 Action required
[XPU] Support XCCL on deepspeed side
nv-accelerate-v100 #13417: Pull request #7113 synchronize by ys950902
March 6, 2025 07:26 8m 3s ys950902:sy/xccl_enable
March 6, 2025 07:26 8m 3s
[XPU] Support XCCL on deepspeed side
nv-accelerate-v100 #13416: Pull request #7113 opened by ys950902
March 6, 2025 07:25 Action required ys950902:sy/xccl_enable
March 6, 2025 07:25 Action required
fix keep_module_on_host
nv-accelerate-v100 #13415: Pull request #7112 synchronize by inkcherry
March 6, 2025 03:02 8m 7s inkcherry:fix_keep_module_on_host
March 6, 2025 03:02 8m 7s
fix keep_module_on_host
nv-accelerate-v100 #13414: Pull request #7112 opened by inkcherry
March 6, 2025 03:00 Action required inkcherry:fix_keep_module_on_host
March 6, 2025 03:00 Action required
Enable torch.autocast with ZeRO
nv-accelerate-v100 #13413: Pull request #6993 synchronize by tohtana
March 6, 2025 01:48 8m 13s tohtana/support_autocast
March 6, 2025 01:48 8m 13s
nv-accelerate-v100
nv-accelerate-v100 #13412: Scheduled
March 6, 2025 00:07 8m 5s master
March 6, 2025 00:07 8m 5s
Update Domino for Llama3
nv-accelerate-v100 #13411: Pull request #7084 synchronize by shenzheyu
March 5, 2025 22:59 8m 23s shenzheyu:master
March 5, 2025 22:59 8m 23s
Update Domino for Llama3
nv-accelerate-v100 #13410: Pull request #7084 synchronize by shenzheyu
March 5, 2025 22:56 Action required shenzheyu:master
March 5, 2025 22:56 Action required
Enable python 3.11 and 3.12 tests
nv-accelerate-v100 #13409: Pull request #7007 synchronize by loadams
March 5, 2025 20:12 8m 57s loadams/reenable-py311-312
March 5, 2025 20:12 8m 57s
Training multiple models
nv-accelerate-v100 #13407: Pull request #7018 synchronize by tjruwase
March 5, 2025 16:08 8m 18s olruwase/zero_multi_models
March 5, 2025 16:08 8m 18s
Training multiple models
nv-accelerate-v100 #13406: Pull request #7018 synchronize by tjruwase
March 5, 2025 16:08 41s olruwase/zero_multi_models
March 5, 2025 16:08 41s
Enable ZeRO set/get APIs for NVMe offload
nv-accelerate-v100 #13405: Pull request #7046 synchronize by loadams
March 5, 2025 15:46 8m 8s olruwase/update_nvme_offload_states
March 5, 2025 15:46 8m 8s
Conditionally quote env vars
nv-accelerate-v100 #13404: Pull request #7071 synchronize by loadams
March 5, 2025 15:45 8m 11s saurabhkoshatwar:bugfix/env_export
March 5, 2025 15:45 8m 11s
Variable batch size and LR scheduler
nv-accelerate-v100 #13402: Pull request #7104 synchronize by bm-synth
March 5, 2025 09:22 7m 4s bm-synth:variable_batch_size_and_lr_2
March 5, 2025 09:22 7m 4s
Enable torch.autocast with ZeRO
nv-accelerate-v100 #13401: Pull request #6993 synchronize by tohtana
March 5, 2025 02:56 8m 52s tohtana/support_autocast
March 5, 2025 02:56 8m 52s
Enable torch.autocast with ZeRO
nv-accelerate-v100 #13400: Pull request #6993 synchronize by tohtana
March 5, 2025 02:55 1m 28s tohtana/support_autocast
March 5, 2025 02:55 1m 28s