Actions: microsoft/DeepSpeed

hpu-gaudi2

1,307 workflow runs

hpu-gaudi2
hpu-gaudi2 #1551: Scheduled
January 28, 2025 00:11 2h 0m 7s master
generalize deepspeed linear and implement it for non cuda systems
hpu-gaudi2 #1550: Pull request #6932 synchronize by loadams
January 27, 2025 17:07 58m 24s oelayan7:linear
Set dataloader shuffle=true
hpu-gaudi2 #1549: Pull request #6950 synchronize by loadams
January 27, 2025 16:53 57m 25s loadams/shuffle-true-dataloader
Remove assumption that padding only occurs on last rank
hpu-gaudi2 #1548: Pull request #6974 synchronize by xylian86
January 27, 2025 07:32 59m 6s xylian86:zero12_padding_issue
hpu-gaudi2
hpu-gaudi2 #1547: Scheduled
January 27, 2025 00:11 2h 1m 21s master
Remove assumption that padding only occurs on last rank
hpu-gaudi2 #1546: Pull request #6974 opened by xylian86
January 26, 2025 16:39 Action required xylian86:zero12_padding_issue
hpu-gaudi2
hpu-gaudi2 #1545: Scheduled
January 26, 2025 00:11 1h 58m 55s master
hpu-gaudi2
hpu-gaudi2 #1544: Scheduled
January 25, 2025 00:10 2h 1m 17s master
Extend DeepSpeed inference initialization API with a 'quantize_groups' argument
hpu-gaudi2 #1543: Pull request #3519 synchronize by loadams
January 24, 2025 22:27 54m 49s sakogan:quant-params
Autotp training
hpu-gaudi2 #1541: Pull request #6922 synchronize by delock
January 24, 2025 07:34 Action required inkcherry:autotp_training
hpu-gaudi2
hpu-gaudi2 #1540: Scheduled
January 24, 2025 00:10 2h 6m 19s master
generalize deepspeed linear and implement it for non cuda systems
hpu-gaudi2 #1539: Pull request #6932 synchronize by loadams
January 23, 2025 18:35 2h 47m 48s oelayan7:linear
Autotp training
hpu-gaudi2 #1538: Pull request #6922 synchronize by inkcherry
January 23, 2025 07:50 3m 56s inkcherry:autotp_training
hpu-gaudi2
hpu-gaudi2 #1537: Scheduled
January 23, 2025 00:10 2h 1m 56s master
Precisely track nvme optimizer offload
hpu-gaudi2 #1536: Pull request #6963 synchronize by tjruwase
January 22, 2025 15:54 55m 48s olruwase/ds_4998
Autotp training
hpu-gaudi2 #1535: Pull request #6922 synchronize by inkcherry
January 22, 2025 05:40 57m 17s inkcherry:autotp_training
Enabled configurable auto Tensor Parallelism (TP) for the inference of diverse models
hpu-gaudi2 #1534: Pull request #6553 synchronize by gyou2021
January 22, 2025 03:03 Action required gyou2021:configurable_autoTP
hpu-gaudi2
hpu-gaudi2 #1533: Scheduled
January 22, 2025 00:11 2h 1m 27s master
generalize deepspeed linear and implement it for non cuda systems
hpu-gaudi2 #1532: Pull request #6932 synchronize by loadams
January 21, 2025 22:34 57m 56s oelayan7:linear
generalize deepspeed linear and implement it for non cuda systems
hpu-gaudi2 #1531: Pull request #6932 synchronize by loadams
January 21, 2025 21:54 Action required oelayan7:linear