Skip to content

Actions: Azure/MS-AMP

GitHub Pages

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
179 workflow runs
179 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

[Bug Fixed] Support MS-AMP+TE+DDP and MS-AMP+TE+DeepSpeed
GitHub Pages #108: Pull request #144 opened by wkcn
December 13, 2023 08:17 50s wkcn:fix_wgrad_for_ds
December 13, 2023 08:17 50s
Add cifar10 example using TE+DeepSpeed-Zero+MS-AMP
GitHub Pages #107: Pull request #143 synchronize by tocean
December 13, 2023 02:58 51s yuxaing/ds_te_example
December 13, 2023 02:58 51s
Add cifar10 example using TE+DeepSpeed-Zero+MS-AMP
GitHub Pages #106: Pull request #143 synchronize by tocean
December 13, 2023 02:48 47s yuxaing/ds_te_example
December 13, 2023 02:48 47s
Add cifar10 example using TE+DeepSpeed-Zero+MS-AMP
GitHub Pages #105: Pull request #143 synchronize by tocean
December 12, 2023 11:25 54s yuxaing/ds_te_example
December 12, 2023 11:25 54s
Add cifar10 example using TE+DeepSpeed-Zero+MS-AMP
GitHub Pages #104: Pull request #143 opened by tocean
December 12, 2023 11:17 50s yuxaing/ds_te_example
December 12, 2023 11:17 50s
[Feature] Auto scaling factor tuning for FP8 collective communication
GitHub Pages #99: Pull request #140 synchronize by wkcn
December 10, 2023 13:24 52s wkcn:wgrad_auto_scaling
December 10, 2023 13:24 52s
fix bug in deep fp8 zero
GitHub Pages #94: Pull request #141 synchronize by tocean
December 8, 2023 06:19 52s yuxiang/zero_bugfix
December 8, 2023 06:19 52s
fix bug in deep fp8 zero
GitHub Pages #93: Pull request #141 synchronize by tocean
December 8, 2023 05:39 54s yuxiang/zero_bugfix
December 8, 2023 05:39 54s
fix bug in deep fp8 zero
GitHub Pages #92: Pull request #141 opened by tocean
December 8, 2023 05:36 55s yuxiang/zero_bugfix
December 8, 2023 05:36 55s
Use ScalingTensor for state in AdamW optimizer (#138)
GitHub Pages #86: Commit ca2d8d5 pushed by tocean
December 1, 2023 08:12 57s main
December 1, 2023 08:12 57s
Use ScalingTensor for state in AdamW optimizer
GitHub Pages #85: Pull request #138 synchronize by tocean
December 1, 2023 06:55 52s yuxiang/adamw_refine
December 1, 2023 06:55 52s
Use ScalingTensor for state in AdamW optimizer
GitHub Pages #84: Pull request #138 opened by tocean
December 1, 2023 06:51 53s yuxiang/adamw_refine
December 1, 2023 06:51 53s
[Bugfix] when parameters has no grad or ScalingParameter has no is_me…
GitHub Pages #83: Commit aef18eb pushed by tocean
November 30, 2023 04:38 55s main
November 30, 2023 04:38 55s