Fix ds-chat CI regression #7015

tjruwase · 2025-02-07T14:52:07Z

Fix #7014
Avoid naming collision on partition()

tjruwase · 2025-02-07T14:52:35Z

tjruwase · 2025-02-07T14:52:58Z

deepspeed/module_inject/layers.py

@@ -124,7 +124,7 @@ def backward(ctx: Any, grad_output: torch.Tensor) -> Tuple[None, torch.Tensor]:
        return None, grad_output


-class Replaced_Layer(nn.Module, ABC):
+class TensorParallel_Layer(nn.Module, ABC):


@inkcherry, please note this renaming for your tutorial work.

tjruwase · 2025-02-08T09:02:28Z

deepspeed/runtime/hybrid_engine.py

+                    if self.inference_policies[child.__class__][0] == LinearLayer:
+                        self._other_layers.append(self.inference_policies[child.__class__][0](module=child,
+                                                                                              mp_group=None,
+                                                                                              skip_partition=True))


@inkcherry, I set mp_group=None here because I was not sure whether autotp_training is tested/compatible with hybrid_engine. Is this okay?

Fix ds-chat CI regression

9882116

tjruwase requested a review from loadams February 7, 2025 14:52

tjruwase requested review from tohtana and hwchen2017 as code owners February 7, 2025 14:52

tjruwase commented Feb 7, 2025

View reviewed changes

tjruwase removed request for hwchen2017 and tohtana February 7, 2025 14:54

tjruwase added 4 commits February 7, 2025 10:30

Fix bug

4a1dd0f

Avoid naming collision on partition()

0ac4457

Use new API

2ae2062

Merge branch 'master' into olruwase/ds_7014

9fb73a4

tjruwase commented Feb 8, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix ds-chat CI regression #7015

Fix ds-chat CI regression #7015

tjruwase commented Feb 7, 2025 •

edited

Loading

tjruwase commented Feb 7, 2025

tjruwase Feb 7, 2025

tjruwase Feb 8, 2025 •

edited

Loading

Fix ds-chat CI regression #7015

Are you sure you want to change the base?

Fix ds-chat CI regression #7015

Conversation

tjruwase commented Feb 7, 2025 • edited Loading

tjruwase commented Feb 7, 2025

tjruwase Feb 7, 2025

Choose a reason for hiding this comment

tjruwase Feb 8, 2025 • edited Loading

Choose a reason for hiding this comment

tjruwase commented Feb 7, 2025 •

edited

Loading

tjruwase Feb 8, 2025 •

edited

Loading