Linear with DID loop split #3650

Priya2698 · 2024-12-28T03:14:07Z

This PR adds a test case demonstrating DID loop split for linear.
The test requires no changes to the LinearOp::evaluate method and works out of the box. DID split on the logical domain continues to work for linear through the existing workarounds (WARs) that squeeze/unsqueeze the DID dimension. Those WARs can be removed once we switch completely to representing device parallelism with the allocation and loop domains.
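
As a rough illustration of the sharding this test exercises, the sketch below uses plain Python lists instead of nvFuser or PyTorch tensors. It assumes the weight has shape [d * e, e] and the bias has shape [d * e], as in the test, and that each of the d devices holds one contiguous block of e output rows. The helpers `linear` and `shard_rows` and the 2-D input are hypothetical simplifications, not part of the PR.

```python
# Illustrative only: a plain-Python stand-in for a linear layer whose
# output-feature dimension (d * e) is split across d devices.

def linear(x, weight, bias):
    """y = x @ weight^T + bias for x: [n, in], weight: [out, in], bias: [out]."""
    return [
        [sum(xi * wi for xi, wi in zip(row, w_row)) + b
         for w_row, b in zip(weight, bias)]
        for row in x
    ]

def shard_rows(values, d, rank):
    """Return the rank-th of d equal chunks along the outermost axis."""
    chunk = len(values) // d
    return values[rank * chunk:(rank + 1) * chunk]

d, e = 2, 3                                   # devices, features per shard
x = [[1.0, 2.0, 3.0], [4.0, 5.0, 6.0]]        # [n=2, e]
weight = [[float(i + j) for j in range(e)] for i in range(d * e)]  # [d*e, e]
bias = [float(i) for i in range(d * e)]       # [d*e]

unsharded = linear(x, weight, bias)           # [n, d*e]

# Each "device" computes with only its own slice of weight and bias.
per_device = [
    linear(x, shard_rows(weight, d, rank), shard_rows(bias, d, rank))
    for rank in range(d)
]

# Concatenating the per-device outputs along the last axis recovers
# the unsharded result.
gathered = [
    sum((per_device[rank][n] for rank in range(d)), [])
    for n in range(len(x))
]
assert gathered == unsharded
```

Because each device only needs its own rows of the weight and bias, no communication is required to produce its block of the output in this simplified picture.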

Priya2698 · 2024-12-28T06:10:06Z

!build

wujingyue · 2024-12-29T06:24:23Z

tests/python/test_multidevice.py

+            self.weight = self.define_tensor([d * e, e], contiguity=[True, True])
+            self.bias = self.define_tensor([d * e], contiguity=[True])


Suggested change

-            self.weight = self.define_tensor([d * e, e], contiguity=[True, True])
-            self.bias = self.define_tensor([d * e], contiguity=[True])
+            self.weight = self.define_tensor([d * e, e], contiguity=True)
+            self.bias = self.define_tensor([d * e], contiguity=True)

Also, a question for you: is contiguity=True necessary here? Is there a problem with non-contiguous inputs?
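
For readers unfamiliar with the term, the sketch below shows one simplified way to decide whether a tensor layout is contiguous in the row-major sense. The helpers `contiguous_strides` and `is_contiguous` are hypothetical and are not nvFuser or PyTorch APIs. Unlike PyTorch, this version does not special-case dimensions of size 1.

```python
# Illustrative only: a simplified row-major contiguity check.

def contiguous_strides(shape):
    """Strides (in elements) of a densely packed row-major tensor."""
    strides = []
    running = 1
    for size in reversed(shape):
        strides.append(running)
        running *= size
    return list(reversed(strides))

def is_contiguous(shape, strides):
    """True if the given strides match the dense row-major layout."""
    return list(strides) == contiguous_strides(shape)

d, e = 2, 3
# A freshly allocated [d * e, e] weight is densely packed.
dense = is_contiguous([d * e, e], [e, 1])
# The same shape obtained by transposing an [e, d * e] tensor is not.
transposed = is_contiguous([d * e, e], [1, d * e])
```

Here `dense` is True and `transposed` is False, which is the kind of input the question above is asking about.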

wujingyue · 2024-12-29T06:25:38Z

tests/python/test_multidevice.py

+    expected_out_tensor = unsharded_out_tensor.view([b, s, d, e]).permute(2, 0, 1, 3)[
+        rank : rank + 1
+    ]


Can we use shard_tensor here?
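
For reference, the slicing in the diff above selects one device's block of the last axis and adds a leading axis of size 1. The sketch below reproduces that indexing on nested Python lists, assuming the unsharded output has shape [b, s, d * e]. The helper `expected_shard` is hypothetical. It is not the test's shard_tensor helper, whose exact behavior is not shown in this thread.

```python
# Illustrative only: nested-list equivalent of
#   unsharded.view([b, s, d, e]).permute(2, 0, 1, 3)[rank : rank + 1]

def expected_shard(unsharded, e, rank):
    """Return the rank-th block of the last axis with shape [1, b, s, e]."""
    return [[
        [row[rank * e:(rank + 1) * e] for row in batch]
        for batch in unsharded
    ]]

b, s, d, e = 1, 2, 2, 2
unsharded = [[[0, 1, 2, 3], [4, 5, 6, 7]]]    # [b, s, d * e]

shard0 = expected_shard(unsharded, e, rank=0)  # [[[[0, 1], [4, 5]]]]
shard1 = expected_shard(unsharded, e, rank=1)  # [[[[2, 3], [6, 7]]]]
```

Viewing [b, s, d * e] as [b, s, d, e] maps element [i][j][r * e + k] to [i][j][r][k], so after the permute and slice each device sees columns r * e through (r + 1) * e - 1 of every row.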

Priya2698 added 4 commits December 18, 2024 18:46

Loop split for linear

09de065

clean

683a6b2

lint

09b4fb7

undo extraneous changes

747629d

Priya2698 requested review from wujingyue, samnordmann and cowanmeg and removed request for cowanmeg and samnordmann December 28, 2024 06:10

wujingyue approved these changes Dec 29, 2024
