
Add fp16 support #38

Draft · santhnm2 wants to merge 8 commits into main
Conversation

santhnm2 (Collaborator)
No description provided.

@@ -415,8 +417,12 @@ def print_memory_usage():
         assert isinstance(output, tuple)
         for i, v in enumerate(op.outputs):
             value_map[v] = output[i]
+            if torch.any(torch.isnan(output[i])):
Collaborator:
I'd put these under a debug flag to avoid slowing down execution.
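
A minimal sketch of what this suggests, gating the NaN check behind a hypothetical args.debug flag; op, output, and value_map are taken from the diff above, and the error message is illustrative:

    # Sketch only: run the torch.isnan scan (a full pass over each output
    # tensor) only when a hypothetical debug flag is set, so normal
    # executions skip the extra work.
    assert isinstance(output, tuple)
    for i, v in enumerate(op.outputs):
        value_map[v] = output[i]
        if args.debug and torch.any(torch.isnan(output[i])):
            raise RuntimeError(f"NaN detected in output {i} of {op}")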

-args.dram_bandwidth = simulation_parameters["dram_bandwidth"]
-args.kernel_launch_overhead = simulation_parameters["kernel_launch_overhead"]
+args.device_throughput = 1.0 / simulation_parameters["device_parameters"][0]
+args.dram_bandwidth = 1.0 / simulation_parameters["device_parameters"][1]
Collaborator:
Won't this become infinity if one of the regression coefficients is 0?

Collaborator (Author):

Yeah, but I think that's OK for now, since we don't see much utility from the DRAM bandwidth term at sufficiently large data sizes, right?

Collaborator:

When I ran it before Tuesday's meeting, it threw a RuntimeError for dividing by zero (not sure if it was from this line, though), so it might be safer to store the raw parameters in case you run into that error again.
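
One possible guard, sketched under the assumption that storing the raw coefficients and inverting defensively is acceptable; the inf fallback matches the "that's OK for now" reasoning above and is illustrative, not the PR's actual fix:

    # Sketch: keep the raw regression coefficients around and invert
    # defensively, so a zero coefficient yields inf instead of raising.
    device_params = simulation_parameters["device_parameters"]
    args.device_parameters = device_params  # store the raw values too
    args.device_throughput = (
        1.0 / device_params[0] if device_params[0] != 0 else float("inf")
    )
    args.dram_bandwidth = (
        1.0 / device_params[1] if device_params[1] != 0 else float("inf")
    )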

@@ -89,7 +90,7 @@ def add_backend_config_arguments(self):
         self.add_argument(
             "--use_gpu",
             action="store_true",
-            default=torch.cuda.is_available(),
+            default=False,
Collaborator:

Why did you change the default?

Collaborator (Author):

There's no way to use only the CPU otherwise, but maybe we should just make this --use_cpu instead.
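
A sketch of the --use_cpu alternative being floated here, assuming the same add_argument wrapper as in the diff above; the help text and downstream wiring are hypothetical:

    # Sketch: invert the flag so GPU stays the opportunistic default and
    # --use_cpu opts out of it.
    self.add_argument(
        "--use_cpu",
        action="store_true",
        default=False,
        help="Run on CPU even if CUDA is available",
    )
    # Downstream (hypothetical wiring):
    # use_gpu = torch.cuda.is_available() and not args.use_cpu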
