-
Notifications
You must be signed in to change notification settings - Fork 304
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add torchao to PT2 Benchmark Runner #2268
Conversation
This pull request was exported from Phabricator. Differential Revision: D57463273 |
Summary: Support torchao performance and accuracy tests in PT2 Benchmark Runner, using the inductor backend as the baseline. Reviewed By: jerryzh168 Differential Revision: D57463273
This pull request was exported from Phabricator. Differential Revision: D57463273 |
Summary: X-link: pytorch/pytorch#126469 Support torchao performance and accuracy tests in PT2 Benchmark Runner, using the inductor backend as the baseline. Reviewed By: jerryzh168 Differential Revision: D57463273
This pull request was exported from Phabricator. Differential Revision: D57463273 |
Summary: X-link: pytorch/pytorch#126469 Support torchao performance and accuracy tests in PT2 Benchmark Runner, using the inductor backend as the baseline. Reviewed By: jerryzh168 Differential Revision: D57463273
Summary: X-link: pytorch/benchmark#2268 Support torchao performance and accuracy tests in PT2 Benchmark Runner, using the inductor backend as the baseline. Test Plan: ``` $ buck2 run mode/opt //caffe2/benchmarks/dynamo:torchbench -- --only BERT_pytorch --bfloat16 --quantization int8dynamic --performance --inference --print-memory loading model: 0it [00:50, ?it/s] cuda eval BERT_pytorch memory: eager: 0.75 GB, dynamo: 0.75 GB, ratio: 1.00 running benchmark: 100% 1.003x ``` Reviewed By: jerryzh168 Differential Revision: D57463273
This pull request was exported from Phabricator. Differential Revision: D57463273 |
Summary: X-link: pytorch/pytorch#126469 Support torchao performance and accuracy tests in PT2 Benchmark Runner, using the inductor backend as the baseline. Reviewed By: jerryzh168 Differential Revision: D57463273
Summary: X-link: pytorch/benchmark#2268 Support torchao performance and accuracy tests in PT2 Benchmark Runner, using the inductor backend as the baseline. Test Plan: ``` $ buck2 run mode/opt //caffe2/benchmarks/dynamo:torchbench -- --only BERT_pytorch --bfloat16 --quantization int8dynamic --performance --inference --print-memory loading model: 0it [00:50, ?it/s] cuda eval BERT_pytorch memory: eager: 0.75 GB, dynamo: 0.75 GB, ratio: 1.00 running benchmark: 100% 1.003x ``` Reviewed By: jerryzh168 Differential Revision: D57463273
Summary: X-link: pytorch/pytorch#126469 Support torchao performance and accuracy tests in PT2 Benchmark Runner, using the inductor backend as the baseline. Reviewed By: jerryzh168 Differential Revision: D57463273
This pull request was exported from Phabricator. Differential Revision: D57463273 |
Summary: X-link: pytorch/pytorch#126469 Support torchao performance and accuracy tests in PT2 Benchmark Runner, using the inductor backend as the baseline. Reviewed By: jerryzh168 Differential Revision: D57463273
Summary: X-link: pytorch/benchmark#2268 Support torchao performance and accuracy tests in PT2 Benchmark Runner, using the inductor backend as the baseline. Test Plan: ``` $ buck2 run mode/opt //caffe2/benchmarks/dynamo:torchbench -- --only BERT_pytorch --bfloat16 --quantization int8dynamic --performance --inference --print-memory loading model: 0it [00:50, ?it/s] cuda eval BERT_pytorch memory: eager: 0.75 GB, dynamo: 0.75 GB, ratio: 1.00 running benchmark: 100% 1.003x ``` Reviewed By: jerryzh168 Differential Revision: D57463273
This pull request was exported from Phabricator. Differential Revision: D57463273 |
Summary: X-link: pytorch/pytorch#126469 Support torchao performance and accuracy tests in PT2 Benchmark Runner, using the inductor backend as the baseline. Reviewed By: jerryzh168 Differential Revision: D57463273
Summary: X-link: pytorch/benchmark#2268 Support torchao performance and accuracy tests in PT2 Benchmark Runner, using the inductor backend as the baseline. Test Plan: ``` $ buck2 run mode/opt //caffe2/benchmarks/dynamo:torchbench -- --only BERT_pytorch --bfloat16 --quantization int8dynamic --performance --inference --print-memory loading model: 0it [00:50, ?it/s] cuda eval BERT_pytorch memory: eager: 0.75 GB, dynamo: 0.75 GB, ratio: 1.00 running benchmark: 100% 1.003x ``` Reviewed By: jerryzh168 Differential Revision: D57463273
This pull request was exported from Phabricator. Differential Revision: D57463273 |
Summary: X-link: pytorch/benchmark#2268 Support torchao performance and accuracy tests in PT2 Benchmark Runner, using the inductor backend as the baseline. Test Plan: ``` $ buck2 run mode/opt //caffe2/benchmarks/dynamo:torchbench -- --only BERT_pytorch --bfloat16 --quantization int8dynamic --performance --inference --print-memory loading model: 0it [00:50, ?it/s] cuda eval BERT_pytorch memory: eager: 0.75 GB, dynamo: 0.75 GB, ratio: 1.00 running benchmark: 100% 1.003x ``` Differential Revision: D57463273 Pulled By: xuzhao9
Summary: X-link: #2268 Support torchao performance and accuracy tests in PT2 Benchmark Runner, using the inductor backend as the baseline. X-link: pytorch/pytorch#126469 Reviewed By: jerryzh168 Differential Revision: D57463273 Pulled By: xuzhao9 fbshipit-source-id: 64520f18b63107ce5f07447ef7f4a8c841d9ff1f
Summary: X-link: pytorch/benchmark#2268 Support torchao performance and accuracy tests in PT2 Benchmark Runner, using the inductor backend as the baseline. Test Plan: ``` $ buck2 run mode/opt //caffe2/benchmarks/dynamo:torchbench -- --only BERT_pytorch --bfloat16 --quantization int8dynamic --performance --inference --print-memory loading model: 0it [00:50, ?it/s] cuda eval BERT_pytorch memory: eager: 0.75 GB, dynamo: 0.75 GB, ratio: 1.00 running benchmark: 100% 1.003x ``` Reviewed By: jerryzh168 Differential Revision: D57463273 Pull Request resolved: #126469 Approved by: https://github.com/huydhn
Summary: Support torchao performance and accuracy tests in PT2 Benchmark Runner, using the inductor backend as the baseline.
Reviewed By: jerryzh168
Differential Revision: D57463273