
Support Half/BFloat16 in native_group_norm (needs accuracy fix) #7846

Open

swolchok wants to merge 2 commits into base: main

Conversation

@swolchok (Contributor) commented Jan 22, 2025

No description provided.

[ghstack-poisoned]
pytorch-bot bot commented Jan 22, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/7846

Note: Links to docs will display an error until the docs builds have been completed.

❌ 3 New Failures

As of commit d3ecd5b with merge base 03cba2c:

NEW FAILURES - The following jobs have failed:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jan 22, 2025
swolchok added a commit that referenced this pull request Jan 22, 2025
ghstack-source-id: 938abf9d9ceafbd9d492d099722056ca1b124d44
ghstack-comment-id: 2608331848
Pull Request resolved: #7846
@swolchok swolchok added the release notes: ops & kernels Changes to the opset and any new / changed kernel implementations label Jan 22, 2025
@swolchok swolchok requested review from dbort and manuelcandales and removed request for dbort January 22, 2025 21:44
Review comment on:

    EXPECT_TENSOR_CLOSE_WITH_TOL(
        out0,
        out0_expected,
        2e-1,

Contributor:

That's 20%. This doesn't make me feel comfortable.

@swolchok (Author) commented Jan 23, 2025:

group_norm is one of the ops that automatic mixed precision will autocast to float32: https://intel.github.io/intel-extension-for-pytorch/xpu/1.10.200+gpu/tutorials/features/amp.html

I think the norm ops are just particularly prone to roundoff error, but I'm certainly not a numerical analysis person.

(unresolving for posterity)
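To make the roundoff point above concrete, here is an illustrative NumPy sketch (not the ExecuTorch kernel, and the shapes and group count are arbitrary assumptions) that runs a naive group norm end to end in float16 and compares it against a float64 reference. Forcing the mean/variance reductions to accumulate in the input dtype exposes the kind of error the norm ops are prone to:

```python
import numpy as np

def group_norm(x, num_groups, eps=1e-5):
    # Naive group norm over an (N, C, H, W) array; reductions and
    # arithmetic are forced into x's own dtype to expose roundoff.
    n, c, h, w = x.shape
    g = x.reshape(n, num_groups, -1)
    mean = g.mean(axis=-1, keepdims=True, dtype=x.dtype)
    var = g.var(axis=-1, keepdims=True, dtype=x.dtype)
    out = (g - mean) / np.sqrt(var + x.dtype.type(eps))
    return out.reshape(n, c, h, w)

rng = np.random.default_rng(0)
x = rng.normal(size=(1, 8, 16, 16))
ref = group_norm(x, num_groups=4)                      # float64 reference
half = group_norm(x.astype(np.float16), num_groups=4)  # float16 end to end
max_err = float(np.max(np.abs(half.astype(np.float64) - ref)))
print(max_err)
```

With float16's ~3 decimal digits of precision, the elementwise rounding alone puts the worst-case error well above float32 levels, which is consistent with AMP choosing to autocast group_norm to float32.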

@swolchok (Author) commented:

Dug in a little further. Here's the PR that originally made PyTorch group_norm support Half: https://github.com/pytorch/pytorch/pull/100234/files#diff-7927db349f568afca2de9b94d74ea5c3b8cb468cb6a433d0cc1e61e65c515a36

It looks like that test uses atol=rtol=5e-3. I think it's reasonable to argue that if we can't get the tolerances to be broadly similar, then we have a correctness issue and thus don't actually support Half. I'll see what I can do; this one might have to wait for code sharing.
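For context on why 2e-1 versus 5e-3 matters: mixed-tolerance checks of the torch.allclose form pass when |actual − expected| ≤ atol + rtol·|expected|. Assuming EXPECT_TENSOR_CLOSE_WITH_TOL follows that same criterion (an assumption, not verified against the ExecuTorch source), a 2e-1 tolerance admits roughly 20% error that 5e-3 would reject. A minimal sketch:

```python
def close_with_tol(actual, expected, rtol, atol):
    # Mixed-tolerance check in the style of torch.allclose:
    # pass iff |a - e| <= atol + rtol * |e| for every element.
    return all(abs(a - e) <= atol + rtol * abs(e)
               for a, e in zip(actual, expected))

expected = [1.0, -2.0, 0.5]
actual = [e * 1.15 for e in expected]  # a uniform 15% error

print(close_with_tol(actual, expected, rtol=2e-1, atol=2e-1))  # True
print(close_with_tol(actual, expected, rtol=5e-3, atol=5e-3))  # False
```

A 15% error on every element sails through the looser tolerance, which is the reviewer's concern above.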

@swolchok (Author) commented:

The test doesn't pass with atol=rtol=5e-3. Holding off on group_norm until we have code sharing.

@manuelcandales manuelcandales self-requested a review January 23, 2025 17:19
[ghstack-poisoned]
swolchok added a commit that referenced this pull request Jan 23, 2025
ghstack-source-id: 1deac3151e791e9b04da6b08e800eaac1d17111a
ghstack-comment-id: 2608331848
Pull Request resolved: #7846
@swolchok changed the title from "Support Half/BFloat16 in native_group_norm" to "Support Half/BFloat16 in native_group_norm (needs accuracy fix)" Jan 23, 2025