Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[E2E Accuracy] timm jx_nest_base amp_fp16 inference accuracy failed randomly #979

Open
mengfei25 opened this issue Oct 17, 2024 · 4 comments

Comments

@mengfei25
Copy link
Contributor

mengfei25 commented Oct 17, 2024

🐛 Describe the bug

Details in https://github.com/intel/torch-xpu-ops/actions/runs/11361002852

dev name batch_size accuracy
xpu jx_nest_base 8 fail_accuracy
xpu jx_nest_base 8 pass
xpu jx_nest_base 8 pass
xpu jx_nest_base 8 pass
xpu jx_nest_base 8 pass
xpu jx_nest_base 8 pass
xpu jx_nest_base 8 pass
xpu jx_nest_base 8 pass
xpu jx_nest_base 8 pass
xpu jx_nest_base 8 pass
xpu jx_nest_base 8 pass
xpu jx_nest_base 8 pass
xpu jx_nest_base 8 pass
xpu jx_nest_base 8 pass
xpu jx_nest_base 8 pass
xpu jx_nest_base 8 pass
xpu jx_nest_base 8 pass
xpu jx_nest_base 8 pass
xpu jx_nest_base 8 fail_accuracy

Versions

env:
pytorch: bdb42e7c944eb8c3bbfa0327e49e5db797a0bd92
torch-xpu-ops: 1d217ae
keep_torch_xpu_ops: false
python: 3.10
TRITON_COMMIT_ID: 91b14bf5593cf58a8541f3e6b9125600a867d4ef
TORCH_COMMIT_ID: bdb42e7c944eb8c3bbfa0327e49e5db797a0bd92
TRANSFORMERS_VERSION: 243e186efbf7fb93328dd6b34927a4e8c8f24395
DRIVER_VERSION: 803.61
KERNEL_VERSION: 5.15.0-73-generic #80-Ubuntu SMP Mon May 15 15:18:26 UTC 2023
BUNDLE_VERSION: 0.5.3
OS_PRETTY_NAME: Ubuntu 22.04.2 LTS
GCC_VERSION: 11

@retonym
Copy link
Contributor

retonym commented Nov 18, 2024

could not reproduce the random issue locally. This model passed in last weekly test.
Will check the condition in next weekly test.

@retonym
Copy link
Contributor

retonym commented Nov 19, 2024

amp_fp16 inference is not meta dashboard targeted datatype, move to milestone: PT2.7

@retonym retonym modified the milestones: PT2.6, PT2.7 Nov 19, 2024
@mengfei25
Copy link
Contributor Author

@DDEle
Copy link
Contributor

DDEle commented Feb 20, 2025

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants