[E2E] HF/Timm/Torchbench models got "eager_two_runs_differ" #1256

Open
Tracked by #1223
libohao1201 opened this issue Jan 7, 2025 · 1 comment

@libohao1201

🐛 Describe the bug

The following models got "eager_two_runs_differ"

| Platform | Suite | eager_two_runs_diff |
| --- | --- | --- |
| LNL | HF | |
| LNL | Timm | |
| LNL | Torchbench | Super_SloMo (train_eager_fp32/amp_bf16), pytorch_CycleGAN_and_pix2pix (train_eager_fp32/bf16/amp_bf16) |
| BMG | HF | |
| BMG | Timm | |
| BMG | Torchbench | Super_SloMo (train_eager_fp32), pytorch_CycleGAN_and_pix2pix (train_eager_fp32/bf16) |
| ARC | HF | DistilBertForMaskedLM (train_fp16_eager) |
| ARC | Timm | convnext_base (train_eager), jx_nest_base (train_eager), swin_base_patch4_window7_224 (train_eager), twins_pcpvt_base (train_eager), coat_lite_mini (train), convit_base, mobilevit_s, tnt_s_patch16_224 |
| ARC | Torchbench | hf_Reformer (train_eager), timm_regnet (train_eager_fp32) |
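
For context, the "eager_two_runs_differ" status indicates that two eager-mode runs of the same model did not match each other, which usually points to non-determinism rather than a compiler problem. The snippet below is a minimal, framework-free sketch of that kind of check, not the benchmark harness's actual code. The names `two_runs_differ`, `deterministic_step` and `noisy_step` are hypothetical, and Python's `random` module stands in for a model's seeded RNG.

```python
import math
import random
from typing import Callable, List, Sequence


def two_runs_differ(
    run_once: Callable[[], Sequence[float]],
    seed: int = 1337,
    rel_tol: float = 0.0,
    abs_tol: float = 0.0,
) -> bool:
    """Run the workload twice from the same seed and report any mismatch.

    With both tolerances at 0.0 the comparison is exact equality.
    """
    random.seed(seed)
    first = list(run_once())
    random.seed(seed)
    second = list(run_once())
    if len(first) != len(second):
        return True
    return any(
        not math.isclose(a, b, rel_tol=rel_tol, abs_tol=abs_tol)
        for a, b in zip(first, second)
    )


def deterministic_step() -> List[float]:
    # Depends only on the seeded global RNG, so reseeding reproduces it.
    return [random.random() for _ in range(4)]


# Hypothetical stand-in for a non-deterministic kernel: this generator is
# seeded from OS entropy and is not reset by random.seed().
_unseeded = random.Random()


def noisy_step() -> List[float]:
    return [random.random() + _unseeded.random() for _ in range(4)]


if __name__ == "__main__":
    print("deterministic_step differs:", two_runs_differ(deterministic_step))
    print("noisy_step differs:", two_runs_differ(noisy_step))
```

For a real model, `run_once` would presumably reset the framework's seeds, rerun one training step, and return the flattened outputs or gradients for comparison.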

Versions

Env:

- stock pytorch: https://github.com/pytorch/pytorch/tree/90b7dcf2c5ee13b892701822f2abbc0e64f5584d
  `pip install --pre torch==2.6.0.dev20241202+xpu torchvision==0.20.0.dev20241202+xpu torchaudio==2.5.0.dev20241202+xpu --index-url https://download.pytorch.org/whl/nightly/xpu`
- torch-xpu-ops: Commit: bf4bab1; Commit: 0f48ac0 (including #1187) - for UT
- Driver: 32.0.101.6314; 32.0.101.6252 (bmg)
- Conda: python 3.10
- transformer: 243e186efbf7fb93328dd6b34927a4e8c8f24395

@mengfei25
Contributor

mengfei25 commented Jan 10, 2025

Super_SloMo hits the same failure on PVC for FP32 training.

github-merge-queue bot pushed a commit that referenced this issue Jan 16, 2025
Last reference updated is 20240709
Related issues: 

- [x] #1216
- [x] #1217
- [x] #1219
- [x] #1220
- [ ] #1221
- [x] #1222
- [ ] #1256
- [ ] #1260
- [ ] #1261
- [ ] #1262
- [ ] #1263
- [ ] #1264
- [ ] #1273
- [ ] #1274
- [ ] #1275
- [ ] #1276
- [ ] #1277
- [ ] #1278
- [ ] #508
- [ ] #509
- [ ] #510