
[BF16] For the LayoutLMForSequenceClassification model on stock PyTorch, gelu cost time on PVC-1100 is worse than A100 * ratio #800

Open

xiaowangintel opened this issue Aug 22, 2024 · 2 comments

xiaowangintel (Contributor) commented Aug 22, 2024

🐛 Describe the bug

For more details, please refer to https://jira.devtools.intel.com/browse/PYTORCHDGQ-5064 and https://jira.devtools.intel.com/browse/PYTORCHDGQ-5089?filter=-2.

Versions

pytorch commit: 03480213dea1f60f6d12e7348904d2f3ef7314d0
torch-xpu-ops commit: 718bc42c667539977e5eadb11ea4dec602544bf2
driver: hotfix_agama-ci-devel-881.19
pti: l_intel-pti-dev_p_0.9.0.38_offline.sh
basekit: l_BaseKit_p_2024.2.1.100_offline.sh
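The issue itself carries no standalone reproducer (the details live in the linked JIRA tickets). As a rough illustration of the operation under discussion, here is a minimal pure-Python sketch of the exact erf-based GELU with a crude wall-clock timing harness. This is an assumption-laden stand-in, not the original benchmark: the real measurement would run `torch.nn.functional.gelu` on BF16 tensors on an XPU device, and the data sizes and iteration counts below are arbitrary.

```python
import math
import time

def gelu(x: float) -> float:
    # Exact (erf-based) GELU: 0.5 * x * (1 + erf(x / sqrt(2))).
    # PyTorch's default gelu computes the same formula elementwise.
    return 0.5 * x * (1.0 + math.erf(x / math.sqrt(2.0)))

def bench(fn, data, iters=100):
    # Crude wall-clock timing of repeated elementwise application.
    # A real GPU benchmark would need device synchronization and warm-up.
    start = time.perf_counter()
    for _ in range(iters):
        out = [fn(v) for v in data]
    return time.perf_counter() - start, out

# Arbitrary test vector alternating in sign.
data = [(-1.0) ** i * (i / 1000.0) for i in range(1000)]
elapsed, out = bench(gelu, data)
print(f"gelu over {len(data)} elements x 100 iters: {elapsed:.4f}s")
```

On the actual hardware comparison, the analogous BF16 call would be `torch.nn.functional.gelu(t)` with `t = torch.randn(..., dtype=torch.bfloat16, device="xpu")`, timed around `torch.xpu.synchronize()` barriers.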

xytintel added the loops_kernel (Loops Kernel Backbone) label Sep 10, 2024
chuanqi129 added this to the PT2.6 milestone Oct 14, 2024
retonym (Contributor) commented Nov 19, 2024

XPU performance work is not targeted for PT 2.6.

retonym modified the milestones: PT2.6, PT2.7 Nov 19, 2024
weishi-deng (Contributor) commented:

Reran this test; the performance for gelu is now reasonable.

5 participants