[Fixbug] Fix for softmmax cpu causing issues #437

fishingguy456 · 2024-03-09T16:40:59Z

moved implement_cpu to the cpu task

works on 8x8 at least but bad exp save for omp changes working and faster than pytorch works and is fast but exp is WIP remove useless files minor changes for rebase delete trash fix trash fix trash initial commit works on 8x8 at least but bad exp save for omp changes working and faster than pytorch works and is fast but exp is WIP remove useless files minor changes for rebase delete trash fix trash fix trash change imports fix for diff size, compiledmodule error fix

works on 8x8 at least but bad exp save for omp changes working and faster than pytorch works and is fast but exp is WIP remove useless files minor changes for rebase delete trash fix trash fix trash

… of 8

…e more

…rm` during operator fusion pass (#437) Closes #393 I spent some time looking into the issue without much progress, but I first found that the error message in the linked issue disappeared after commenting out either the `resolve_variant_pass()` or `fuse_operator_pass()` [here](https://github.com/CentML/hidet/blob/bfbb4db6d7792ed3de3be4e9702e597b8fbbe373/python/hidet/graph/transforms/__init__.py#L55-L61). Then, I found that simply adding the `EmbeddingBagOp` to the `NOT_FUSIBLE` set resolves the error. It is a workaround for now, but I am unaware of better solutions.

vadiklyutiy · 2024-12-22T11:18:15Z

@yaoyaoding
Do you remember about this changes? Do we need it?

…rm` during operator fusion pass (#437) Closes #393 I spent some time looking into the issue without much progress, but I first found that the error message in the linked issue disappeared after commenting out either the `resolve_variant_pass()` or `fuse_operator_pass()` [here](https://github.com/CentML/hidet/blob/bfbb4db6d7792ed3de3be4e9702e597b8fbbe373/python/hidet/graph/transforms/__init__.py#L55-L61). Then, I found that simply adding the `EmbeddingBagOp` to the `NOT_FUSIBLE` set resolves the error. It is a workaround for now, but I am unaware of better solutions.

yaoyaoding · 2025-01-06T20:44:37Z

Seems there is some problem in the non-fp32 softmax but I don't remember the exact problem. But it's okay to close this and add a PR to fix the problem by fixing the operator template.

fishingguy456 · 2025-01-06T21:01:19Z

I don't remember the issue exactly but I think it had something to do with the kernel working in isolation but not when it was included in a larger model graph because I put one of the functions in the wrong place. The change is simple so it can just be incorporated in another PR.

fishingguy456 added 30 commits August 4, 2023 11:46

works on multidimensional, axis=-1

7896c45

initial commit

ff90ed5

works on 8x8 at least but bad exp save for omp changes working and faster than pytorch works and is fast but exp is WIP remove useless files minor changes for rebase delete trash fix trash fix trash

change imports

fc61204

fix for diff size, compiledmodule error fix

f84201f

works on multidimensional, axis=-1

6f2e43c

initial commit

25f22cf

works on 8x8 at least but bad exp save for omp changes working and faster than pytorch works and is fast but exp is WIP remove useless files minor changes for rebase delete trash fix trash fix trash

initial commit

aafbb0f

works on 8x8 at least but bad exp save for omp changes working and faster than pytorch works and is fast but exp is WIP remove useless files minor changes for rebase delete trash fix trash fix trash

change imports

44993e2

fix for diff size, compiledmodule error fix

a86d866

works on multidimensional, axis=-1

b59ffa2

wrap up softmax, starting layernorm

7edf0eb

layernorm kinda works but not rly

44c04b3

better code for softmax

2ccc4b6

layernorm works for last layer

13ea5dc

move find sum and find max to registered function

d89036d

find max in registered func

b0659f6

not working softmax on not last dim, minor changes

904760b

layernorm works for any dims

29b7ba7

comments

0c8dc3a

tuning, fix for flowgraph operator resolve

77fe8d9

softmax works

ac40695

commented tensors dont work, i.e. axis is not last 2 AND not multiple…

4938a1f

… of 8

actually works rn frfr so fast 💯

1d447cf

cleanup

30224ce

more cleanup

67d4d56

random testing stuff

09ca2f8

allow epilogue

8352dd8

better epiloguing

27f6cbb

janky matmul resolve

cce1d42

fishingguy456 added 22 commits September 14, 2023 13:20

better epiloguing

8a1167e

janky matmul resolve

0f4876f

still epilogue problem?

49c072f

Merge remote-tracking branch 'origin/main'

0bd13d8

clean up for pr

de74231

fix test

9ab0bac

lint

f779a1d

minor pr edits

124fb09

pytests, cpu child class

6c4efd9

potential fix for failing tests? but prob not will have to investigat…

40fd71f

…e more

weird diff

90c4ffb

merge conflict resolve build.py

587ba64

remove shady batch mat mul

89d5646

lint thing

a3a4b03

move helpers to new file

aec95d2

lint

7a41b5c

change tolerance for flaky test for test_dynamic_shape

dcc6a45

Merge branch 'upstream/main'

dbfcc56

Merge branch 'upstream/main'

703f49e

fused scheduler task name

8f73d4c

Merge branch 'upstream/main'

713b016

implement cpu issue softmax

eafa10f

yaoyaoding changed the title ~~fix for softmmax cpu causing issues~~ [Fixbug] Fix for softmmax cpu causing issues Mar 13, 2024

yaoyaoding closed this Jan 6, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Fixbug] Fix for softmmax cpu causing issues #437

[Fixbug] Fix for softmmax cpu causing issues #437

fishingguy456 commented Mar 9, 2024

vadiklyutiy commented Dec 22, 2024

yaoyaoding commented Jan 6, 2025

fishingguy456 commented Jan 6, 2025

[Fixbug] Fix for softmmax cpu causing issues #437

[Fixbug] Fix for softmmax cpu causing issues #437

Conversation

fishingguy456 commented Mar 9, 2024

vadiklyutiy commented Dec 22, 2024

yaoyaoding commented Jan 6, 2025

fishingguy456 commented Jan 6, 2025