-
Notifications
You must be signed in to change notification settings - Fork 242
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[WIP] Enable the transform tensor API to convert any NCHW tensor to NHWC tensor and vice versa #2003
base: develop
Are you sure you want to change the base?
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I guess that gputt cannot be used with OCL backend. If so then please use conditional compilation (see config.h) as needed.
return a.first < b.first; | ||
}); | ||
|
||
printf("Permutation for %s -> %s: ", src_layout.c_str(), dst_layout.c_str()); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Use MIOPEN_LOG_I2/I/W/E
macros, no printf please.
…nd ROCm support in CMake
…nstall the extra gputt dependency. It should be built like this: BUILDKIT_PROGRESS=plain docker build --no-cache -f Dockerfile.gputt -t rocm/miopen:ci-gputt . Then the usual miopen:ci invocation should instead use miopen:ci-gputt: docker run -it -v $HOME:$HOME --privileged --rm --device=/dev/kfd --device /dev/dri:/dev/dri:rw --volume /dev/dri:/dev/dri:rw -v /var/lib/docker/:/var/lib/docker --group-add video --cap-add=SYS_PTRACE --security-opt seccomp=unconfined rocm/miopen:ci-gputt fish
f1d20f7
to
c686acf
Compare
Co-authored-by: Vasilii Filippov <[email protected]>
…or transform by gpuTT functionality. In order for that gpuTT has been extended to support all data types required by MIOpen, including the fancy int8x4
LWPMIOPEN-125
We substitute the TensorTransfom implementation with a more generic GPU-based gpuTT library calls.
Few things still to be implemented: