Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Fix memcopy direction for concatenate (#18058)
`cudaMemcpyAsync` can return a `cudaError_t` value which should be checked for runtime errors. We should preserve the behavior of `thrust::copy` which got replaced with the `cudaMemcpyAsync` call in #17584. The driver may do the right thing and infer the source and destination pointer location instead of using the `cudaMemcpyKind`, but this still leads to weird circumstances where the copy type in code is DtoD while the actual copy at runtime is HtoD Authors: - Tanmay Gujar (https://github.com/tgujar) Approvers: - Bradley Dice (https://github.com/bdice) - Karthikeyan (https://github.com/karthikeyann) - David Wendt (https://github.com/davidwendt) URL: #18058
- Loading branch information