Add CeedBasisApplyAdd #1644

jeremylt · 2024-08-06T19:22:40Z

WIP, GPU impl and CeedBasisApplyAddAtPoints to come

jeremylt · 2024-08-06T22:44:10Z

Ok, design decision point - the usage really only makes sense for CEED_TRANSPOSE, especially for why I need this, so I made that an explicit requirement.

For GPU, I think I'll want separate Apply vs ApplyTranspose kernels in some cases?

jrwrigh · 2024-08-07T15:57:30Z

> the usage really only makes sense for CEED_TRANSPOSE especially for why I need this, so I made that an explicit requirement.

Agreed that keeping the transpose mode as an input, but verifying that it is always CEED_TRANSPOSE, works well. It might also be worth stating that in the docstring, i.e.:

  @param[in]  t_mode    @ref CEED_TRANSPOSE to apply the transpose, mapping from quadrature points to nodes. CEED_NOTRANSPOSE is not valid for CeedBasisApplyAdd.
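For illustration, the "accept t_mode but verify it" idea discussed above could look roughly like the sketch below. All names here (SketchTransposeMode, SketchCheckApplyAddMode, the error codes) are hypothetical stand-ins, not the libCEED definitions, and the actual check in the PR may be written differently.

```c
#include <assert.h>
#include <stdio.h>

// Hypothetical stand-ins for the transpose-mode enum and error codes (assumed, not libCEED's)
typedef enum { SKETCH_NOTRANSPOSE, SKETCH_TRANSPOSE } SketchTransposeMode;
enum { SKETCH_OK = 0, SKETCH_ERROR_UNSUPPORTED = 1 };

// Keep t_mode in the signature for parity with the overwrite variant,
// but reject anything other than the transpose mode up front.
int SketchCheckApplyAddMode(SketchTransposeMode t_mode) {
  if (t_mode != SKETCH_TRANSPOSE) {
    fprintf(stderr, "ApplyAdd sketch: only the transpose mode is supported\n");
    return SKETCH_ERROR_UNSUPPORTED;
  }
  return SKETCH_OK;
}
```

Keeping the argument means the add variant can share a call signature with the overwrite variant, while the early check turns a misuse into an explicit error instead of a silently wrong result.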

jeremylt · 2024-08-08T22:55:03Z

Update - CUDA/HIP ref and shared are good, need to tackle MAGMA

nbeams · 2024-08-08T23:12:44Z

I'm not able to follow libCEED activity as closely these days, but please ping me if you run into any issues with the MAGMA backend.

jeremylt · 2024-08-08T23:14:19Z

Thanks! The TL;DR here is that I want CeedBasisApplyAdd to sum into the target vector for CEED_TRANSPOSE with CEED_EVAL_INTERP and CEED_EVAL_GRAD. At a glance it doesn't look too tricky for MAGMA, but I haven't dug into it yet.
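As a rough picture of what "sum into the target vector" means, here is a minimal dense sketch that contrasts overwriting with accumulating for a transpose application. It assumes a hypothetical row-major Q x P matrix B and a hypothetical helper SketchApplyTranspose; it is not the libCEED implementation, which works per element, per component, and per backend.

```c
#include <assert.h>
#include <stddef.h>

// Hypothetical helper: computes v = B^T u (is_add == 0) or v += B^T u (is_add != 0).
// B is Q x P, stored row-major; u has length Q (quadrature points); v has length P (nodes).
void SketchApplyTranspose(size_t Q, size_t P, const double *B, const double *u, double *v, int is_add) {
  assert(B && u && v);
  for (size_t j = 0; j < P; j++) {
    double sum = 0.0;

    // Column j of B dotted with u gives entry j of B^T u
    for (size_t i = 0; i < Q; i++) sum += B[i * P + j] * u[i];
    if (is_add) v[j] += sum;  // ApplyAdd-style: keep what is already in v
    else v[j] = sum;          // Apply-style: overwrite v
  }
}
```

With the add form, a caller can accumulate several contributions into the same target without zeroing it or summing through a temporary vector.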

jeremylt · 2024-08-09T20:56:28Z

With my changes for MAGMA, only tensor product grad is correctly summing into the target vector. It's unclear why, but I'll dig more on Monday.

nbeams

Did a quick review, found some copy-paste bugs

backends/magma/ceed-magma-basis.c

include/ceed/jit-source/magma/magma-common-tensor.h

nbeams · 2024-08-10T23:50:13Z

As a side note, looking at the new tests in this PR reminded me that we might want to check on the test coverage for the non-tensor basis in MAGMA. Now that we have two options -- the RTC kernels for lower orders and standard library calls for the rest -- I'm not sure how often, if at all, the "standard" route is being tested. (For t363 I manually lowered MAGMA_NONTENSOR_CUSTOM_KERNEL_MAX_P/MAGMA_NONTENSOR_CUSTOM_KERNEL_MAX_Q to make sure it also passed with the non-custom-kernel path -- and it did -- but of course that won't happen in the usual testing).
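To make the coverage concern concrete, the sketch below shows one way a size-threshold dispatch between a custom-kernel path and a standard-library path could be structured. The macro names, default values, and the "both P and Q under the limit" rule are assumptions for illustration only; the real selection logic in the MAGMA backend may differ.

```c
#include <assert.h>

// Hypothetical thresholds; wrapping them in #ifndef lets a test build override them,
// e.g. with -DSKETCH_CUSTOM_KERNEL_MAX_P=0, to force the standard path.
#ifndef SKETCH_CUSTOM_KERNEL_MAX_P
#define SKETCH_CUSTOM_KERNEL_MAX_P 40
#endif
#ifndef SKETCH_CUSTOM_KERNEL_MAX_Q
#define SKETCH_CUSTOM_KERNEL_MAX_Q 40
#endif

typedef enum { SKETCH_PATH_CUSTOM_KERNEL, SKETCH_PATH_STANDARD } SketchPath;

// Assumed rule: use the custom kernel only when both sizes are within the limits
SketchPath SketchSelectPath(int P, int Q) {
  return (P <= SKETCH_CUSTOM_KERNEL_MAX_P && Q <= SKETCH_CUSTOM_KERNEL_MAX_Q) ? SKETCH_PATH_CUSTOM_KERNEL : SKETCH_PATH_STANDARD;
}
```

If the test suite only uses small bases, every test lands on the custom-kernel branch, which is why lowering the limits by hand (as described above for t363) is needed to exercise the other branch.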

zatkins-dev

Overall I like this refactor. I noticed a few places where it is probably clearer to index an array instead of doing pointer arithmetic, but if you disagree that's fine.

include/ceed/jit-source/cuda/cuda-ref-basis-nontensor-templates.h

include/ceed/jit-source/hip/hip-ref-basis-nontensor-templates.h

include/ceed/jit-source/hip/hip-shared-basis-tensor.h

include/ceed/jit-source/cuda/cuda-shared-basis-tensor.h

include/ceed/jit-source/magma/magma-basis-interp-deriv-nontensor.h
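The style point in this review, indexing instead of pointer arithmetic, can be illustrated with two equivalent accumulation loops. The function names are hypothetical and the loops are not taken from the files listed above; they only show the two spellings side by side.

```c
#include <assert.h>
#include <stddef.h>

// Pointer-arithmetic style: correct, but the reader has to decode each dereference
void SketchAxpyPointer(size_t n, double alpha, const double *in, double *out) {
  for (size_t i = 0; i < n; i++) *(out + i) += alpha * *(in + i);
}

// Indexing style: same behavior, with the accessed element visible at a glance
void SketchAxpyIndexed(size_t n, double alpha, const double *in, double *out) {
  for (size_t i = 0; i < n; i++) out[i] += alpha * in[i];
}
```

Both forms compile to the same accesses; the difference is readability, which matters more once offsets combine element, component, and node strides.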

style - consistently use indexing over pointer arithmetic
Co-authored-by: Zach Atkins

jeremylt added the enhancement and performance labels Aug 6, 2024

jeremylt self-assigned this Aug 6, 2024

jeremylt force-pushed the jeremy/basis-apply-add branch 2 times, most recently from 9aa6d80 to 855975c on August 6, 2024 22:34

jeremylt force-pushed the jeremy/basis-apply-add branch 2 times, most recently from 8178602 to a322f13 on August 7, 2024 15:15

basis - add CeedBasisApplyAdd + CPU impl

652a51b

jeremylt force-pushed the jeremy/basis-apply-add branch from a322f13 to 652a51b on August 7, 2024 18:23

jeremylt added 2 commits August 8, 2024 09:45

basis - add ref GPU ApplyAdd

e6edef5

basis - add shared GPU ApplyAdd

5e414ad

jeremylt force-pushed the jeremy/basis-apply-add branch from eac281b to 5e414ad on August 8, 2024 22:06

jeremylt force-pushed the jeremy/basis-apply-add branch 2 times, most recently from 30a674e to bd66db0 on August 9, 2024 20:02

nbeams reviewed Aug 9, 2024

backends/magma/ceed-magma-basis.c (outdated)

backends/magma/ceed-magma-basis.c (outdated)

backends/magma/ceed-magma-basis.c (outdated)

nbeams reviewed Aug 9, 2024

include/ceed/jit-source/magma/magma-common-tensor.h (outdated)

basis - add MAGMA ApplyAdd

18d93ab

jeremylt force-pushed the jeremy/basis-apply-add branch from bd66db0 to 18d93ab on August 9, 2024 23:01

basis - add CeedBasisApplyAddAtPoints + default impl

3b2de1f

jeremylt force-pushed the jeremy/basis-apply-add branch 2 times, most recently from 56c8d78 to f5cd871 on August 12, 2024 21:50

basis - add GPU ApplyAddAtPoints

311be37

jeremylt force-pushed the jeremy/basis-apply-add branch from f5cd871 to 311be37 on August 12, 2024 22:06

jeremylt added the 1-In Review label Aug 12, 2024

jeremylt mentioned this pull request Aug 13, 2024

Skip duplicate transpose restrictions #1645

Merged

tidy - add extra assert to fix clang-tidy

585c252

zatkins-dev approved these changes Aug 13, 2024


jeremylt and others added 2 commits August 13, 2024 15:08

Apply suggestions from code review

8e5b6fd

style - consistently use indexing over pointer arithmetic
Co-authored-by: Zach Atkins

style - more pointer fixes

fbbb68f

jeremylt force-pushed the jeremy/basis-apply-add branch from 7b32c88 to fbbb68f on August 13, 2024 22:03

jeremylt merged commit db2becc into main Aug 13, 2024
23 of 24 checks passed

jeremylt deleted the jeremy/basis-apply-add branch August 13, 2024 22:30
