Add critical `convergent` attributes to functions for OpenCL #805

seven-mile · 2024-08-24T09:18:32Z

ClangIR is still adding function attributes incrementally, and most of them are not that important at this early stage. This gap is already tracked in the code by `MissingFeatures::setFunctionAttributes()`.

But for OpenCL and other SIMT programs, the unit attribute `convergent` on functions is especially critical. Omitting it could lead to misoptimizations.

The good news is that the frontend does not need heavy analysis to decide where `convergent` belongs. In OG CodeGen, the logic is basically to add `convergent` to every function for all SIMT languages. LLVM then removes the attribute where it safely can (in the `PostOrderFunctionAttrs` pass, as noted in #840).

clangir/clang/lib/CodeGen/CGCall.cpp

Lines 2012 to 2019 in 826abe4

    
    if (LangOpts.assumeFunctionsAreConvergent()) {
      // Conservatively, mark all functions and calls in CUDA and OpenCL as
      // convergent (meaning, they may call an intrinsically convergent op, such
      // as __syncthreads() / barrier(), and so can't have certain optimizations
      // applied around them).  LLVM will remove this attribute where it safely
      // can.
      FuncAttrs.addAttribute(llvm::Attribute::Convergent);
    }

And for non-SIMT languages, there should not be any functional changes.
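To make the rule concrete, here is a minimal, self-contained sketch of the "mark everything convergent for SIMT languages, nothing otherwise" policy. It is a toy model only: `ToyLangOpts`, `ToyAttrSet`, and `defaultFunctionAttrs` are hypothetical names invented for illustration and are not part of clang, ClangIR, or LLVM, and the single `isSIMT` flag is an assumption standing in for whatever the real language options check.

```cpp
#include <cassert>
#include <set>
#include <string>

// Hypothetical stand-in for the language options (NOT clang's LangOptions).
struct ToyLangOpts {
  bool isSIMT = false; // assumption: true for OpenCL-like languages

  // Mirrors the name of the query used in OG CodeGen, but the body is a toy.
  bool assumeFunctionsAreConvergent() const { return isSIMT; }
};

// Attributes are modeled as plain strings for illustration.
using ToyAttrSet = std::set<std::string>;

// Conservatively mark every function convergent when the language asks for
// it; leave the attribute set untouched for every other language.
ToyAttrSet defaultFunctionAttrs(const ToyLangOpts &opts) {
  ToyAttrSet attrs;
  if (opts.assumeFunctionsAreConvergent())
    attrs.insert("convergent");
  return attrs;
}

// Small usage check: SIMT input gets the attribute, non-SIMT input does not.
void selfCheck() {
  ToyLangOpts simt;
  simt.isSIMT = true;
  assert(defaultFunctionAttrs(simt).count("convergent") == 1);

  ToyLangOpts host; // isSIMT defaults to false
  assert(defaultFunctionAttrs(host).empty());
}
```

The point of the sketch is that the decision depends only on the language, not on the function body, which is why no frontend analysis is needed.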

The only challenge here is that the test case for OG OpenCL CodeGen, clang/test/CodeGenOpenCL/convergent.cl, is a bit beyond the current capabilities of ClangIR. (It may rely on #803 to avoid strange poison propagation.) I would recommend writing a trivial test of our own to exercise the code paths for ConvergentAttr.


seven-mile · 2024-09-11T05:48:39Z

Note that calls also require this attribute.

The generation of attributes for Func and Call in ClangIR should be unified through a mechanism like ConstructAttributeList. For the LLVM lowering, however, amending the LLVM attributes of the Call op is not yet implemented.
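One way to picture the unification is a single helper that both the function emitter and the call emitter consult, so the two can never disagree about `convergent`. The sketch below is an assumption-laden toy: `ToyLangOpts`, `ToyFunc`, `ToyCall`, `constructAttributeList`, `emitFunc`, and `emitCall` are hypothetical names for illustration, and the helper only borrows the spirit of ConstructAttributeList, not its actual signature or behavior.

```cpp
#include <cassert>
#include <set>
#include <string>
#include <utility>

// Hypothetical language options (not a real clang or ClangIR type).
struct ToyLangOpts {
  bool assumeConvergent = false; // assumption: set for SIMT languages
};

using ToyAttrSet = std::set<std::string>;

// Toy models of a function definition and a call site.
struct ToyFunc {
  std::string name;
  ToyAttrSet attrs;
};
struct ToyCall {
  std::string callee;
  ToyAttrSet attrs;
};

// Single source of truth for the attribute list. Both emitters below use it,
// so functions and calls always receive the same attributes.
ToyAttrSet constructAttributeList(const ToyLangOpts &opts) {
  ToyAttrSet attrs;
  if (opts.assumeConvergent)
    attrs.insert("convergent");
  return attrs;
}

ToyFunc emitFunc(const ToyLangOpts &opts, std::string name) {
  return ToyFunc{std::move(name), constructAttributeList(opts)};
}

ToyCall emitCall(const ToyLangOpts &opts, std::string callee) {
  return ToyCall{std::move(callee), constructAttributeList(opts)};
}

// Small usage check: a function and a call to it carry identical attributes.
void selfCheck() {
  ToyLangOpts simt;
  simt.assumeConvergent = true;
  ToyFunc fn = emitFunc(simt, "kernel_helper");
  ToyCall call = emitCall(simt, "kernel_helper");
  assert(fn.attrs == call.attrs);
  assert(call.attrs.count("convergent") == 1);
}
```

Routing both paths through one helper is a design choice that trades a little indirection for the guarantee that a later change to the attribute policy reaches call sites as well as definitions.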

…ages (llvm#840) Fix llvm#805. This PR includes an end-to-end implementation. The `convergent` attribute is set depending on the language, which is wrapped as `langOpts.assumeFunctionsAreConvergent()`. Therefore, in ClangIR, every `cir.func` under `#cir.lang<opencl_c>` is marked convergent. After lowering to LLVM IR, the `PostOrderFunctionAttrs` pass removes the unnecessary `convergent` attributes. In other words, `convergent` still appears on every function with `-O0`, but not at the default optimization level. The test taken from `CodeGenOpenCL/convergent.cl` is a bit complicated, but its core is that `convergent` is set properly for `convfun()`, `non_convfun()`, `f()`, and `g()`. The merge of the two `if`s is more or less a result of generating the same LLVM IR as OG.

seven-mile mentioned this issue Aug 24, 2024

[GSoC] Add OpenCL support to compile GPU kernels #689

Closed

seven-mile added the invalid (This doesn't seem right) label Aug 26, 2024

seven-mile mentioned this issue Sep 14, 2024

[CIR][Dialect] Add convergent attribute to functions for SIMT languages #840

Merged

bcardosolopes closed this as completed in #840 Sep 16, 2024

bcardosolopes closed this as completed in ba8c248 Sep 16, 2024
