RFC: test out new syntax for launch with type deduction #305

rolandschulz · 2025-04-12T06:26:21Z

Prototype to test out a solution for intel/llvm#17832 . Looking for review on the syntax. Is it OK to have to wrap the function in a lambda to gain the type deduction? Should the lambda be past as first or last argument?

t4c1 · 2025-04-14T07:18:16Z

Looks cleaner to me, but I would not define stuff in syclcompat namespace within cutlass

rolandschulz · 2025-04-14T14:48:22Z

I agree in general we shouldn't put things into syclcompat namespace.

For this PR I put it in syclcompat, because the purpose of this PR is to protype what we want to ask syclcompat to add (intel/llvm#17832).

Assuming it gets added to syclcompat and we switch our code to use it, we would require the latest version of syclcompat. That doesn't work because we want to support the latest released DPC++ version. Would it be reasonable to have in our headers the extra syclcompat code for older versions of DPC++ to avoid this dependency?

mehdi-goli · 2025-04-14T15:11:44Z

examples/cute/tutorial/sgemm_1_sycl.cpp

+namespace syclcompat {
+  template <class F, int Dim>
+  sycl::event launch(const sycl::nd_range<Dim> &range, sycl::queue q, const F& f) {
+    return q.parallel_for(detail::transform_nd_range<Dim>(range),  [=](sycl::nd_item<Dim>) { f(); });
+  }
+  template <class F, int Dim>
+  sycl::event launch(const sycl::nd_range<Dim> &range, const F& f) {
+    return launch(range, get_default_queue(), f);
+  }
+  // Alternative launch through dim3 objects
+  template <class F>
+  sycl::event launch(const dim3 &grid, const dim3 &threads, sycl::queue q, const F& f) {
+    return launch(sycl::nd_range<3>{grid * threads, threads}, q, f);
+  }
+  template <class F>
+  sycl::event launch(const dim3 &grid, const dim3 &threads, const F& f) {
+    return launch(grid, threads, get_default_queue(), f);
+  }
+}


Shall we put those as PR on sycl::compat repo?

intel/llvm#18021

FMarno · 2025-04-16T13:55:16Z

We can add the dim3 overloads into the sycl-cuda-compat PR (#276) under a different namespace while we wait for upstream changes to make it to the release. Should be no harm in that.

JackAKirk · 2025-04-17T09:05:11Z

examples/cute/tutorial/sgemm_1_sycl.cpp

-                                B, dB, sB, tB,
-                                C, dC, sC, tC,
-                                alpha, beta);
+  auto event = syclcompat::launch(dimGrid, dimBlock, [=]


I guess you only want to record this event if you are doing fine-grained profiling?
It is possible you could see a notable performance improvement by only recording events when required: see https://github.com/intel/llvm/pull/18021/files#r2048537558

Also if an analogue to cudaEventRecord in sycl would be useful to you then you could request creating an extension for e.g. oneapi_event_record that takes a sycl event.

rolandschulz added 2 commits April 11, 2025 23:20

use new launch syntax

a75637f

fix

b06f455

rolandschulz requested a review from aacostadiaz April 14, 2025 14:48

mehdi-goli reviewed Apr 14, 2025

View reviewed changes

JackAKirk reviewed Apr 17, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

RFC: test out new syntax for launch with type deduction #305

RFC: test out new syntax for launch with type deduction #305

rolandschulz commented Apr 12, 2025

t4c1 commented Apr 14, 2025

rolandschulz commented Apr 14, 2025

mehdi-goli Apr 14, 2025 •

edited

Loading

rolandschulz Apr 15, 2025

FMarno commented Apr 16, 2025 •

edited

Loading

JackAKirk Apr 17, 2025

JackAKirk Apr 17, 2025 •

edited

Loading

RFC: test out new syntax for launch with type deduction #305

Are you sure you want to change the base?

RFC: test out new syntax for launch with type deduction #305

Conversation

rolandschulz commented Apr 12, 2025

t4c1 commented Apr 14, 2025

rolandschulz commented Apr 14, 2025

mehdi-goli Apr 14, 2025 • edited Loading

Choose a reason for hiding this comment

rolandschulz Apr 15, 2025

Choose a reason for hiding this comment

FMarno commented Apr 16, 2025 • edited Loading

JackAKirk Apr 17, 2025

Choose a reason for hiding this comment

JackAKirk Apr 17, 2025 • edited Loading

Choose a reason for hiding this comment

mehdi-goli Apr 14, 2025 •

edited

Loading

FMarno commented Apr 16, 2025 •

edited

Loading

JackAKirk Apr 17, 2025 •

edited

Loading