
Add support for intrinsics for NTuple{VecElement} #55118

Draft · wants to merge 12 commits into master from os/vectorized-intrinsics
Conversation

oscardssmith (Member)
This is the start towards proper SIMD support in Base. Currently the main things missing are support for masked/predicated instructions, and an intrinsic to load an NTuple{N, VecElement} of arbitrary size from a Memory. Many thanks to Alexandre Prieur for pair programming this with me, and to @vtjnash, @gbaraldi and @vchuravy for answering the 50 million questions about C++/LLVM/debugging.

julia> f(a,b) = Core.Intrinsics.mul_float(a, b)
julia> a = ntuple(i->VecElement(sqrt(i)), 8);

julia> @code_llvm f(a,a)
; Function Signature: f(NTuple{8, VecElement{Float64}}, NTuple{8, VecElement{Float64}})
;  @ REPL[1]:1 within `f`
define <8 x double> @julia_f_949(<8 x double> %"a::Tuple", <8 x double> %"b::Tuple") #0 {
top:
  %0 = fmul <8 x double> %"a::Tuple", %"b::Tuple"
  ret <8 x double> %0
}
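With this branch, the intrinsics accept NTuple{N, VecElement{T}} arguments directly, so user-level operators can be thin wrappers. A minimal sketch (the wrapper name `vmul` is hypothetical, not part of this PR, and it only runs on a Julia build that includes this branch):

```julia
# Sketch only: assumes a Julia build with this PR, where Core.Intrinsics
# operates elementwise on NTuple{N, VecElement{T}}. `vmul` is a
# hypothetical wrapper name, not API added by this PR.
vmul(a::NTuple{N, VecElement{Float64}}, b::NTuple{N, VecElement{Float64}}) where {N} =
    Core.Intrinsics.mul_float(a, b)

a = ntuple(i -> VecElement(sqrt(i)), 8)
vmul(a, a)  # lowers to a single `fmul <8 x double>`, as in the @code_llvm output above
```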

@nsajko nsajko added the compiler:simd instruction-level vectorization label Jul 13, 2024
@oscardssmith force-pushed the os/vectorized-intrinsics branch from 7ee1df5 to 6aced19 on July 28, 2024
src/intrinsics.cpp (outdated review thread, resolved)
@oscardssmith oscardssmith marked this pull request as draft July 28, 2024 12:01
tecosaur (Contributor) commented on Sep 9, 2024

As one of the people rather excited about this, I'm wondering whether I might hope to see this progress to a non-draft PR in the near future, or whether there is some blocker or major further work needed? 🙂

oscardssmith (Member, Author)

I'm unlikely to have time to do everything that will likely be necessary to finish this off. There are no blockers, just a bunch more work.

@vchuravy force-pushed the os/vectorized-intrinsics branch from e05da0d to b5117ac on October 7, 2024
export vload, vstore!, natural_vecwidth

# TODO: See C# and Co Vec type
# TODO: Hardware portable vector types...
Review comment from a Contributor:
Would this include vscale? 👀

@vchuravy force-pushed the os/vectorized-intrinsics branch from 4977f8c to 71dd9ab on November 4, 2024
import Core.Intrinsics: add_float, sub_float, mul_float, div_float, muladd_float, neg_float

## floating point promotions ##
promote_rule(::Type{Vec{N, Float32}}, ::Type{Vec{N, Float16}}) where N = Vec{N, Float32}
Review comment from a Member:
It is not obvious to me that these should be defined. When you are doing low-level SIMD stuff you probably don't want to accidentally promote things, and in cases where you really do want to work with different types, an explicit convert might be better for clarity?
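To illustrate the explicit-convert alternative suggested here: instead of a promote_rule, callers could opt in via a widening helper. This is a sketch with a hypothetical helper name (`widen_to_f32`), written in stock Julia without any of this PR's machinery:

```julia
# Hypothetical explicit-widening helper (not part of the PR): callers opt in
# to mixed-precision arithmetic instead of relying on silent promotion.
widen_to_f32(v::NTuple{N, VecElement{Float16}}) where {N} =
    ntuple(i -> VecElement(Float32(v[i].value)), Val(N))

v16 = ntuple(i -> VecElement(Float16(i)), 4)
widen_to_f32(v16)  # NTuple{4, VecElement{Float32}}
```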

Labels
compiler:simd instruction-level vectorization
Projects
None yet
Development

6 participants