[Backend][LLVM] Runtime support for any bitwidth integer numpy input #493

zzzDavid · 2023-03-15T19:25:19Z

Summary

Up until this PR, the top function input/output argument type has been set to 64-bit integer type (for integer type args), and type casting is done inside the function body. This was due to the fact that numpy has only 8, 16, 32, 64-bit integer types.

This PR extends hcl.Array and LLVM runtime to support arbitrary bitwidth input arguments from numpy array.

Methods

Byte-as-field numpy array

To store arbitrary width integer data as numpy arrays, we use struct-type numpy arrays, with each byte as a field. Therefore, each integer scalar is represented as a struct of bytes, and the bytes are contiguous in the memory.

Arbitrary data representation

When input data is wider than 64-bit, it cannot be represented as a numpy scalar type. Instead, we use multidimensional lists of integers in Python to represent input tensors, because Python integers can have arbitrary bitwidth.

MLIR arbitrary bitwidth integer alignment

When passing data from numpy to an MLIR's ExecutionEngine as input arguments, we are creating C Struct from numpy ndarrays with the ctypes module in Python. Through a series of experiments, I found that the required alignment of such C Struct is not byte-level, instead, it depends on the integer bitwidth:

Integer type bitwidth (bit)	alignment(bit)
(0, 8]	8
(8, 16]	16
(16, 32]	32
(32, 64]	64
(64, 128]	128
(128, 256]	256
(256, 512]	512

Changes

make_anybitwidth_numpy_array is moved from ir_builder.py to utils.py
All field formats in the struct numpy array are set to unsigned, this makes sign extension in runtime easier to implement, and this change does not affect the creation of DenseAttr in constant tensor op's IRBuilder function.
hcl.Array.np_array is refactored and extended to support any bitwidth data

Limitations

This PR only upgrades Int and UInt types. Fixed/UFixed types are not covered, because fixed-to-integer pass needs to be updated in the IR first. Support for fixed-point type will be added by another PR.

…ges in IR first

… extension easier

chhzh123

LGTM. Do you have other things to add?

zzzDavid · 2023-07-23T18:10:48Z

ExecutionEngine randomly produces wrong results for bitwidth 513-1024, I'm still debugging this issue. Will update you once it's solved.

zzzDavid added 15 commits March 7, 2023 02:07

[Util] Move make_anywidth_numpy_array to utils

c64cfae

[IRBuilder] Fix shape issue with DenseElementsAttr creation

07e430c

Reconstructing LLVM backend runtime

c5fee64

[Util] Remove np.int128, np.int256, since they don't exist

7dcf4df

[Array] Extend hcl.array to support any bitwidth

fddd444

[Array] Add sign extension

172772b

[Runtime] copying back results is not necessary

9cd8bf8

[Array] Exclude changes in fixed type in this PR, since it needs chan…

abea845

…ges in IR first

[Util] Remove signedness in struct numpy representation, to make sign…

340ccec

… extension easier

[Array] Fix issue with fixed type overflow handling

9a28e52

Format with black

e930edb

[Lint] Fix lint errors

e10a28f

[Lint] Upgrade local pylint, fix errors

a873b89

[Test] Add test_irregular_bitwidth_input

506349a

[Test] Use random input

acb4e2f

zzzDavid added the enhancement label Mar 17, 2023

chhzh123 approved these changes Mar 26, 2023

View reviewed changes

zzzDavid mentioned this pull request Jul 23, 2023

[Transform][AnyWidthInteger] Fix issues with float type arg cornell-zhang/hcl-dialect#189

Merged

This was referenced Aug 21, 2023

[Runtime] Support arbitrary bitwidth integer in LLVM execution engine cornell-zhang/allo#37

Merged

[Runtime] Add (U)Fixed type LLVM simulation support cornell-zhang/allo#45

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Backend][LLVM] Runtime support for any bitwidth integer numpy input #493

[Backend][LLVM] Runtime support for any bitwidth integer numpy input #493

zzzDavid commented Mar 15, 2023 •

edited

Loading

chhzh123 left a comment

zzzDavid commented Jul 23, 2023

[Backend][LLVM] Runtime support for any bitwidth integer numpy input #493

Are you sure you want to change the base?

[Backend][LLVM] Runtime support for any bitwidth integer numpy input #493

Conversation

zzzDavid commented Mar 15, 2023 • edited Loading

Summary

Methods

Byte-as-field numpy array

Arbitrary data representation

MLIR arbitrary bitwidth integer alignment

Changes

Limitations

chhzh123 left a comment

Choose a reason for hiding this comment

zzzDavid commented Jul 23, 2023

zzzDavid commented Mar 15, 2023 •

edited

Loading