
Integrating generic_float struct for adding datatypes #3522

Merged: 64 commits, Nov 8, 2024
Conversation

@richagadgil (Contributor)

No description provided.

@richagadgil self-assigned this Oct 10, 2024
@pfultz2 (Collaborator) commented Oct 10, 2024

A couple of things:

  • Move generic_float to migraphx/generic_float.hpp
  • Add a specialization for std::numeric_limits
  • Add a specialization for migraphx::is_floating_point (and remove the half one)
  • Add a specialization for std::common_type (and remove the half one)
  • Add a test/generic_float.cpp with the fp32 tests I wrote, and add some tests for operator overloads
  • Add pragmas to disable the duplicate-branch warnings in the bit_cast function (see the sketch at the end of this comment)

The specializations should use the template type like:

template<unsigned int E, unsigned int M, unsigned int F>
class numeric_limits<migraphx::generic_float<E, M, F>>
{
...
};
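For the pragma item above, a minimal sketch of what the guards around bit_cast could look like; the diagnostic name -Wduplicated-branches is an assumption about which warning is being silenced (it is GCC-only, hence the clang exclusion):

// hypothetical guard; adjust the diagnostic name to whatever the build actually reports
#if defined(__GNUC__) && !defined(__clang__)
#pragma GCC diagnostic push
#pragma GCC diagnostic ignored "-Wduplicated-branches"
#endif

// ... bit_cast implementation ...

#if defined(__GNUC__) && !defined(__clang__)
#pragma GCC diagnostic pop
#endif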

@pfultz2 (Collaborator) commented Oct 10, 2024

Also, we can use the fp8 template type to reduce the number of common_type overloads:

template<unsigned int E, unsigned int M, unsigned int F, migraphx::fp8::f8_type T, bool FNUZ>
struct common_type<migraphx::generic_float<E, M, F>, migraphx::fp8::float8<T, FNUZ>> : std::common_type<float, float>
{};

template<unsigned int E, unsigned int M, unsigned int F, migraphx::fp8::f8_type T, bool FNUZ>
struct common_type<migraphx::fp8::float8<T, FNUZ>, migraphx::generic_float<E, M, F>> : std::common_type<float, float>
{};
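As a usage sketch (template parameters left generic rather than assuming concrete instantiations), these specializations make float the common type of any generic_float/float8 mix:

template<unsigned int E, unsigned int M, unsigned int F, migraphx::fp8::f8_type T, bool FNUZ>
void check_common_type()
{
    // sketch only: verifies the promotion provided by the specializations above
    static_assert(std::is_same<std::common_type_t<migraphx::generic_float<E, M, F>,
                                                   migraphx::fp8::float8<T, FNUZ>>,
                               float>{},
                  "mixed generic_float/float8 expressions have float as their common type");
}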

@pfultz2 (Collaborator) commented Oct 18, 2024

For the fp16 tests, we want tests similar to the fp8 ones, but instead of having an array lookup table, we would sample some values into a map and test that:

TEST_CASE(check_half_values)
{
    for(auto [x, f] : half_lut)
    {
        auto h = migraphx::bit_cast<migraphx::half>(x);
        if(std::isnan(f))
        {
            CHECK(std::isnan(h));
        }
        else if(std::isinf(f))
        {
            CHECK(std::isinf(h));
            CHECK((h < 0) == (f < 0));
            CHECK(bit_equal(x, migraphx::half(f)));
        }
        else
        {
            CHECK(migraphx::float_equal(float(h), f));
            CHECK(bit_equal(x, migraphx::half(f)));
        }
    }
}

I have a map of a thousand or so values we can use for this test. We will also want to test the numeric limits by checking that the bits match what we would expect:

TEST_CASE(check_numeric_limits)
{
    CHECK(bit_equal(std::numeric_limits<migraphx::half>::min(), uint16_t{0x0400}));
    CHECK(bit_equal(std::numeric_limits<migraphx::half>::lowest(), uint16_t{0xfbff}));
    CHECK(bit_equal(std::numeric_limits<migraphx::half>::max(), uint16_t{0x7bff}));
    CHECK(bit_equal(std::numeric_limits<migraphx::half>::epsilon(), uint16_t{0x1400}));
    CHECK(bit_equal(std::numeric_limits<migraphx::half>::denorm_min(), uint16_t{0x0001}));
    CHECK(bit_equal(std::numeric_limits<migraphx::half>::infinity(), uint16_t{0x7c00}));
    CHECK(bit_equal(std::numeric_limits<migraphx::half>::quiet_NaN(), uint16_t{0x7fff}));
    CHECK(bit_equal(std::numeric_limits<migraphx::half>::signaling_NaN(), uint16_t{0x7dff}));
}

In addition, it would be good to have some tests for overflow and underflow, such as std::numeric_limits<half>::max() + std::numeric_limits<float>::epsilon().
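A minimal sketch of what such checks could look like, assuming the float-to-half conversion rounds to nearest and overflows to infinity (the expected results here are assumptions, not taken from the merged test):

TEST_CASE(check_half_overflow_underflow)
{
    float fmax = float(std::numeric_limits<migraphx::half>::max());

    // float epsilon is far below half precision at this magnitude, so the sum
    // should round back to max() rather than overflow
    CHECK(migraphx::float_equal(float(migraphx::half(fmax + std::numeric_limits<float>::epsilon())), fmax));

    // doubling max() is past the half overflow threshold, so it should convert to infinity
    CHECK(std::isinf(float(migraphx::half(fmax * 2.0f))));

    // a value far below denorm_min() should underflow to zero
    CHECK(migraphx::float_equal(float(migraphx::half(1e-30f)), 0.0f));
}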

@causten requested a review from CharlieL7 October 22, 2024 16:45
test/half.cpp Outdated
CHECK(bit_equal(std::numeric_limits<migraphx::half>::signaling_NaN(), uint16_t{0x7d00}));
}

static const std::map<uint16_t, float> half_lut = {

We probably need to wrap this in a function to fix the tidy warning:

const std::map<uint16_t, float>& half_lut()
{
    static const std::map<uint16_t, float> result = { ... };
    return result;
}
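With that change, the loop in the earlier test would iterate over half_lut() (i.e. for(auto [x, f] : half_lut())) instead of a global, and the function-local static presumably also satisfies the tidy check against non-trivially-constructed globals while avoiding static initialization order issues.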

@pfultz2 reopened this Nov 4, 2024
@pfultz2 (Collaborator) commented Nov 4, 2024

Overall this looks good; we just need to fix the tidy warnings.

@pfultz2 (Collaborator) left a comment

Looks good, just need to fix the CI checks.


constexpr float32_parts get_parts(float f) { return migraphx::bit_cast<float32_parts>(f); }

#pragma pack(push, 1)

This needs to be surrounded by #ifdef _MSC_VER.
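A sketch of the suggested guard; the matching pop at the end of the header would be wrapped the same way:

#ifdef _MSC_VER
#pragma pack(push, 1)
#endif

// ... generic_float definition ...

#ifdef _MSC_VER
#pragma pack(pop)
#endif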


#pragma pack(push, 1)
template <unsigned int MantissaSize, unsigned int ExponentSize, unsigned int Flags = 0>
struct alignas(1) __attribute__((may_alias)) generic_float

You need the packed attribute for gcc/clang, but that probably breaks Windows. Instead of using macros, [[gnu::packed, gnu::may_alias]] may work.
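A sketch of the attribute-based alternative (whether MSVC merely warns about the unknown gnu:: attributes rather than rejecting them is not verified here):

template <unsigned int MantissaSize, unsigned int ExponentSize, unsigned int Flags = 0>
struct [[gnu::packed, gnu::may_alias]] generic_float
{
    // ...
};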


#pragma pack(push, 1)
template <unsigned int MantissaSize, unsigned int ExponentSize, unsigned int Flags = 0>
struct alignas(1) __attribute__((may_alias)) generic_float

Also, the alignment is wrong. It should be alignas((MantissaSize + ExponentSize + 1) / 8); I don't know if that compiles.
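For reference, a sketch of that suggestion; a dependent alignas expression is valid in a template, though every instantiation would need to yield a nonzero power of two:

template <unsigned int MantissaSize, unsigned int ExponentSize, unsigned int Flags = 0>
struct alignas((MantissaSize + ExponentSize + 1) / 8) generic_float
{
    // ...
};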

return temp;
}
};
#pragma pack(pop)

Also needs an #ifdef _MSC_VER.

@migraphx-bot (Collaborator)

Test | Batch | Rate new (9ae05a) | Rate old (624c8d) | Diff | Compare
torchvision-resnet50 64 3,260.10 3,261.66 -0.05%
torchvision-resnet50_fp16 64 6,987.01 6,990.31 -0.05%
torchvision-densenet121 32 2,435.66 2,436.87 -0.05%
torchvision-densenet121_fp16 32 4,094.32 4,089.75 0.11%
torchvision-inceptionv3 32 1,637.98 1,637.96 0.00%
torchvision-inceptionv3_fp16 32 2,763.22 2,767.26 -0.15%
cadene-inceptionv4 16 775.91 776.97 -0.14%
cadene-resnext64x4 16 811.84 811.00 0.10%
slim-mobilenet 64 7,536.11 7,537.82 -0.02%
slim-nasnetalarge 64 211.57 211.51 0.03%
slim-resnet50v2 64 3,506.73 3,504.05 0.08%
bert-mrpc-onnx 8 1,149.43 1,146.98 0.21%
bert-mrpc-tf 1 467.46 502.37 -6.95% 🔴
pytorch-examples-wlang-gru 1 421.12 421.14 -0.00%
pytorch-examples-wlang-lstm 1 389.03 402.51 -3.35% 🔴
torchvision-resnet50_1 1 775.94 800.87 -3.11% 🔴
cadene-dpn92_1 1 401.24 435.76 -7.92% 🔴
cadene-resnext101_1 1 382.72 383.41 -0.18%
onnx-taau-downsample 1 342.84 342.91 -0.02%
dlrm-criteoterabyte 1 33.34 33.35 -0.02%
dlrm-criteoterabyte_fp16 1 52.71 52.74 -0.06%
agentmodel 1 7,955.64 8,416.52 -5.48% 🔴
unet_fp16 2 58.96 58.97 -0.02%
resnet50v1_fp16 1 942.21 940.39 0.19%
resnet50v1_int8 1 1,021.08 1,022.33 -0.12%
bert_base_cased_fp16 64 1,170.08 1,171.13 -0.09%
bert_large_uncased_fp16 32 363.65 363.49 0.04%
bert_large_fp16 1 200.42 200.63 -0.11%
distilgpt2_fp16 16 2,202.27 2,202.77 -0.02%
yolov5s 1 549.41 538.22 2.08%
tinyllama 1 43.45 43.49 -0.08%
vicuna-fastchat 1 171.45 171.92 -0.27%
whisper-tiny-encoder 1 418.63 418.87 -0.06%
whisper-tiny-decoder 1 425.61 428.54 -0.68%

This build is not recommended to merge 🔴

@migraphx-bot (Collaborator)


     ✅ bert-mrpc-onnx: PASSED: MIGraphX meets tolerance

     ✅ bert-mrpc-tf: PASSED: MIGraphX meets tolerance

     ✅ pytorch-examples-wlang-gru: PASSED: MIGraphX meets tolerance

     ✅ pytorch-examples-wlang-lstm: PASSED: MIGraphX meets tolerance

     ✅ torchvision-resnet50_1: PASSED: MIGraphX meets tolerance

     ✅ cadene-dpn92_1: PASSED: MIGraphX meets tolerance

     ✅ cadene-resnext101_1: PASSED: MIGraphX meets tolerance

     ✅ dlrm-criteoterabyte: PASSED: MIGraphX meets tolerance

     ✅ agentmodel: PASSED: MIGraphX meets tolerance

     ✅ unet: PASSED: MIGraphX meets tolerance

     ✅ resnet50v1: PASSED: MIGraphX meets tolerance

     ✅ bert_base_cased_fp16: PASSED: MIGraphX meets tolerance

     🔴 bert_large_uncased_fp16: FAILED: MIGraphX is not within tolerance - check verbose output

     ✅ bert_large: PASSED: MIGraphX meets tolerance

     ✅ yolov5s: PASSED: MIGraphX meets tolerance

     ✅ tinyllama: PASSED: MIGraphX meets tolerance

     ✅ vicuna-fastchat: PASSED: MIGraphX meets tolerance

     ✅ whisper-tiny-encoder: PASSED: MIGraphX meets tolerance

     ✅ whisper-tiny-decoder: PASSED: MIGraphX meets tolerance

     ✅ distilgpt2_fp16: PASSED: MIGraphX meets tolerance

@causten merged commit f5df004 into develop Nov 8, 2024
28 of 34 checks passed
@causten deleted the generic_float branch November 8, 2024 17:59
V6ser pushed a commit to V6ser/AMDMIGraphX that referenced this pull request Feb 3, 2025