-
Notifications
You must be signed in to change notification settings - Fork 933
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Remove static column vectors from window function tests. #18099
Remove static column vectors from window function tests. #18099
Conversation
Fixes rapidsai#18079. This commit fixes the failures reported in rapidsai#18079, where the use of static column vector objects in the tests causes the use of a CUDA runtime context before it's been initialized, causing the tests to fail with: ``` parallel_for failed: cudaErrorInvalidResourceHandle: invalid resource handle ``` Signed-off-by: MithunR <[email protected]>
e9182bc
to
0a44371
Compare
We've seen static initialization issues like this before, but it's been a while. It's too bad this has bitten us again. |
Can we target this branch at 25.02? We will need this for the 25.02.01 release thanks |
This is a red herring. Please ignore. |
So this pattern looks suspect CUDF_TEST_EXPECT_COLUMNS_EQUAL(*run_rolling(*AGG_COUNT_NON_NULL),
ints_column{{0, 1, 2, 2, 2, 2, 0, 1, 2, 2}, nulls_at({0, 6})}); The This pattern needs to be changed to hold onto the data for the life of the call and check auto result = run_rolling(*AGG_COUNT_NON_NULL);
auto ints = ints_column{{0, 1, 2, 2, 2, 2, 0, 1, 2, 2}, nulls_at({0, 6})};
CUDF_TEST_EXPECT_COLUMNS_EQUAL(*result, ints); The previous |
Actually, no, I think I misunderstood what you were pointing out.
I remember this now. This should be alright. The temporary object ( In this case, the destructor wouldn't be called until I don't think the |
/ok to test |
The I've kicked off the build, to see how it fares. If this change looks good to the reviewers, and CI passes, we should be alright. |
I could use advice to address @pxLi's request: Does it make sense to retarget this to |
Since this is a fix to the test only, I don't see a reason to backport this to 25.02. |
/ok to test |
This test is failing in our gb100 tests. We are targeting gb100 for 25.02.1. Hence the request to backport to 25.02. |
It doesn't look like the
The other checks seem to have gone through. @davidwendt, thank you for re-kicking the tests. I was wondering if you might have another look at this fix. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
Any objection to our merging this to 25.04?
|
/merge |
54e740a
into
rapidsai:branch-25.04
Thank you for the reviews, all. I've merged this change. |
Fixes #18079.
This commit fixes the failures reported in #18079, where the use of static column vector objects in the tests causes the use of a CUDA runtime context before it's been initialized, causing the tests to fail with:
The solution is to switch the static column vectors to runtime, as a member of the test utility class
rolling_runner
.