Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: add ListView equal #6969

Open
wants to merge 2 commits into
base: main
Choose a base branch
from
Open

feat: add ListView equal #6969

wants to merge 2 commits into from

Conversation

Kikkon
Copy link
Contributor

@Kikkon Kikkon commented Jan 12, 2025

Which issue does this PR close?

Closes #5501 .

Rationale for this change

What changes are included in this PR?

Add GenericListViewArray equal

Are there any user-facing changes?

@Kikkon Kikkon marked this pull request as ready for review January 13, 2025 15:14
Comment on lines +47 to +59
if lhs_size != rhs_size {
return false;
}

// check if null
if let (Some(lhs_null), Some(rhs_null)) = (lhs_nulls, rhs_nulls) {
if lhs_null.is_null(lhs_pos) != rhs_null.is_null(rhs_pos) {
return false;
}
if lhs_null.is_null(lhs_pos) {
continue;
}
}
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I was taking a very close look at this code and I noticed two things:

Firstly, I believe having the size check before the nullability check can cause a bug in the following case:

#[test]
fn test_chuke() {
    let data = Arc::new(Int32Array::from(vec![1]));
    let field = Arc::new(Field::new("chuke", DataType::Int32, true));
    let a = ListViewArray::try_new(
        field.clone(),
        vec![0, 0].into(), // offsets
        vec![1, 1].into(), // sizes
        data.clone(),
        Some(vec![false, true].into()),
    )
    .unwrap();
    let b = ListViewArray::try_new(
        field.clone(),
        vec![0, 0].into(), // offsets
        vec![0, 1].into(), // sizes
        data.clone(),
        Some(vec![false, true].into()),
    )
    .unwrap();
    test_equal(&a, &b, true);
}

a and b are valid ListViewArrays, where their sizes buffers differs in their first element. This shouldn't matter since the value is null anyway, but this code will check this size first and falsely report as being not equal even though both arrays have null for that slot. Hence test will fail.

Secondly, I wonder if it's more efficient to pull the if let (Some(lhs_null), Some(rhs_null)) = (lhs_nulls, rhs_nulls) outside of the loop, considering this wouldn't change per iteration?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
arrow Changes to the arrow crate
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Add ListViewArray and LargeListViewArray implementation and layout and basic construction
2 participants