Improve UnionArray logical_nulls tests #6781

gstvg · 2024-11-23T21:29:36Z

Which issue does this PR close?

Follow-up on #6303

Rationale for this change

As noted by @wiedld on #6303 (comment), the existing tests coverage is quite fragile relative to the target architecture and enabled CPU features, leaving most logic untested on ARM, for example.

What changes are included in this PR?

Instead of calling only the public Array::logical_nulls, tests of specific strategies also calls the correspondent private method directly. To do so, part of logical_nulls method, which builds the arguments to be passed to specific implementations, is made into a new private method UnionArray::fields_logical_nulls, and then used by both logical_nulls and the tests.

Are there any user-facing changes?

No, just tests

alamb

Thank you @gstvg

alamb · 2024-12-03T17:09:27Z

arrow-array/src/array/union_array.rs

+        assert_eq!(expected, array.logical_nulls().unwrap().into_inner());
+        assert_eq!(
+            expected,
+            array.mask_sparse_all_with_nulls_skip_one(array.fields_logical_nulls())


What is this mask_sparse_all_with_nulls_skip_one function? This formulation is somewhat confusing to me (without delving deeply into the code). Is there any way to make this more "obvious" 🤔

Each strategy for computing the logical_nulls are outlined here. The code determines a cost estimate to decide which strategy to use. Then it calls a helper function per each strategy => and that did the computation.

With the way the tests were written before, it always got the same cost estimate on certain architecture. For example, tests running on ARM would always get the same cost estimate and therefore always pick the same strategy (UnionArray::gather_nulls helper function). So even though this test is named test_sparse_union_logical_nulls_mask_all_nulls_skip_one, it was not actually testing that strategy and helper function.

With this change, the tests are directly calling each helper function.

In the example above, the test named test_sparse_union_logical_nulls_mask_all_nulls_skip_one has been changed to directly calling the helper method UnionArray::mask_sparse_all_with_nulls_skip_one.

wiedld · 2024-12-04T04:37:19Z

Just came back from FTO. I'll pick up this review tmrw. Thanks @gstvg !

wiedld

Very nice. ❤️

The possible strategies are all tested:

dense => gather nulls
sparse => all options of the SparseStrategy are tested

I didn't bother doing a branch coverage analysis, since this is already a big gain. Thank you!

alamb

Thank you for the PR @gstvg and the review @wiedld

test: union array logical nulls directly call wanted method

b4d01cd

github-actions bot added the arrow Changes to the arrow crate label Nov 23, 2024

gstvg mentioned this pull request Nov 23, 2024

Implement UnionArray logical_nulls #6303

Merged

alamb reviewed Dec 3, 2024

View reviewed changes

wiedld approved these changes Dec 6, 2024

View reviewed changes

alamb approved these changes Dec 6, 2024

View reviewed changes

alamb merged commit 63ad87a into apache:main Dec 6, 2024
26 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve UnionArray logical_nulls tests #6781

Improve UnionArray logical_nulls tests #6781

gstvg commented Nov 23, 2024

alamb left a comment

alamb Dec 3, 2024

wiedld Dec 6, 2024

wiedld commented Dec 4, 2024

wiedld left a comment

alamb left a comment

Improve UnionArray logical_nulls tests #6781

Improve UnionArray logical_nulls tests #6781

Conversation

gstvg commented Nov 23, 2024

Which issue does this PR close?

Rationale for this change

What changes are included in this PR?

Are there any user-facing changes?

alamb left a comment

Choose a reason for hiding this comment

alamb Dec 3, 2024

Choose a reason for hiding this comment

wiedld Dec 6, 2024

Choose a reason for hiding this comment

wiedld commented Dec 4, 2024

wiedld left a comment

Choose a reason for hiding this comment

alamb left a comment

Choose a reason for hiding this comment