Improved performance of rz_bv_copy_nbits and rz_bv_set_range #4740

rajRishi22 · 2024-11-25T09:26:46Z

Your checklist for this pull request

I've read the guidelines for contributing to this repository
I made sure to follow the project's coding style
I've documented or updated the documentation of every function and struct this PR changes. If not so I've explained why.
I've added tests that prove my fix is effective or that my feature works (if possible)
I've updated the rizin book with the relevant information (if needed)

Detailed description
This pull request optimizes the performance of the rz_bv_set_range function. The original implementation iterated through bits one at a time, leading to inefficiencies for large ranges. The updated implementation:

Processes aligned chunks of bits using system word size for faster operations.
Dynamically adjusts chunk size for different architectures (e.g., 32-bit, 64-bit).
Handles unaligned prefix and suffix bits separately while optimizing the main loop.
Adds robust boundary validation to ensure correctness.
This change reduces iteration overhead and improves performance while maintaining compatibility and correctness.

Test plan

Verify functionality for small ranges, large ranges, and edge cases (unaligned ranges).
Test on various architectures to confirm portability (32-bit, 64-bit).
Use unit tests to ensure the results match the original functionality.
Check for any regressions with existing test suites.

Closing issues
Partially addresses #4716
...

Rot127

Nice start!
I can imagine we can even optimize the unaligned cases. But let's do this later.

I also changed your PR message. Because it closes #4716 only partially (missing unaligned cases).

You can add test cases in test/unit/test_bitvector.c.
Once everything is implemented and passes, we can run the Travis CI to test it on big endian machines.

librz/util/bitvector.c

wargio

missing implementation of rz_bv_get_chunk

…ned long to ut32

wargio · 2024-11-26T08:37:28Z

librz/util/bitvector.c

+		first_word &= ~(0xFFFFFFFF >> bit_offset); // Clear the upper bits
+		second_word &= (0xFFFFFFFF >> (32 - bit_offset)); // Clear the lower bits


Suggested change

first_word &= ~(0xFFFFFFFF >> bit_offset); // Clear the upper bits

second_word &= (0xFFFFFFFF >> (32 - bit_offset)); // Clear the lower bits

first_word &= ~(UT32_MAX >> bit_offset); // Clear the upper bits

second_word &= (UT32_MAX >> (32 - bit_offset)); // Clear the lower bits

wargio

You are touching something very critical within rizin, so please provide benchmark and C test for correctness and edge cases.

rajRishi22 added 3 commits November 25, 2024 12:48

Improve performance of rz_bv_copy_nbits

fbb8aca

Added limits.h as header

4852cf9

Improved performance of rz_bv_set_range

2701028

rajRishi22 requested review from ret2libc and thestr4ng3r as code owners November 25, 2024 09:26

github-actions bot added the RzUtil label Nov 25, 2024

Rot127 requested changes Nov 25, 2024

View reviewed changes

rajRishi22 added 3 commits November 25, 2024 23:35

Formatted code using clang-format-16

c84078d

Resolved issues from 1st review

cde2cff

Merge branch 'rz_bv_performance' into rz_bv_copy

6c2a272

Rot127 requested changes Nov 25, 2024

View reviewed changes

librz/util/bitvector.c Outdated Show resolved Hide resolved

librz/util/bitvector.c Outdated Show resolved Hide resolved

librz/util/bitvector.c Outdated Show resolved Hide resolved

rajRishi22 added 2 commits November 26, 2024 01:06

Fixed edge cases and added reference as per review2

d0f3b82

Added testcases for rz_bv_copy function

ff043dd

rajRishi22 requested review from wargio and kazarmy as code owners November 25, 2024 19:48

github-actions bot added the rz-test label Nov 25, 2024

rajRishi22 added 2 commits November 26, 2024 01:40

Fixed undefined variables

7920458

Reverted back to old test cases

4becce1

github-actions bot removed the rz-test label Nov 25, 2024