Enhance performance of `rz_bv_copy_nbits` and `rz_bv_set_range` #4716

Rot127 · 2024-11-13T18:49:52Z

Is your feature request related to a problem? Please describe.

Performance could be better, and we would save energy, which would make us better humans.

Describe the solution you'd like

Both functions do something like this:

for (ut32 i = 0; i < nbit; ++i) {
   bool c = rz_bv_get(src, src_start_pos + i); // Not in case of rz_bv_set_range()
   rz_bv_set(dst, dst_start_pos + i, c);
}

At least for small bit vectors (<= 64 bits) we could do this with a shift, an AND and an OR.
This would save a bunch of function calls, along with the shifts and ORs each of those calls performs.
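A minimal sketch of the small-vector case for `rz_bv_set_range`, assuming the whole vector fits in a single 64-bit word and that bit 0 is the least significant bit. `range_mask64` and `set_range64` are hypothetical names for illustration, not Rizin functions, and they take a start position plus a bit count, which may differ from the real signature.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical helper (not a Rizin API): mask with `nbits` ones starting
 * at bit `pos`, where bit 0 is the least significant bit. */
static uint64_t range_mask64(uint32_t pos, uint32_t nbits) {
	assert(nbits <= 64 && pos <= 64 - nbits);
	if (nbits == 0) {
		return 0;
	}
	/* Shifting a 64-bit value by 64 is undefined in C, so the run of
	 * ones is built with a right shift of an all-ones word instead. */
	return (UINT64_MAX >> (64 - nbits)) << pos;
}

/* Hypothetical stand-in for the per-bit loop: sets or clears the whole
 * range with one OR or one AND and returns the updated word. */
static uint64_t set_range64(uint64_t word, uint32_t pos, uint32_t nbits, bool bit) {
	uint64_t mask = range_mask64(pos, nbits);
	return bit ? (word | mask) : (word & ~mask);
}
```

For example, `set_range64(0, 4, 8, true)` sets bits 4 to 11 in one step instead of eight separate calls.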

For larger bit vectors we could check whether the range is large enough (e.g. 8 bits or more).
If it is, set the leading bits one by one until the remainder is aligned to a byte boundary (for `RzBitVector->large_a`) and assign the rest via `memcpy`.
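A rough sketch of the large-vector idea on plain byte buffers. Two assumptions here: bit `i` is stored in byte `i / 8` at bit `i % 8` (the actual `large_a` layout may differ), and the buffers do not overlap. Note that `memcpy` only helps when the source and destination offsets sit at the same position within a byte; otherwise this sketch falls back to the per-bit loop. `get_bit`, `set_bit` and `copy_bits` are hypothetical names, not Rizin functions.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>
#include <string.h>

/* Assumed layout: bit i lives in byte i / 8 at bit position i % 8. */
static bool get_bit(const uint8_t *buf, uint32_t pos) {
	return (buf[pos / 8] >> (pos % 8)) & 1u;
}

static void set_bit(uint8_t *buf, uint32_t pos, bool bit) {
	uint8_t m = (uint8_t)(1u << (pos % 8));
	if (bit) {
		buf[pos / 8] |= m;
	} else {
		buf[pos / 8] &= (uint8_t)~m;
	}
}

/* Hypothetical copy routine: the caller guarantees that both ranges fit
 * inside their buffers and that the buffers do not overlap. */
static void copy_bits(const uint8_t *src, uint32_t src_pos,
	uint8_t *dst, uint32_t dst_pos, uint32_t nbits) {
	assert(src && dst);
	uint32_t i = 0;
	/* Fast path only if both offsets share the same position in a byte. */
	if (src_pos % 8 == dst_pos % 8) {
		/* Head: single bits until the next byte boundary. */
		for (; i < nbits && (dst_pos + i) % 8 != 0; i++) {
			set_bit(dst, dst_pos + i, get_bit(src, src_pos + i));
		}
		/* Body: copy all whole bytes at once. */
		uint32_t nbytes = (nbits - i) / 8;
		if (nbytes > 0) {
			memcpy(dst + (dst_pos + i) / 8, src + (src_pos + i) / 8, nbytes);
			i += nbytes * 8;
		}
	}
	/* Tail, or the whole range when the offsets are not congruent. */
	for (; i < nbits; i++) {
		set_bit(dst, dst_pos + i, get_bit(src, src_pos + i));
	}
}
```

For `rz_bv_set_range` the same head/body/tail split would work with `memset` in place of `memcpy`, and there the alignment restriction between two offsets does not apply.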

Describe alternatives you've considered

None really.

Additional context

This would give us quite some computation back. In my use case this loop consumes around 2.3%.
I also think this improvement is required before we merge resource-intensive analysis algorithms based on it.

Rot127 · 2024-11-23T14:56:15Z

Possible implementation: https://graphics.stanford.edu/~seander/bithacks.html#MaskedMerge
It is simple for `ut64` values, but would need to be generalized to everything larger than 64 bits.
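For illustration, the masked merge from that page (`a ^ ((a ^ b) & mask)`) could be applied to the copy case roughly like this. It assumes both vectors fit in one 64-bit word with bit 0 as the least significant bit; `copy_nbits64` is a hypothetical name, not a Rizin function.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical single-word variant of the copy loop: copies `nbits` bits
 * of `src` starting at `src_pos` into `dst` starting at `dst_pos` and
 * returns the updated destination word. */
static uint64_t copy_nbits64(uint64_t src, uint32_t src_pos,
	uint64_t dst, uint32_t dst_pos, uint32_t nbits) {
	assert(nbits <= 64 && src_pos <= 64 - nbits && dst_pos <= 64 - nbits);
	if (nbits == 0) {
		return dst;
	}
	/* Ones exactly over the destination range (avoids a shift by 64). */
	uint64_t mask = (UINT64_MAX >> (64 - nbits)) << dst_pos;
	/* Move the source bits so they line up with the destination range. */
	uint64_t aligned = (src >> src_pos) << dst_pos;
	/* Masked merge: take `aligned` where mask is 1, keep `dst` elsewhere. */
	return dst ^ ((dst ^ aligned) & mask);
}
```

For example, `copy_nbits64(0xABCD, 4, 0, 0, 8)` yields `0xBC`. A version for vectors wider than 64 bits would have to apply the same merge word by word and handle ranges that straddle word boundaries, which this sketch does not attempt.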

Rot127 added the enhancement, good first issue, RzUtil and performance labels on Nov 13, 2024

This was referenced on Nov 25, 2024:

Rz bv copy #4739 (Closed)

Improved performance of rz_bv_copy_nbits and rz_bv_set_range #4740 (Open)
