Commit 9bcb30d3, authored Oct 27, 2023 by Igor Kirillov, committed by GitHub Oct 27, 2023

[CodeGen] Improve ExpandMemCmp for more efficient handling of non-register-aligned sizes (#69942)

* Enhance the logic of the ExpandMemCmp pass to merge contiguous
  subsequences in LoadSequence, based on the sizes allowed in
  `AllowedTailExpansions`.
* This aims to reduce the number of basic blocks and produce better
  code when memcmp is called with non-register-aligned sizes.
* Enable this feature for AArch64 for memcmp sizes that, modulo 8,
  equal 3, 5, or 6.
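
The effect of the tail merging described above can be illustrated with a small model. This is only a sketch under stated assumptions, not the pass's actual C++ code: the function names `greedy_load_sequence` and `plan_blocks`, the 8-byte maximum load, and the representation of a comparison block as a tuple of load sizes are all hypothetical. Only the tail sizes 3, 5, and 6 come from the commit message.

```python
# Toy model of the idea in this commit; names and structure are hypothetical.

def greedy_load_sequence(size, max_load=8):
    """Split `size` bytes into power-of-two loads, largest first."""
    loads = []
    load = max_load
    while size > 0:
        while load > size:
            load //= 2  # shrink to the largest load that still fits
        loads.append(load)
        size -= load
    return loads


def plan_blocks(size, max_load=8, allowed_tails=frozenset({3, 5, 6})):
    """Group loads into comparison blocks, one tuple of load sizes per block.

    Without merging, every load gets its own block. When the tail
    (size modulo max_load) is in `allowed_tails`, the loads that make up
    the tail are merged into a single block.
    """
    loads = greedy_load_sequence(size, max_load)
    tail = size % max_load
    if tail not in allowed_tails:
        return [(n,) for n in loads]
    full = size // max_load  # number of full-width loads before the tail
    return [(n,) for n in loads[:full]] + [tuple(loads[full:])]


if __name__ == "__main__":
    for size in (3, 7, 11, 14):
        print(size, plan_blocks(size))
```

In this model, an 11-byte memcmp goes from three blocks (8, 2, 1) to two (8, then 2 and 1 together), while a 7-byte memcmp is left unchanged because 7 is not among the allowed tail sizes.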

parent 02ef12dd
