ARM: align loops to 4 bytes on Cortex-M3 and Cortex-M4. (c15d47bb) · Commits · Lorenzo Albano / LLVM bpEVL

Commit c15d47bb authored Sep 13, 2018 by Tim Northover

ARM: align loops to 4 bytes on Cortex-M3 and Cortex-M4.

The Technical Reference Manuals for these two CPUs state that branching
to an unaligned 32-bit instruction incurs an extra pipeline reload
penalty. That's bad.

This also enables the optimization at -Os since it costs on average one
byte per loop in return for 1 cycle per iteration, which is pretty good
going.

llvm-svn: 342127

parent 95ac65bc

Hide whitespace changes

Inline Side-by-side

Please register or to comment