Fix the worse case performance of ICF. (1b6bab01) · Commits · Roger Ferrer / llvm-epi

Commit 1b6bab01 authored Dec 02, 2016 by Rui Ueyama

Fix the worse case performance of ICF.

r288228 seems to have regressed ICF performance in some cases in which
a lot of sections are actually mergeable. In r288228, I made a change
to create a Range object for each new color group. So every time we
split a group, we allocated and added a new group to a list of groups.

This patch essentially reverted r288228 with an improvement to
parallelize the original algorithm.

Now the ICF main loop is entirely allocation-free and lock-free.

Just like pre-r288228, we search for group boundaries by linear scan
instead of managing the information using Range class. r288228 was
neutral in performance-wise, and so is this patch.

I confirmed that this produces the exact same result as before
using chromium and clang as tests.

llvm-svn: 288480

parent 491b799a

Hide whitespace changes

Inline Side-by-side

Please register or to comment