Commits · 1863582863867a5cb888e7a9f35f414df9db7958 · Lorenzo Albano / LLVM bpEVL

Mar 03, 2014
- Re-apply r202551, which introduced new PBQP solver. · 18635828
  Lang Hames authored Mar 03, 2014
```
llvm-svn: 202735
```
  18635828
Mar 02, 2014
- [C++11] Replace llvm::next and llvm::prior with std::next and std::prev. · b6d0bd48
  Benjamin Kramer authored Mar 02, 2014
```
Remove the old functions.

llvm-svn: 202636
```
  b6d0bd48
Feb 28, 2014

Jumped the gun with r202551 and broke some bots that weren't yet C++11ified. · c083578a
Lang Hames authored Feb 28, 2014
```
Reverting until the C++11 switch is complete.

llvm-svn: 202554
```
c083578a

New PBQP solver, and updates to the PBQP graph. · 525a2123

Lang Hames authored Feb 28, 2014

The previous PBQP solver was very robust but consumed a lot of memory,
performed a lot of redundant computation, and contained some unnecessarily tight
coupling that prevented experimentation with novel solution techniques. This new
solver is an attempt to address these shortcomings.

Important/interesting changes:

1) The domain-independent PBQP solver class, HeuristicSolverImpl, is gone.
It is replaced by a register allocation specific solver, PBQP::RegAlloc::Solver
(see RegAllocSolver.h).

The optimal reduction rules and the backpropagation algorithm have been extracted
into stand-alone functions (see ReductionRules.h), which can be used to build
domain specific PBQP solvers. This provides many more opportunities for
domain-specific knowledge to inform the PBQP solvers' decisions. In theory this
should allow us to generate better solutions. In practice, we can at least test
out ideas now.

As a side benefit, I believe the new solver is more readable than the old one.

2) The solver type is now a template parameter of the PBQP graph.

This allows the graph to notify the solver of any modifications made (e.g. by
domain independent rules) without the overhead of a virtual call. It also allows
the solver to supply policy information to the graph (see below).

3) Significantly reduced memory overhead.

Memory management policy is now an explicit property of the PBQP graph (via
the CostAllocator typedef on the graph's solver template argument). Because PBQP
graphs for register allocation tend to contain many redundant instances of
single values (E.g. the value representing an interference constraint between
GPRs), the new RASolver class uses a uniquing scheme. This massively reduces
memory consumption for large register allocation problems. For example, looking
at the largest interference graph in each of the SPEC2006 benchmarks (the
largest graph will always set the memory consumption high-water mark for PBQP),
the average memory reduction for the PBQP costs was 400x. That's times, not
percent. The highest was 1400x. Yikes. So - this is fixed.

"PBQP: No longer feasting upon every last byte of your RAM".

Minor details:

- Fully C++11'd. Never copy-construct another vector/matrix!

- Cute tricks with cost metadata: Metadata that is derived solely from cost
matrices/vectors is attached directly to the cost instances themselves. That way
if you unique the costs you never have to recompute the metadata. 400x less
memory means 400x less cost metadata (re)computation.

Special thanks to Arnaud de Grandmaison, who has been the source of much
encouragement, and of many very useful test cases.

This new solver forms the basis for future work, of which there's plenty to do.
I will be adding TODO notes shortly.

- Lang.

llvm-svn: 202551

525a2123

Feb 24, 2014

Replace the F_Binary flag with a F_Text one. · 90c7f1cc

Rafael Espindola authored Feb 24, 2014

After this I will set the default back to F_None. The advantage is that
before this patch forgetting to set F_Binary would corrupt a file on windows.
Forgetting to set F_Text produces one that cannot be read in notepad, which
is a better failure mode :-)

llvm-svn: 202052

90c7f1cc

Don't make F_None the default. · 7dbcdd08

Rafael Espindola authored Feb 24, 2014

This will make it easier to switch the default to being binary files.

llvm-svn: 202042

7dbcdd08

Dec 14, 2013

[block-freq] Refactor LiveInterals::getSpillWeight to use the new... · 9f49d744

Michael Gottesman authored Dec 14, 2013

[block-freq] Refactor LiveInterals::getSpillWeight to use the new MachineBlockFrequencyInfo methods.

This is slightly more interesting than the previous batch of changes.
Specifically:

1. We refactor getSpillWeight to take a MachineBlockFrequencyInfo (MBFI)
object. This enables us to completely encapsulate the actual manner we
use the MachineBlockFrequencyInfo to get our spill weights. This yields
cleaner code since one does not need to fetch the actual block frequency
before getting the spill weight if all one wants it the spill weight. It
also gives us access to entry frequency which we need for our
computation.

2. Instead of having getSpillWeight take a MachineBasicBlock (as one
might think) to look up the block frequency via the MBFI object, we
instead take in a MachineInstr object. The reason for this is that the
method is supposed to return the spill weight for an instruction
according to the comments around the function.

llvm-svn: 197296

9f49d744

Nov 11, 2013
- CalcSpillWeights: give a better describing name to calculateSpillWeights · ea3ac161
  Arnaud A. de Grandmaison authored Nov 11, 2013
```
Besides, this relates it more obviously to the VirtRegAuxInfo::calculateSpillWeightAndHint.

No functionnal change.

llvm-svn: 194404
```
  ea3ac161
Nov 10, 2013

CalculateSpillWeights does not need to be a pass · 760c1e0b

Arnaud A. de Grandmaison authored Nov 10, 2013

Based on discussions with Lang Hames and Jakob Stoklund Olesen at the hacker's lab, and in the light of upcoming work on the PBQP register allocator, it was though that CalcSpillWeights does not need to be a pass. This change will enable to customize / tune the spill weight computation depending on the allocator.

Update the documentation style while there.

No functionnal change.

llvm-svn: 194356

760c1e0b

Nov 09, 2013

Re-apply r194300 with fixes for warnings. · fb82630a
Lang Hames authored Nov 09, 2013
```
llvm-svn: 194311
```
fb82630a
Revert r194300 which broke the build. · 59886d00
Nick Lewycky authored Nov 09, 2013
```
llvm-svn: 194308
```
59886d00

Rewrite the PBQP graph data structure. · 1662b832

Lang Hames authored Nov 09, 2013

The new graph structure replaces the node and edge linked lists with vectors.
Free lists (well, free vectors) are used for fast insertion/deletion.

The ultimate aim is to make PBQP graphs cheap to clone. The motivation is that
the PBQP solver destructively consumes input graphs while computing a solution,
forcing the graph to be fully reconstructed for each round of PBQP. This
imposes a high cost on large functions, which often require several rounds of
solving/spilling to find a final register allocation. If we can cheaply clone
the PBQP graph and incrementally update it between rounds then hopefully we can
reduce this cost. Further, once we begin pooling matrix/vector values (future
work), we can cache some PBQP solver metadata and share it between cloned
graphs, allowing the PBQP solver to re-use some of the computation done in
earlier rounds.

For now this is just a data structure update. The allocator and solver still
use the graph the same way as before, fully reconstructing it between each
round. I expect no material change from this update, although it may change
the iteration order of the nodes, causing ties in the solver to break in
different directions, and this could perturb the generated allocations
(hopefully in a completely benign way).

Thanks very much to Arnaud Allard de Grandmaison for encouraging me to get back
to work on this, and for a lot of discussion and many useful PBQP test cases.

llvm-svn: 194300

1662b832

Nov 08, 2013

Revert "CalculateSpillWeights does not need to be a pass" · f7a60a8e
Arnaud A. de Grandmaison authored Nov 08, 2013
```
Temporarily revert my previous commit until I understand why it breaks 3 target tests.

llvm-svn: 194272
```
f7a60a8e

CalculateSpillWeights does not need to be a pass · ed812f65

Arnaud A. de Grandmaison authored Nov 08, 2013

Update the documentation style while there.

No functionnal change.

llvm-svn: 194269

ed812f65

Aug 15, 2013

Track new virtual registers by register number. · f9ea8854

Mark Lacey authored Aug 14, 2013

Track new virtual registers by register number, rather than by the live
interval created for them. This is the first step in separating the
creation of new virtual registers and new live intervals.  Eventually
live intervals will be created and populated on demand after the virtual
registers have been created and used in instructions.

llvm-svn: 188434

f9ea8854

Jul 01, 2013
- Make PBQP require/preserve MachineLoopInfo - the spiller requires it. · 7d99d797
  Lang Hames authored Jul 01, 2013
```
llvm-svn: 185378
```
  7d99d797
Jun 17, 2013

Switch spill weights from a basic loop depth estimation to BlockFrequencyInfo. · e2a1d89e

Benjamin Kramer authored Jun 17, 2013

The main advantages here are way better heuristics, taking into account not
just loop depth but also __builtin_expect and other static heuristics and will
eventually learn how to use profile info. Most of the work in this patch is
pushing the MachineBlockFrequencyInfo analysis into the right places.

This is good for a 5% speedup on zlib's deflate (x86_64), there were some very
unfortunate spilling decisions in its hottest loop in longest_match(). Other
benchmarks I tried were mostly neutral.

This changes register allocation in subtle ways, update the tests for it.
2012-02-20-MachineCPBug.ll was deleted as it's very fragile and the instruction
it looked for was gone already (but the FileCheck pattern picked up unrelated
stuff).

llvm-svn: 184105

e2a1d89e

Apr 15, 2013

Replace uses of the deprecated std::auto_ptr with OwningPtr. · b23ea72e

Andy Gibbs authored Apr 15, 2013

This is a rework of the broken parts in r179373 which were subsequently reverted in r179374 due to incompatibility with C++98 compilers.  This version should be ok under C++98.

llvm-svn: 179520

b23ea72e

Apr 12, 2013
- Revert broken pieces of r179373. · dae08512
  Benjamin Kramer authored Apr 12, 2013
```
You can't copy an OwningPtr, and move semantics aren't available in C++98.

llvm-svn: 179374
```
  dae08512
- Replace uses of the deprecated std::auto_ptr with OwningPtr. · 95777550
  Andy Gibbs authored Apr 12, 2013
```
llvm-svn: 179373
```
  95777550
Jan 02, 2013

Move all of the header files which are involved in modelling the LLVM IR · 9fb823bb

Chandler Carruth authored Jan 02, 2013

into their new header subdirectory: include/llvm/IR. This matches the
directory structure of lib, and begins to correct a long standing point
of file layout clutter in LLVM.

There are still more header files to move here, but I wanted to handle
them in separate commits to make tracking what files make sense at each
layer easier.

The only really questionable files here are the target intrinsic
tablegen files. But that's a battle I'd rather not fight today.

I've updated both CMake and Makefile build systems (I think, and my
tests think, but I may have missed something).

I've also re-sorted the includes throughout the project. I'll be
committing updates to Clang, DragonEgg, and Polly momentarily.

llvm-svn: 171366

9fb823bb

Dec 04, 2012

Use MRI::getSimpleHint() instead of getRegAllocPref() in remaining cases. · 1dd82dd3

Jakob Stoklund Olesen authored Dec 04, 2012

Targets can provide multiple hints now, so getRegAllocPref() doesn't
make sense any longer because it only returns one preferred register.
Replace it with getSimpleHint() in the remaining heuristics. This
function only

llvm-svn: 169188

1dd82dd3

Dec 03, 2012

Use the new script to sort the includes of every file under lib. · ed0881b2

Chandler Carruth authored Dec 03, 2012

Sooooo many of these had incorrect or strange main module includes.
I have manually inspected all of these, and fixed the main module
include to be the nearest plausible thing I could find. If you own or
care about any of these source files, I encourage you to take some time
and check that these edits were sensible. I can't have broken anything
(I strictly added headers, and reordered them, never removed), but they
may not be the headers you'd really like to identify as containing the
API being implemented.

Many forward declarations and missing includes were added to a header
files to allow them to parse cleanly when included first. The main
module rule does in fact have its merits. =]

llvm-svn: 169131

ed0881b2

Nov 28, 2012

Make the LiveRegMatrix analysis available to targets. · 26c9d70d

Jakob Stoklund Olesen authored Nov 28, 2012

No functional change, just moved header files.

Targets can inject custom passes between register allocation and
rewriting. This makes it possible to tweak the register allocation
before rewriting, using the full global interference checking available
from LiveRegMatrix.

llvm-svn: 168806

26c9d70d

Revert r168630, r168631, and r168633 as these are causing nightly test failures. · ed119d54
Chad Rosier authored Nov 28, 2012
```
llvm-svn: 168751
```
ed119d54

Nov 27, 2012

Now that the X86 Maximal Stack Alignment Check pass has been removed (i.e., · f8a3a62c

Chad Rosier authored Nov 26, 2012

r168627), we no longer need to call the freezeReservedRegs() function a second
time.  Previously, this pass was conservatively adding the FP to the set of
reserved registers, requiring the second update to the reserved registers.
rdar://12719844

llvm-svn: 168631

f8a3a62c

Oct 29, 2012
- Remove unused typedef. · ee6142c3
  Lang Hames authored Oct 29, 2012
```
llvm-svn: 166910
```
  ee6142c3
Oct 16, 2012
- Remove LIS::isAllocatable() and isReserved() helpers. · cea596ac
  Jakob Stoklund Olesen authored Oct 15, 2012
```
All callers can simply use the corresponding MRI functions.

llvm-svn: 165985
```
  cea596ac
Oct 15, 2012

Switch most getReservedRegs() clients to the MRI equivalent. · c30a9af2

Jakob Stoklund Olesen authored Oct 15, 2012

Using the cached bit vector in MRI avoids comstantly allocating and
recomputing the reserved register bit vector.

llvm-svn: 165983

c30a9af2

Oct 10, 2012

My earlier "fix" for PBQP (see r165201) was incorrect. The real issue was that · 05fee08d

Lang Hames authored Oct 10, 2012

checkRegMaskInterference only initializes the bitmask on the first interference.

This fixes PR14027 and (re)fixes PR13945.

llvm-svn: 165608

05fee08d

Oct 04, 2012
- Fix reg mask slot test, and preserve LiveIntervals and VirtRegMap in the PBQP · 8ce99f29
  Lang Hames authored Oct 04, 2012
```
allocator. Fixes PR13945.

llvm-svn: 165201
```
  8ce99f29
Sep 05, 2012
- Remove unused typedefs gcc4.8 warns about. · 09c8a3dd
  Roman Divacky authored Sep 05, 2012
```
llvm-svn: 163225
```
  09c8a3dd
Aug 22, 2012

Add a getName function to MachineFunction. Use it in places that previously... · a538d831

Craig Topper authored Aug 22, 2012

Add a getName function to MachineFunction. Use it in places that previously did getFunction()->getName(). Remove includes of Function.h that are no longer needed.

llvm-svn: 162347

a538d831

Jun 22, 2012

Remove LiveIntervals::trackingRegUnits(). · b1b3e4aa

Jakob Stoklund Olesen authored Jun 22, 2012

With regunit liveness permanently enabled, this function would always
return true.

Also remove now obsolete code for checking physreg interference.

llvm-svn: 159006

b1b3e4aa

Jun 21, 2012
- Remove spurious typedefs. · 37a1338a
  Jakob Stoklund Olesen authored Jun 20, 2012
```
llvm-svn: 158878
```
  37a1338a
- Remove the RenderMachineFunction HTML output pass. · 1911a020
  Jakob Stoklund Olesen authored Jun 20, 2012
```
I don't think anyone has been using this functionality for a while, and
it is getting in the way of refactoring now.

llvm-svn: 158876
```
  1911a020
- Teach PBQPBuilder::build() about regunit interference. · bfa664ea
  Jakob Stoklund Olesen authored Jun 20, 2012
```
Filter out physreg candidates with regunit interferrence.
Also compute regmask interference more efficiently.

llvm-svn: 158864
```
  bfa664ea
Jun 20, 2012

Avoid iterating with LiveIntervals::iterator. · a1f43dcd

Jakob Stoklund Olesen authored Jun 20, 2012

That is a DenseMap iterator keyed by pointers, so the iteration order is
nondeterministic.

I would like to replace the DenseMap with an IndexedMap which doesn't
allow iteration.

llvm-svn: 158856

a1f43dcd

Jun 09, 2012

Also compute MBB live-in lists in the new rewriter pass. · be336295

Jakob Stoklund Olesen authored Jun 09, 2012

This deduplicates some code from the optimizing register allocators, and
it means that it is now possible to change the register allocators'
solutions simply by editing the VirtRegMap between the register
allocator pass and the rewriter.

llvm-svn: 158249

be336295

Reintroduce VirtRegRewriter. · 1224312f

Jakob Stoklund Olesen authored Jun 08, 2012

OK, not really. We don't want to reintroduce the old rewriter hacks.

This patch extracts virtual register rewriting as a separate pass that
runs after the register allocator. This is possible now that
CodeGen/Passes.cpp can configure the full optimizing register allocator
pipeline.

The rewriter pass uses register assignments in VirtRegMap to rewrite
virtual registers to physical registers, and it inserts kill flags based
on live intervals.

These finalization steps are the same for the optimizing register
allocators: RABasic, RAGreedy, and PBQP.

llvm-svn: 158244

1224312f