- Mar 01, 2012
James Molloy authored
Fix a codegen fault in which log2 or exp2 could be dead-code eliminated even though they could have side effects. Only allow log2/exp2 to be converted to an intrinsic if they are declared "readnone". llvm-svn: 151807
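For context, a minimal sketch of why these calls can have side effects, assuming a typical libm that reports range errors through errno (the snippet and its expectations are illustrative, not part of the commit):

```cpp
#include <cerrno>
#include <cmath>
#include <cstdio>

int main() {
    errno = 0;
    double r = std::exp2(1.0e6);  // overflows to +inf
    // Most libms set errno to ERANGE here, so deleting the call would be
    // observable; only a "readnone" declaration rules this out.
    std::printf("r = %g, errno = %d\n", r, errno);
    return 0;
}
```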
Jakob Stoklund Olesen authored
Simply treat bundles as instructions. Spill code is inserted between bundles, never inside a bundle. Rewrite all operands in a bundle at once. Don't attempt memory operand folding inside bundles. llvm-svn: 151787
Jakob Stoklund Olesen authored
This allows the function to be inlined, and makes it suitable for use in getInstructionIndex(). Also provide a const version. C++ is great for touch typing practice. llvm-svn: 151782
Lang Hames authored
While we're at it - don't copy vreg implicit operands while rematerializing. This fixes PR12138. llvm-svn: 151779
- Feb 29, 2012
Benjamin Kramer authored
LegalizeIntegerTypes: Reorder operations in the "big shift by small amount" optimization, making the lives of later passes easier. llvm-svn: 151722
Jakob Stoklund Olesen authored
This function does more or less the same as MI::readsWritesVirtualRegister(), but it supports bundles as well. It also determines if any constraint requires reading and writing operands to use the same register. Most clients want to know. Use the more modern MO.readsReg() instead of trying to sort out undefs and partial redefines. Stop supporting the extra full <imp-def> operand as an alternative to <def,undef> sub-register defines. llvm-svn: 151690
Jakob Stoklund Olesen authored
Extract a base class and provide four specific sub-classes for iterating over const/non-const bundles/instructions. This eliminates the mystery bool constructor argument. llvm-svn: 151684
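A minimal sketch of that refactoring pattern, with hypothetical names rather than the actual LLVM classes: the bool moves into a protected base-class constructor and each named sub-class pins it down, so call sites no longer pass an unexplained true or false:

```cpp
struct MachineInstr {};  // stand-in type for the sketch

class MIOperandIterBase {
protected:
    MIOperandIterBase(MachineInstr *MI, bool WholeBundle)
        : MI(MI), WholeBundle(WholeBundle) {}
    MachineInstr *MI;
    bool WholeBundle;  // the former "mystery bool", now hidden from callers
};

// Intention-revealing sub-classes replace Iter(MI, /*???*/true) at call sites.
struct MIOperandIter : MIOperandIterBase {
    explicit MIOperandIter(MachineInstr *MI) : MIOperandIterBase(MI, false) {}
};
struct MIBundleOperandIter : MIOperandIterBase {
    explicit MIBundleOperandIter(MachineInstr *MI) : MIOperandIterBase(MI, true) {}
};
// ...const variants complete the four specific sub-classes.
```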
- Feb 28, 2012
Lang Hames authored
These methods are no longer needed now that LinearScan has gone away. (Contains tweaks to trivialSpillEverywhere to enable the removal of getNewVRegs.) llvm-svn: 151658
Evan Cheng authored
llvm-svn: 151645
Benjamin Kramer authored
llvm-svn: 151644
Benjamin Kramer authored
To avoid problems with zero shifts when getting the bits that move between words we use a trick: first shift by amount-1, then do another shift by one. When the amount is 0 (and the size is 32) we first shift by 31, then by one, instead of by 32. Also fix a latent bug that emitted the low and high words in the wrong order when shifting right. Fixes PR12113. llvm-svn: 151637
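A minimal sketch of the trick in plain C++, assuming a 64-bit shift expanded into 32-bit words (illustrative, not the LegalizeIntegerTypes code itself): the carry bits would naively be Lo >> (32 - Amt), which shifts by 32 when Amt == 0 and is undefined; shifting by 31 - Amt and then by one keeps every shift amount in the legal 0..31 range.

```cpp
#include <cstdint>
#include <cstdio>

// Expand a 64-bit left shift into 32-bit word operations, assuming 0 <= Amt < 32.
static uint64_t shl64_via_words(uint32_t Lo, uint32_t Hi, unsigned Amt) {
    // Equivalent to Lo >> (32 - Amt), but safe for Amt == 0: shift 31, then 1.
    uint32_t Carry = (Lo >> (31 - Amt)) >> 1;
    uint32_t NewHi = (Hi << Amt) | Carry;  // high word receives the carried bits
    uint32_t NewLo = Lo << Amt;
    return ((uint64_t)NewHi << 32) | NewLo;
}

int main() {
    // 0x80000001 << 1 carries the top bit into the high word: 0x100000002.
    std::printf("%llx\n", (unsigned long long)shl64_via_words(0x80000001u, 0, 1));
    return 0;
}
```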
Daniel Dunbar authored
Revert r151623 "Some ARM implementaions, e.g. A-series, does return stack prediction. ...", as it is breaking the Clang build during the Compiler-RT part. llvm-svn: 151630
Nadav Rotem authored
llvm-svn: 151627
Nadav Rotem authored
When the GEP index is a vector of pointers, the code that calculated the size of the element started from the vector type, and not the contained pointer type. As a result, instead of looking at the data element pointed to by the vector, this code used the size of the vector. This works for 32-bit members (on 32-bit systems), but not for other types. Added code to peel the vector type and added a test. llvm-svn: 151626
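A conceptual illustration in plain C++ of why starting from the vector type is wrong (the types are stand-ins, not LLVM IR): each lane of a vector of pointers steps over one pointed-to element, not over the whole vector.

```cpp
#include <cstdio>

int main() {
    using PtrVec4 = double *[4];  // stand-in for an IR <4 x double*> vector
    // Starting from the vector type gives the size of all four pointers...
    std::printf("size of vector of pointers: %zu bytes\n", sizeof(PtrVec4)); // 32 on x86_64
    // ...but each lane must step over one pointed-to element:
    std::printf("size of pointee element:    %zu bytes\n", sizeof(double));  // 8
    return 0;
}
```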
Evan Cheng authored
Some ARM implementations, e.g. the A-series, do return stack prediction: the processor keeps a return address stack (RAS) which stores the address and the instruction execution state of the instruction after a function-call type branch instruction. Calling a "noreturn" function with normal call instructions (e.g. bl) can corrupt the RAS and cause 100% return misprediction, so LLVM should use an unconditional branch instead, i.e. "mov lr, pc; b _foo". The "mov lr, pc" is issued in order to get a proper backtrace. rdar://8979299 llvm-svn: 151623
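A hedged source-level sketch of the pattern (the callee name is hypothetical): because control never returns, a bl to a noreturn function pushes a return address the RAS will only mispredict on later.

```cpp
#include <cstdlib>

[[noreturn]] void fatal_error() { std::abort(); }  // hypothetical noreturn callee

void check(bool ok) {
    if (!ok)
        fatal_error();  // a candidate for "mov lr, pc; b fatal_error" instead of "bl"
}

int main() { check(true); return 0; }
```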
Jakob Stoklund Olesen authored
Don't attempt to extend physreg live ranges across calls. <rdar://problem/10942095> llvm-svn: 151610
Jakob Stoklund Olesen authored
llvm-svn: 151607
Chad Rosier authored
llvm-svn: 151599
- Feb 27, 2012
Evan Cheng authored
%S5<def> = COPY %S0<kill> First clear def map of Q1, etc. No small test case available. llvm-svn: 151574
Jakob Stoklund Olesen authored
After the SlotIndex slot names were updated, it is possible to apply stricter checks to live intervals. Also treat bundles as bags of operands when checking live intervals. llvm-svn: 151531
- Feb 25, 2012
Lang Hames authored
uses of the vreg, since the old kills may no longer be valid. This was causing -verify-machineinstrs to complain about uses after kills, and could potentially have been causing subtle register allocation issues, but I haven't come across a test case yet. llvm-svn: 151425
Lang Hames authored
llvm-svn: 151417
- Feb 24, 2012
Jakob Stoklund Olesen authored
llvm-svn: 151396
Jakob Stoklund Olesen authored
This will limit all register classes to N registers in order to stress test register allocation. llvm-svn: 151379
Hal Finkel authored
This is necessary to support the existing PPC lowering code for indirect calls. Fixes PR12071. llvm-svn: 151373
Benjamin Kramer authored
llvm-svn: 151364
Nick Lewycky authored
llvm-svn: 151355
Andrew Trick authored
llvm-svn: 151348
Pete Cooper authored
Turn AVX insert intrinsic calls into INSERT_SUBVECTOR DAG nodes and remove duplicate patterns for selecting the intrinsics. llvm-svn: 151342
Eric Christopher authored
variable declaration as an argument because we want that address anyhow for our debug information. This seems to fix rdar://9965111; at least we have more debug information than before, and from reading the assembly it appears to be the correct location. llvm-svn: 151335
Eric Christopher authored
llvm-svn: 151334
Bill Wendling authored
asm. <rdar://problem/10106006> llvm-svn: 151303
- Feb 23, 2012
Benjamin Kramer authored
llvm-svn: 151274
Benjamin Kramer authored
Rename it to LiveRegs to make it more clear what's stored inside. llvm-svn: 151273
Benjamin Kramer authored
Assuming that a single std::set node adds 3 control words, a bitvector can store (3*8+4)*8 = 224 registers in the memory allocated for a single std::set element (x86_64). Also, we don't have to call malloc for every register added. llvm-svn: 151269
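The arithmetic, spelled out in a self-contained sketch (the three-control-word node layout is the commit's assumption for x86_64):

```cpp
#include <bitset>
#include <cstddef>
#include <cstdio>

int main() {
    // One std::set<unsigned> node: ~3 control words plus the 4-byte key,
    // i.e. 3*8 + 4 = 28 bytes on x86_64; 28 bytes hold 28*8 = 224 bits.
    constexpr std::size_t NodeBytes = 3 * sizeof(void *) + sizeof(unsigned);
    std::printf("bytes per node: %zu, registers as bits: %zu\n",
                NodeBytes, NodeBytes * 8);

    // A fixed-size bit vector for the whole register file avoids any
    // per-insert allocation.
    std::bitset<224> LiveRegs;
    LiveRegs.set(42);  // "insert" register 42
    return 0;
}
```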
Jakob Stoklund Olesen authored
Before register allocation, instructions can be moved across calls in order to reduce register pressure. After register allocation, we don't gain a lot by moving callee-saved defs across calls. In fact, since the scheduler doesn't have a good idea how registers are used in the callee, it can't really make good scheduling decisions. This changes the schedule in two ways:
1. Latencies to call uses and defs are no longer accounted for, causing some random shuffling around calls. This isn't really a problem since those uses and defs are inaccurate proxies for what happens inside the callee. They don't represent registers used by the call instruction itself.
2. Instructions are no longer moved across calls. This didn't happen very often, and the scheduling decision was made on dubious information anyway.
As with any scheduling change, benchmark numbers shift around a bit, but there is no positive or negative trend from this change. This makes the post-ra scheduler 5% faster for ARM targets. The secret motivation for this patch is the introduction of register mask operands representing call clobbers. The most efficient way of handling regmasks in ScheduleDAGInstrs is to model them as barriers for physreg live ranges, but not for virtreg live ranges. That's fine pre-ra, but post-ra it would have the same effect as this patch. llvm-svn: 151265
Benjamin Kramer authored
llvm-svn: 151252
Anton Korobeynikov authored
of instantiated C++ templates. Patch by Kristof Beyls! llvm-svn: 151250
Eric Christopher authored
llvm-svn: 151235
Eric Christopher authored
llvm-svn: 151234