Commits · 5bf1d0093b8d28c86ea079be89c0f278fffcfe1c · Roger Ferrer / llvm-epi-0.8

Oct 15, 2013

Remove some dead code. (DarwinGDBCompat was retired in r189903). · 5bf1d009
Adrian Prantl authored Oct 15, 2013
```
llvm-svn: 192731
```
5bf1d009
Guard the debug temp variable with NDEBUG to avoid warning/error with NDEBUG defined. · eb4a6e7c
Pekka Jaaskelainen authored Oct 15, 2013
```
llvm-svn: 192709
```
eb4a6e7c
Do not assert when trying to add a meta data operand with · eb08e2e0
Pekka Jaaskelainen authored Oct 15, 2013
```
MachineInstr::addOperand().

llvm-svn: 192707
```
eb08e2e0

Improve on r192635, ExeDepsFix for avx, and add a test case. · 3a99693c

Andrew Trick authored Oct 15, 2013

rdar:15221834 False AVX register dependencies cause 5x slowdown on
flops-5/6 and significant slowdown on several others.

This was blocking the switch to MI-Sched.

llvm-svn: 192669

3a99693c

Fix the ExecutionDepsFix pass to handle AVX instructions. · b6d56be6

Andrew Trick authored Oct 14, 2013

This pass is needed to break false dependencies. Without it, unlucky
register assignment can result in wild (5x) swings in
performance. This pass was trying to handle AVX but not getting it
right. AVX doesn't have partial register defs, it has unused register
reads in which the high bits of a source operand are copied into the
unused bits of the dest.

Fixing this requires conservative liveness analysis. This is awkard
because the pass already has its own pseudo-liveness. However, proper
liveness is expensive, and we would like to use a generic utility to
compute it. The fix only invokes liveness on-demand. It is rare to
detect a case that needs undef-read dependence breaking, but when it
happens, it can be needed many times within a very large block.

I think the existing heuristic which uses a register window of 16 is
too conservative for loop-carried false dependencies. If the loop is a
reduction. The out-of-order engine may be able to execute several loop
iterations in parallel. However, I'll leave this tuning exercise for
next time.

llvm-svn: 192635

b6d56be6

LiveRegUnits: Use *MBB for consistency and convenience. · e2f7cc4c
Andrew Trick authored Oct 14, 2013
```
llvm-svn: 192634
```
e2f7cc4c

Oct 14, 2013

LiveRegUnits::removeRegsInMask safety. · 3f4d6c65

Andrew Trick authored Oct 14, 2013

Clobbering is exclusive not inclusive on register units.
For liveness, we need to consider all the preserved registers.
e.g. A regmask that clobbers YMM0 may preserve XMM0.
Units are only clobbered when all super-registers are clobbered.

llvm-svn: 192623

3f4d6c65

Use a SparseSet in LiveRegUnits. · 276dd453

Andrew Trick authored Oct 14, 2013

Some clients may add block live ins and may track liveness over a
large scope. This guarantees an efficient implementation in all cases
with no memory allocation/deallocation, independent of the number of
target registers. It could be slightly less convenient but is fine in
the expected case.

llvm-svn: 192622

276dd453

Move LiveRegUnits implementation into .cpp. Comment and format. · 0aed0cfc
Andrew Trick authored Oct 14, 2013
```
llvm-svn: 192621
```
0aed0cfc
Convert LiveRegUnits methods to the current convention (it's new code). · ff3585c5
Andrew Trick authored Oct 14, 2013
```
llvm-svn: 192619
```
ff3585c5

Debug Info: static member DIE creation. · c6b63927

Manman Ren authored Oct 14, 2013

Clean up creation of static member DIEs. We can create static member DIEs from
two places, so we call getOrCreateStaticMemberDIE from the two places.

getOrCreateStaticMemberDIE will get or create the context DIE first, then it
will check if the DIE already exists, if not, we create the static member DIE
and add it to the context.

Creation of static member DIEs are handled in a similar way as subprogram DIEs.

llvm-svn: 192618

c6b63927

Fix indenting. · 6004dbc9
David Blaikie authored Oct 14, 2013
```
That wasn't confusing /at all/...

llvm-svn: 192617
```
6004dbc9

MachineSink: Fix and tweak critical-edge breaking heuristic. · 5cb7f4e3

Will Dietz authored Oct 14, 2013

Per original comment, the intention of this loop
is to go ahead and break the critical edge
(in order to sink this instruction) if there's
reason to believe doing so might "unblock" the
sinking of additional instructions that define
registers used by this one.  The idea is that if
we have a few instructions to sink "together"
breaking the edge might be worthwhile.

This commit makes a few small changes
to help better realize this goal:

First, modify the loop to ignore registers
defined by this instruction.  We don't
sink definitions of physical registers,
and sinking an SSA definition isn't
going to unblock an upstream instruction.

Second, ignore uses of physical registers.
Instructions that define physical registers are
rejected for sinking, and so moving this one
won't enable moving any defining instructions.
As an added bonus, while virtual register
use-def chains are generally small due
to SSA goodness, iteration over the uses
and definitions (used by hasOneNonDBGUse)
for physical registers like EFLAGS
can be rather expensive in practice.
(This is the original reason for looking at this)

Finally, to keep things simple continue
to only consider this trick for registers that
have a single use (via hasOneNonDBGUse),
but to avoid spuriously breaking critical edges
only do so if the definition resides
in the same MBB and therefore this one directly
blocks it from being sunk as well.
If sinking them together is meant to be,
let the iterative nature of this pass
sink the definition into this block first.

Update tests to accomodate this change,
add new testcase where sinking avoids pipeline stalls.

llvm-svn: 192608

5cb7f4e3

Remove the now unused strong phi elimination pass. · 9770bde5
Rafael Espindola authored Oct 14, 2013
```
llvm-svn: 192604
```
9770bde5

Fixed a bug in dynamic allocation memory on stack. · 82a46ebe

Elena Demikhovsky authored Oct 14, 2013

The alignment of allocated space was wrong, see Bugzila 17345.

Done by Zvi Rackover <zvi.rackover@intel.com>.

llvm-svn: 192573

82a46ebe

Oct 13, 2013
- TargetLowering: Don't index into empty string. · ae726a93
  Will Dietz authored Oct 13, 2013
```
(This is triggered by current lit tests)

llvm-svn: 192549
```
  ae726a93
Oct 12, 2013

Debug Info: remove form from function addDIEEntry. · 4c4b69c9

Manman Ren authored Oct 11, 2013

The form must be a reference form in addDIEEntry. Which reference form to
use will be decided by the callee.

No functionality change.

llvm-svn: 192517

4c4b69c9

Oct 11, 2013

fConversion: Attempt #2 at fixing the MSVC build. · a9767aed
Benjamin Kramer authored Oct 11, 2013
```
llvm-svn: 192492
```
a9767aed
IfConversion: Try to unbreak the MSVC build. · 24906d96
Benjamin Kramer authored Oct 11, 2013
```
llvm-svn: 192487
```
24906d96

Remove kill flags after if conversion if necessary · d616ccc0

Matthias Braun authored Oct 11, 2013

When if converting something like:
true:
   ... = R0<kill>

false:
   ... = R0<kill>

then the instructions of the true block must not have a <kill> flag
anymore, as the instruction of the false block follow and do still read
the R0 value.
Specifically this patch determines the set of register live-in in the
false block (possibly after simulating the liveness changes of the
duplicated instructions). Each of these live-in registers mustn't be
killed.

llvm-svn: 192482

d616ccc0

[DAGCombiner] Reapply load slicing (192471) with a test that explicitly set sse4.2 support. · de0e0623

Quentin Colombet authored Oct 11, 2013

This should fix the buildbots.

Original commit message:
[DAGCombiner] Slice a big load in two loads when the element are next to each
other in memory and the target has paired load and performs post-isel loads
combining.

E.g., this optimization will transform something like this:
a = load i64* addr
b = trunc i64 a to i32
c = lshr i64 a, 32
d = trunc i64 c to i32

into:
b = load i32* addr1
d = load i32* addr2
Where addr1 = addr2 +/- sizeof(i32), if the target supports paired load and
performs post-isel loads combining.

One should overload TargetLowering::hasPairedLoad to provide this information.
The default is false.

<rdar://problem/14477220>

llvm-svn: 192476

de0e0623

[DAGCombiner] Revert load slicing (r192471), until I figure out why it fails on ubuntu. · 5aee63d9
Quentin Colombet authored Oct 11, 2013
```
llvm-svn: 192474
```
5aee63d9

[DAGCombiner] Slice a big load in two loads when the element are next to each · 41dc258f

Quentin Colombet authored Oct 11, 2013

other in memory and the target has paired load and performs post-isel loads
combining.

E.g., this optimization will transform something like this:
 a = load i64* addr
 b = trunc i64 a to i32
 c = lshr i64 a, 32
 d = trunc i64 c to i32

into:
 b = load i32* addr1
 d = load i32* addr2
Where addr1 = addr2 +/- sizeof(i32), if the target supports paired load and
performs post-isel loads combining.

One should overload TargetLowering::hasPairedLoad to provide this information.
The default is false.

<rdar://problem/14477220>

llvm-svn: 192471

41dc258f

fix typo in comment · b542fa51
Matthias Braun authored Oct 11, 2013
```
llvm-svn: 192455
```
b542fa51

Make AsmPrinter::emitImplicitDef a virtual method so targets can emit custom... · 660597d1

Justin Holewinski authored Oct 11, 2013

Make AsmPrinter::emitImplicitDef a virtual method so targets can emit custom comments for implicit defs

For NVPTX, this fixes a crash where the emitImplicitDef implementation was expecting physical registers,
while NVPTX uses virtual registers (with a couple of exceptions). Now, the implicit def comment will be
emitted as a true PTX register name. Other targets can use this to customize the output of implicit def
comments.

Fixes PR17519

llvm-svn: 192444

660597d1

LiveRangeCalc.h: Update a description corresponding to r192396. [-Wdocumentation] · d5d16d57
NAKAMURA Takumi authored Oct 11, 2013
```
llvm-svn: 192421
```
d5d16d57

Oct 10, 2013
- Print register in LiveInterval::print() · f6fe6bff
  Matthias Braun authored Oct 10, 2013
```
llvm-svn: 192398
```
  f6fe6bff
- Represent RegUnit liveness with LiveRange instance · 34e1be94
  Matthias Braun authored Oct 10, 2013
```
Previously LiveInterval has been used, but having a spill weight and
register number is unnecessary for a register unit.

llvm-svn: 192397
```
  34e1be94
- Work on LiveRange instead of LiveInterval where possible · 2d5c32b3
  Matthias Braun authored Oct 10, 2013
```
Also change some pointer arguments to references at some places where
0-pointers are not allowed.

llvm-svn: 192396
```
  2d5c32b3
- Change MachineVerifier to work on LiveRange + LiveInterval · 364e6e90
  Matthias Braun authored Oct 10, 2013
```
llvm-svn: 192395
```
  364e6e90
- Pass LiveQueryResult by value · 88dd0abd
  Matthias Braun authored Oct 10, 2013
```
This makes the API a bit more natural to use and makes it easier to make
LiveRanges implementation details private.

llvm-svn: 192394
```
  88dd0abd
- Refactor LiveInterval: introduce new LiveRange class · d7df935b
  Matthias Braun authored Oct 10, 2013
```
LiveRange just manages a list of segments and a list of value numbers
now as LiveInterval did previously, but without having details like spill
weight or a fixed register number.
LiveInterval is now a subclass of LiveRange and simply adds the spill weight
and the register number.

llvm-svn: 192393
```
  d7df935b
- Rename LiveRange to LiveInterval::Segment · 13ddb7cd
  Matthias Braun authored Oct 10, 2013
```
The Segment struct contains a single interval; multiple instances of this struct
are used to construct a live range, but the struct is not a live range by
itself.

llvm-svn: 192392
```
  13ddb7cd
- Rename parameter: defined regs are not incoming. · 1965bfa4
  Matthias Braun authored Oct 10, 2013
```
llvm-svn: 192391
```
  1965bfa4
- Use getPointerSizeInBits() rather than 8 * getPointerSize() · a98c3b18
  Matt Arsenault authored Oct 10, 2013
```
llvm-svn: 192386
```
  a98c3b18
- Debug Info: In DIBuilder, the context field of subprogram is updated to use · c50fa111
  Manman Ren authored Oct 10, 2013
```
DIScopeRef.

A paired commit at clang is required due to changes to DIBuilder.

llvm-svn: 192378
```
  c50fa111
Oct 09, 2013

Debug Info: In DIBuilder, the context and type fields of template_type and · 88b0f948

Manman Ren authored Oct 09, 2013

template_value are updated to use DIRef.

A paired commit at clang is required due to changes to DIBuilder.

llvm-svn: 192320

88b0f948

Oct 08, 2013
- Explicitly request unsigned enum types when desired · cd4a25d6
  Reid Kleckner authored Oct 08, 2013
```
This fixes repeated -Wmicrosoft warnings when self-hosting clang on
Windows, and gets us real unsigned enum types with MSVC.

llvm-svn: 192227
```
  cd4a25d6
- Add DbgVariable::resolve per Eric's suggestion. · be5576f5
  Manman Ren authored Oct 08, 2013
```
llvm-svn: 192218
```
  be5576f5
- Debug Info: rename getOriginalTypeSize to getBaseTypeSize. · bda410f4
  Manman Ren authored Oct 08, 2013
```
llvm-svn: 192216
```
  bda410f4