Commits · 77c5bb5e4a9013e1f5ea9f232a54e59d989fd94f · Roger Ferrer / llvm-epi-0.8

Oct 18, 2013
- DIEHash: Add more things (and remove one character) from the COLLECT_ATTR macro · 01fae51f
  David Blaikie authored Oct 17, 2013
```
Makes the uses more terse and requires that they use a semicolon at the
end that helps editors indent proceeding lines correctly.

llvm-svn: 192925
```
  01fae51f
- DIEHash: Support for simple (non-recursive, non-reused) type references · ca353be6
  David Blaikie authored Oct 17, 2013
```
llvm-svn: 192924
```
  ca353be6
Oct 17, 2013

Replace sra with srl if a single sign bit is required · 95f7ba98
Richard Sandiford authored Oct 17, 2013
```
E.g. (and (sra (i32 x) 31) 2) -> (and (srl (i32 x) 30) 2).

llvm-svn: 192884
```
95f7ba98

Fix edge condition in DAGCombiner to improve codegen of shift sequences. · 561badf7

Andrea Di Biagio authored Oct 17, 2013

When canonicalizing dags according to the rule
(shl (zext (shr X, c1) ), c1) ==> (zext (shl (shr X, c1), c1))

remember to add the new shl dag to the DAGCombiner worklist of nodes.
If we don't explicitly add it to the worklist of nodes to visit, we
may not trigger later on the rule that folds the shift left + logical
shift right into a AND instruction with bitmask.

llvm-svn: 192883

561badf7

According to the dwarf standard pubnames and pubtypes for languages · 2c8b7907

Eric Christopher authored Oct 17, 2013

like C++ should be the fully qualified names for the type.

Add a routine that does a language specific context walk to build
up the qualified name and use it when we add types/names to the
tables. Expand the gnu pubnames testcase as it's the most complex
to make sure that qualified types are also being added.

llvm-svn: 192865

2c8b7907

[projects/test-suite] White space and long line fixes. · d4e9615d
Jack Carter authored Oct 17, 2013
```
No functionality changes.

llvm-svn: 192863
```
d4e9615d
Add the subprogram DIEs to the context they're created with only · 96eff3f3
Eric Christopher authored Oct 17, 2013
```
if they're a declaration, otherwise they're owned by the compile
unit.

llvm-svn: 192861
```
96eff3f3
DIEHash: Include the type's context in the type hash. · 8a142aaa
David Blaikie authored Oct 17, 2013
```
llvm-svn: 192856
```
8a142aaa
DIEHash: Use DW_FORM_sdata for integers, per spec. · 6316ca45
David Blaikie authored Oct 16, 2013
```
This allows us to produce the same hash as GCC for at least some simple
examples.

llvm-svn: 192855
```
6316ca45

Oct 16, 2013

Remove ambiguity introduced in r192836 · 920bb2a7
David Blaikie authored Oct 16, 2013
```
llvm-svn: 192840
```
920bb2a7
DIEHash: Include the trailing zero byte after the children of a DIE · 71a0ad66
David Blaikie authored Oct 16, 2013
```
llvm-svn: 192836
```
71a0ad66
After PostRA scheduling, don't set kill flags on undef operands. · 811a2ef9
Andrew Trick authored Oct 16, 2013
```
This should fix the ATOM buildbot failing on break-avx-dep.ll.

llvm-svn: 192824
```
811a2ef9

DAGCombiner: Don't fold xor into not if getNOT would introduce an illegal constant. · 00eb07b7

Benjamin Kramer authored Oct 16, 2013

This happens e.g. with <2 x i64> -1 on x86_32. It cannot be generated directly
because i64 is illegal. It would be nice if getNOT would handle this
transparently, but I don't see a way to generate a legal constant there right
now. Fixes PR17487.

llvm-svn: 192795

00eb07b7

Handle (shl (anyext (shr ...))) in SimpilfyDemandedBits · 374a0e50

Richard Sandiford authored Oct 16, 2013

This is really an extension of the current (shl (shr ...)) -> shl optimization.
The main difference is that certain upper bits must also not be demanded.

The motivating examples are the first two in the testcase, which occur
in llvmpipe output.

llvm-svn: 192783

374a0e50

Add support for metadata representing .ident directives. · 0018a59d
Rafael Espindola authored Oct 16, 2013
```
llvm-svn: 192764
```
0018a59d

Fix a pair of bugs in the emission of pubname tables: · d2b497b5

Eric Christopher authored Oct 16, 2013

1) Make sure we emit static member variables by checking
at the end of createGlobalVariableDIE rather than piecemeal
in the function.
(As a note, createGlobalVariableDIE needs rewriting.)

2) Make sure we use the definition rather than declaration DIE
for two things: a) determining linkage for gnu pubnames, and b)
as the address of the DIE for global variables.
(As a note, createGlobalVariableDIE really needs rewriting.)

Adjust the testcase to make sure we're checking the correct DIEs.

llvm-svn: 192761

d2b497b5

Simplify zero initialization of DIEAttrs variable. · 94ded5f3
David Blaikie authored Oct 16, 2013
```
llvm-svn: 192755
```
94ded5f3

Make sure we're not attempting to construct a subprogram DIE · a6c38a32

Eric Christopher authored Oct 15, 2013

twice and just look up the value. Fix the one case where
we were trying to create a subprogram DIE and we should already
have had one. Reflow formatting in collectDeadVariables while fixing.

llvm-svn: 192749

a6c38a32

Oct 15, 2013

Remove some dead code. (DarwinGDBCompat was retired in r189903). · 5bf1d009
Adrian Prantl authored Oct 15, 2013
```
llvm-svn: 192731
```
5bf1d009
Guard the debug temp variable with NDEBUG to avoid warning/error with NDEBUG defined. · eb4a6e7c
Pekka Jaaskelainen authored Oct 15, 2013
```
llvm-svn: 192709
```
eb4a6e7c
Do not assert when trying to add a meta data operand with · eb08e2e0
Pekka Jaaskelainen authored Oct 15, 2013
```
MachineInstr::addOperand().

llvm-svn: 192707
```
eb08e2e0

Improve on r192635, ExeDepsFix for avx, and add a test case. · 3a99693c

Andrew Trick authored Oct 15, 2013

rdar:15221834 False AVX register dependencies cause 5x slowdown on
flops-5/6 and significant slowdown on several others.

This was blocking the switch to MI-Sched.

llvm-svn: 192669

3a99693c

Fix the ExecutionDepsFix pass to handle AVX instructions. · b6d56be6

Andrew Trick authored Oct 14, 2013

This pass is needed to break false dependencies. Without it, unlucky
register assignment can result in wild (5x) swings in
performance. This pass was trying to handle AVX but not getting it
right. AVX doesn't have partial register defs, it has unused register
reads in which the high bits of a source operand are copied into the
unused bits of the dest.

Fixing this requires conservative liveness analysis. This is awkard
because the pass already has its own pseudo-liveness. However, proper
liveness is expensive, and we would like to use a generic utility to
compute it. The fix only invokes liveness on-demand. It is rare to
detect a case that needs undef-read dependence breaking, but when it
happens, it can be needed many times within a very large block.

I think the existing heuristic which uses a register window of 16 is
too conservative for loop-carried false dependencies. If the loop is a
reduction. The out-of-order engine may be able to execute several loop
iterations in parallel. However, I'll leave this tuning exercise for
next time.

llvm-svn: 192635

b6d56be6

LiveRegUnits: Use *MBB for consistency and convenience. · e2f7cc4c
Andrew Trick authored Oct 14, 2013
```
llvm-svn: 192634
```
e2f7cc4c

Oct 14, 2013

LiveRegUnits::removeRegsInMask safety. · 3f4d6c65

Andrew Trick authored Oct 14, 2013

Clobbering is exclusive not inclusive on register units.
For liveness, we need to consider all the preserved registers.
e.g. A regmask that clobbers YMM0 may preserve XMM0.
Units are only clobbered when all super-registers are clobbered.

llvm-svn: 192623

3f4d6c65

Use a SparseSet in LiveRegUnits. · 276dd453

Andrew Trick authored Oct 14, 2013

Some clients may add block live ins and may track liveness over a
large scope. This guarantees an efficient implementation in all cases
with no memory allocation/deallocation, independent of the number of
target registers. It could be slightly less convenient but is fine in
the expected case.

llvm-svn: 192622

276dd453

Move LiveRegUnits implementation into .cpp. Comment and format. · 0aed0cfc
Andrew Trick authored Oct 14, 2013
```
llvm-svn: 192621
```
0aed0cfc
Convert LiveRegUnits methods to the current convention (it's new code). · ff3585c5
Andrew Trick authored Oct 14, 2013
```
llvm-svn: 192619
```
ff3585c5

Debug Info: static member DIE creation. · c6b63927

Manman Ren authored Oct 14, 2013

Clean up creation of static member DIEs. We can create static member DIEs from
two places, so we call getOrCreateStaticMemberDIE from the two places.

getOrCreateStaticMemberDIE will get or create the context DIE first, then it
will check if the DIE already exists, if not, we create the static member DIE
and add it to the context.

Creation of static member DIEs are handled in a similar way as subprogram DIEs.

llvm-svn: 192618

c6b63927

Fix indenting. · 6004dbc9
David Blaikie authored Oct 14, 2013
```
That wasn't confusing /at all/...

llvm-svn: 192617
```
6004dbc9

MachineSink: Fix and tweak critical-edge breaking heuristic. · 5cb7f4e3

Will Dietz authored Oct 14, 2013

Per original comment, the intention of this loop
is to go ahead and break the critical edge
(in order to sink this instruction) if there's
reason to believe doing so might "unblock" the
sinking of additional instructions that define
registers used by this one.  The idea is that if
we have a few instructions to sink "together"
breaking the edge might be worthwhile.

This commit makes a few small changes
to help better realize this goal:

First, modify the loop to ignore registers
defined by this instruction.  We don't
sink definitions of physical registers,
and sinking an SSA definition isn't
going to unblock an upstream instruction.

Second, ignore uses of physical registers.
Instructions that define physical registers are
rejected for sinking, and so moving this one
won't enable moving any defining instructions.
As an added bonus, while virtual register
use-def chains are generally small due
to SSA goodness, iteration over the uses
and definitions (used by hasOneNonDBGUse)
for physical registers like EFLAGS
can be rather expensive in practice.
(This is the original reason for looking at this)

Finally, to keep things simple continue
to only consider this trick for registers that
have a single use (via hasOneNonDBGUse),
but to avoid spuriously breaking critical edges
only do so if the definition resides
in the same MBB and therefore this one directly
blocks it from being sunk as well.
If sinking them together is meant to be,
let the iterative nature of this pass
sink the definition into this block first.

Update tests to accomodate this change,
add new testcase where sinking avoids pipeline stalls.

llvm-svn: 192608

5cb7f4e3

Remove the now unused strong phi elimination pass. · 9770bde5
Rafael Espindola authored Oct 14, 2013
```
llvm-svn: 192604
```
9770bde5

Fixed a bug in dynamic allocation memory on stack. · 82a46ebe

Elena Demikhovsky authored Oct 14, 2013

The alignment of allocated space was wrong, see Bugzila 17345.

Done by Zvi Rackover <zvi.rackover@intel.com>.

llvm-svn: 192573

82a46ebe

Oct 13, 2013
- TargetLowering: Don't index into empty string. · ae726a93
  Will Dietz authored Oct 13, 2013
```
(This is triggered by current lit tests)

llvm-svn: 192549
```
  ae726a93
Oct 12, 2013

Debug Info: remove form from function addDIEEntry. · 4c4b69c9

Manman Ren authored Oct 11, 2013

The form must be a reference form in addDIEEntry. Which reference form to
use will be decided by the callee.

No functionality change.

llvm-svn: 192517

4c4b69c9

Oct 11, 2013

fConversion: Attempt #2 at fixing the MSVC build. · a9767aed
Benjamin Kramer authored Oct 11, 2013
```
llvm-svn: 192492
```
a9767aed
IfConversion: Try to unbreak the MSVC build. · 24906d96
Benjamin Kramer authored Oct 11, 2013
```
llvm-svn: 192487
```
24906d96

Remove kill flags after if conversion if necessary · d616ccc0

Matthias Braun authored Oct 11, 2013

When if converting something like:
true:
   ... = R0<kill>

false:
   ... = R0<kill>

then the instructions of the true block must not have a <kill> flag
anymore, as the instruction of the false block follow and do still read
the R0 value.
Specifically this patch determines the set of register live-in in the
false block (possibly after simulating the liveness changes of the
duplicated instructions). Each of these live-in registers mustn't be
killed.

llvm-svn: 192482

d616ccc0

[DAGCombiner] Reapply load slicing (192471) with a test that explicitly set sse4.2 support. · de0e0623

Quentin Colombet authored Oct 11, 2013

This should fix the buildbots.

Original commit message:
[DAGCombiner] Slice a big load in two loads when the element are next to each
other in memory and the target has paired load and performs post-isel loads
combining.

E.g., this optimization will transform something like this:
a = load i64* addr
b = trunc i64 a to i32
c = lshr i64 a, 32
d = trunc i64 c to i32

into:
b = load i32* addr1
d = load i32* addr2
Where addr1 = addr2 +/- sizeof(i32), if the target supports paired load and
performs post-isel loads combining.

One should overload TargetLowering::hasPairedLoad to provide this information.
The default is false.

<rdar://problem/14477220>

llvm-svn: 192476

de0e0623

[DAGCombiner] Revert load slicing (r192471), until I figure out why it fails on ubuntu. · 5aee63d9
Quentin Colombet authored Oct 11, 2013
```
llvm-svn: 192474
```
5aee63d9