Commits · b7ff9b1599fa81a67297786301678fed660f11e2 · Roger Ferrer / llvm-epi-0.8

Apr 27, 2012

[asan] small optimization: do not emit "x+0" instructions · 5a464f03
Kostya Serebryany authored Apr 27, 2012
```
llvm-svn: 155701
```
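
One plausible reading of this change, assuming the "x+0" comes from a shadow-address computation of the form `(addr >> scale) + offset` with a zero offset, is that the add is skipped when it cannot change the value. The helper `emit_mem_to_shadow` below is hypothetical and returns pseudo-IR strings; it is a sketch of the idea, not the AddressSanitizer pass itself.

```python
def emit_mem_to_shadow(addr, scale=3, offset=0):
    """Return pseudo-IR lines computing a shadow address (hypothetical helper)."""
    lines = [f"%shadow = lshr {addr}, {scale}"]
    if offset != 0:
        # Only emit the add when it actually changes the value.
        lines.append(f"%shadow.off = add %shadow, {offset}")
    return lines


if __name__ == "__main__":
    print(emit_mem_to_shadow("%addr"))                  # shift only
    print(emit_mem_to_shadow("%addr", offset=1 << 29))  # shift plus add
```
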
Refactor IT handling not to store the bottom bit of the condition code in the... · f435b09e
Richard Barton authored Apr 27, 2012
```
Refactor IT handling not to store the bottom bit of the condition code in the mask operand in the MCInst.

llvm-svn: 155700
```
Revert r155682, "Use ConstantExpr::getExtractElement when constant-folding vectors" · 6008dfdb
NAKAMURA Takumi authored Apr 27, 2012
```
It broke stage2 build. stage1/clang sometimes crashed.

llvm-svn: 155699
```
[tsan] Atomic support for ThreadSanitizer, patch by Dmitry Vyukov · a1259778
Kostya Serebryany authored Apr 27, 2012
```
llvm-svn: 155698
```
Implement a bastardized ABI. · 1ec87ee0
Evan Cheng authored Apr 27, 2012
```
llvm-svn: 155686
```
- thumbv6 shouldn't imply +thumb2. Cortex-M0 doesn't support 32-bit Thumb2 · f52003de
Evan Cheng authored Apr 27, 2012
```
  instructions.
- However, it does support dmb, dsb, isb, mrs, and msr.
rdar://11331541

llvm-svn: 155685
```

Use ConstantExpr::getExtractElement when constant-folding vectors · 90f3798f

Dan Gohman authored Apr 27, 2012

instead of getAggregateElement. This has the advantage of being
more consistent and allowing higher-level constant folding to
proceed even if an inner extract element cannot be folded.

Make ConstantFoldInstruction call ConstantFoldConstantExpression
on the instruction's operands, making it more consistent with 
ConstantFoldConstantExpression itself. This makes sure that
ConstantExprs get TargetData-aware folding before being handed
off as operands for further folding.

This causes more expressions to be folded, but due to a known
shortcoming in constant folding, this currently has the side effect
of stripping a few more nuw and inbounds flags in the non-targetdata
side of constant-fold-gep.ll. This is mostly harmless.

This fixes rdar://11324230.

llvm-svn: 155682


Break up getProfitableChainIncrement(). · c90abc89

Jakob Stoklund Olesen authored Apr 26, 2012

The required checks are moved to ChainInstruction() itself and the
policy decisions are moved to IVChain::isProfitableInc().

Also cache the ExprBase in IVChain to avoid frequent recomputations.

No functional change intended.

llvm-svn: 155676


Turn IVChain into a struct. · a0337d7b
Jakob Stoklund Olesen authored Apr 26, 2012
```
No functional change intended.

llvm-svn: 155675
```

Add instcombine patterns for the following transformations: · 7813dcee

Chad Rosier authored Apr 26, 2012

 (x & y) | (x ^ y) -> x | y 
 (x & y) + (x ^ y) -> x | y 

Patch by Manman Ren.
rdar://10770603

llvm-svn: 155674

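
Both rewrites rely on `x & y` and `x ^ y` never having a set bit in the same position, so OR-ing or adding them (no carries occur) yields `x | y`. A minimal sketch that checks the two identities exhaustively, assuming an 8-bit integer width with wrapping addition (the function name is illustrative):

```python
from itertools import product


def identities_hold(bits=8):
    """Check both rewrites for every pair of `bits`-wide unsigned values."""
    mask = (1 << bits) - 1
    for x, y in product(range(mask + 1), repeat=2):
        or_form = (x & y) | (x ^ y)
        add_form = ((x & y) + (x ^ y)) & mask  # wrap like a fixed-width add
        if or_form != (x | y) or add_form != (x | y):
            return False
    return True


if __name__ == "__main__":
    print(identities_hold())
```
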

Apr 26, 2012

Fix the SD scheduler to avoid gluing the same node twice. · 03fa574a

Andrew Trick authored Apr 26, 2012

DAGCombine strangeness may result in multiple loads from the same
offset. They both may try to glue themselves to another load. We could
insist that the redundant loads glue themselves to each other, but the
better fix is to bail out from bad gluing at the time we detect it.

Fixes rdar://11314175: BuildSchedUnits assert.

llvm-svn: 155668


ARM: Thumb ldr(literal) base address alignment is 32-bits. · 3d6c629e

Jim Grosbach authored Apr 26, 2012

The base address for the PC-relative load is Align(PC,4), so it's the
address of the word containing the 16-bit instruction, not the address
of the instruction itself. Ugh.

rdar://11314619

llvm-svn: 155659

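
The address arithmetic described above can be sketched as follows, assuming the architectural rule that PC reads as the instruction address plus 4 in Thumb state. The function names are illustrative and are not taken from the LLVM sources.

```python
def thumb_pc(instr_addr):
    """PC value observed by a Thumb instruction: its own address plus 4."""
    return instr_addr + 4


def ldr_literal_base(instr_addr):
    """Align(PC, 4): clear the low two bits of the observed PC."""
    return thumb_pc(instr_addr) & ~0x3


def ldr_literal_target(instr_addr, imm):
    """Address loaded by `ldr Rt, [pc, #imm]`."""
    return ldr_literal_base(instr_addr) + imm


if __name__ == "__main__":
    # A 16-bit instruction at 0x1002 sees PC = 0x1006, but the base is 0x1004.
    print(hex(ldr_literal_base(0x1002)))
```

An instruction at a 4-byte-aligned address and the halfword that follows it share the same base, which is why the offset cannot be computed from the instruction address alone.
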

· 81290f4b

Preston Gurd authored Apr 26, 2012

Trivial change to set the UseLeaForSP flag in addition to toggling
the FeatureLeaForSP feature bit when LLVM auto-detects Intel Atom.

Patch by Andy Zhang

llvm-svn: 155655


[Support/YAML] Properly fix uninitialized variable warning by inserting a · a6c2c291
Michael J. Spencer authored Apr 26, 2012
```
'REPLACEMENT CHARACTER' (U+FFFD) when getAsInteger fails.

llvm-svn: 155653
```

Use VLD1 in NEON extending-load patterns instead of VLDR. · 3de97b7a

Tim Northover authored Apr 26, 2012

On some cores it's a bad idea for performance to mix VFP and NEON instructions
and since these patterns are NEON anyway, the NEON load should be used.

llvm-svn: 155630


Test commit. · 6699a60b
Tim Northover authored Apr 26, 2012
```
llvm-svn: 155626
```

Enable detection of AVX and AVX2 support through CPUID. Add AVX/AVX2 to... · 08ccfbe5

Craig Topper authored Apr 26, 2012

Enable detection of AVX and AVX2 support through CPUID. Add AVX/AVX2 to corei7-avx, core-avx-i, and core-avx2 cpu names.

llvm-svn: 155618

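
For reference, the CPUID bits involved can be sketched as pure functions over register values. This assumes the standard Intel layout: AVX is CPUID.1:ECX bit 28, OSXSAVE is CPUID.1:ECX bit 27, AVX2 is CPUID.(EAX=7,ECX=0):EBX bit 5, and XCR0 bits 1 and 2 indicate that the OS saves XMM and YMM state. The function names are illustrative, and the sketch does not claim to mirror which of these checks LLVM's host detection performs.

```python
AVX_BIT = 1 << 28      # CPUID.1:ECX
OSXSAVE_BIT = 1 << 27  # CPUID.1:ECX
AVX2_BIT = 1 << 5      # CPUID.(EAX=7,ECX=0):EBX
XCR0_XMM_YMM = 0x6     # XCR0 bits 1 (SSE state) and 2 (AVX state)


def has_avx(leaf1_ecx, xcr0):
    """AVX is usable when the CPU reports it and the OS saves YMM state."""
    cpu_ok = (leaf1_ecx & AVX_BIT) and (leaf1_ecx & OSXSAVE_BIT)
    os_ok = (xcr0 & XCR0_XMM_YMM) == XCR0_XMM_YMM
    return bool(cpu_ok and os_ok)


def has_avx2(leaf1_ecx, xcr0, leaf7_ebx):
    """AVX2 additionally requires the leaf 7 feature bit."""
    return has_avx(leaf1_ecx, xcr0) and bool(leaf7_ebx & AVX2_BIT)


if __name__ == "__main__":
    ecx = AVX_BIT | OSXSAVE_BIT
    print(has_avx(ecx, 0x7), has_avx2(ecx, 0x7, AVX2_BIT))
```
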

Teach the reassociate pass to fold chains of multiplies with repeated · 739ef80f

Chandler Carruth authored Apr 26, 2012

elements to minimize the number of multiplies required to compute the
final result. This uses a heuristic to attempt to form near-optimal
binary exponentiation-style multiply chains. While there are some cases
it misses, it seems to do at least a decent job on a very diverse range of
inputs.

Initial benchmarks show no interesting regressions, and an 8%
improvement on SPASS. Let me know if any other interesting results (in
either direction) crop up!

Credit to Richard Smith for the core algorithm, and helping code the
patch itself.

llvm-svn: 155616

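
To illustrate why a binary exponentiation-style chain saves multiplies, here is a minimal sketch of plain square-and-multiply that also counts the multiplications it performs. It is the textbook method, not the heuristic used by the reassociate pass, and `pow_by_squaring` is an illustrative name.

```python
def pow_by_squaring(x, n):
    """Return (x**n, multiplies_used) for an integer n >= 1."""
    if n < 1:
        raise ValueError("n must be at least 1")
    result = None
    base = x
    multiplies = 0
    while n:
        if n & 1:
            if result is None:
                result = base          # first factor costs no multiply
            else:
                result *= base
                multiplies += 1
        n >>= 1
        if n:
            base *= base               # square for the next bit
            multiplies += 1
    return result, multiplies


if __name__ == "__main__":
    # x*x*x*x*x*x*x*x needs 7 multiplies naively, 3 with repeated squaring.
    print(pow_by_squaring(3, 8))
```

The binary method is not always optimal: for an exponent of 15 it uses 6 multiplies, while the chain x, x^2, x^3, x^6, x^12, x^15 needs only 5, which is the kind of gap a "near-optimal" heuristic may leave.
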

If triple is armv7 / thumbv7 and a CPU is specified, do not automatically assume · 9f7ad310

Evan Cheng authored Apr 26, 2012

the feature set of v7a. This comes about if the user specifies something like
-arch armv7 -mcpu=cortex-m3. We shouldn't be generating instructions such as
uxtab in this case.

rdar://11318438

llvm-svn: 155601


Don't forget to reset 'first operand' flag when we're setting the MDNodeOperand value. · 0156f44a
Bill Wendling authored Apr 26, 2012
```
llvm-svn: 155599
```

Apr 25, 2012

Print IV chain numbers while collecting them. · 293673d7
Jakob Stoklund Olesen authored Apr 25, 2012
```
llvm-svn: 155567
```
Remove more dead code. · 01f201f4
Jakob Stoklund Olesen authored Apr 25, 2012
```
llvm-svn: 155566
```

Unify internal representation of ARM instructions with a register... · ba5b0cc8

Richard Barton authored Apr 25, 2012

Unify internal representation of ARM instructions with a register right-shifted by #32. These are stored as shifts by #0 in the MCInst and correctly marshalled when transforming from or to assembly representation.

llvm-svn: 155565

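
The mapping described above can be sketched as a pair of conversion functions. This assumes the ARM convention that, for LSR and ASR with an immediate, a 5-bit amount field of 0 denotes a shift by 32; the function names are hypothetical and do not correspond to LLVM's MC layer API.

```python
RIGHT_SHIFTS = ("lsr", "asr")


def to_stored_amount(kind, asm_amount):
    """Assembly amount (1..32) -> stored amount, with #32 stored as 0."""
    if kind not in RIGHT_SHIFTS:
        raise ValueError("only right shifts accept #32")
    if not 1 <= asm_amount <= 32:
        raise ValueError("right-shift amount must be in 1..32")
    return 0 if asm_amount == 32 else asm_amount


def to_asm_amount(kind, stored):
    """Stored amount (0..31) -> assembly amount, with 0 printed as #32."""
    if kind not in RIGHT_SHIFTS:
        raise ValueError("only right shifts accept #32")
    if not 0 <= stored <= 31:
        raise ValueError("stored amount must be in 0..31")
    return 32 if stored == 0 else stored


if __name__ == "__main__":
    print(to_stored_amount("lsr", 32), to_asm_amount("asr", 0))
```
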

Remove the -disable-cross-class-join option. · 983dd43b

Jakob Stoklund Olesen authored Apr 25, 2012

Cross-class joins have been normal and fully supported for a while now.
With TableGen generating the getMatchingSuperRegClass() hook, they are
unlikely to cause problems again.

llvm-svn: 155552


Cross-class joining is winning. · d11cf967

Jakob Stoklund Olesen authored Apr 25, 2012

Remove the heuristic for disabling cross-class joins. The greedy
register allocator can handle the narrow register classes, and when it
splits a live range, it can pick a larger register class.

Benchmarks were unaffected by this change.

<rdar://problem/11302212>

llvm-svn: 155551


Add ifdef around getSubtargetFeatureName in tablegen output file so that only... · 3ec7c2aa

Craig Topper authored Apr 25, 2012

Add ifdef around getSubtargetFeatureName in tablegen output file so that only targets that want the function get it. This prevents other targets from getting an unused function warning.

llvm-svn: 155538


Use vector_shuffles instead of target specific unpack nodes for AVX... · 5ff6dc34

Craig Topper authored Apr 25, 2012

Use vector_shuffles instead of target specific unpack nodes for AVX ZERO_EXTEND/ANY_EXTEND combine. These will be converted to target specific nodes during lowering. This is more consistent with other code.

llvm-svn: 155537


Reverting r155468. Chris and Chandler have convinced me that it's dangerous and · 2fd0c691
Lang Hames authored Apr 25, 2012
```
in poor taste.

Talking through some alternate solutions with Chandler.

llvm-svn: 155530
```
Do not use $gp as a dedicated global register if the target ABI is not O32. · 2020e27d
Akira Hatanaka authored Apr 25, 2012
```
llvm-svn: 155522
```

Simplify the known retain count tracking; use a boolean state instead · 62079b43

Dan Gohman authored Apr 25, 2012

of a precise count. Also, move RRInfo's Partial field into PtrState,
now that it won't increase the size.

llvm-svn: 155513


Build custom predecessor and successor lists for each basic block. · c24c66f2

Dan Gohman authored Apr 24, 2012

These lists exclude invoke unwind edges and loop backedges which
are being ignored. This makes it easier to ignore them
consistently.

llvm-svn: 155500

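
A rough sketch of building such filtered lists, assuming the CFG is given as a dictionary from block name to successor names and that a back edge is any edge to a block still on the depth-first search stack. Invoke unwind edges are not modeled here, and `filtered_cfg_lists` is an illustrative name rather than anything from the pass.

```python
def filtered_cfg_lists(succs, entry):
    """Return (preds, kept_succs) with depth-first back edges left out."""
    preds = {block: [] for block in succs}
    kept = {block: [] for block in succs}
    state = {}  # block -> "active" while on the DFS stack, "done" afterwards

    def visit(block):
        state[block] = "active"
        for succ in succs[block]:
            if state.get(succ) == "active":
                continue  # back edge: ignore it in both lists
            kept[block].append(succ)
            preds[succ].append(block)
            if succ not in state:
                visit(succ)
        state[block] = "done"

    visit(entry)
    return preds, kept


if __name__ == "__main__":
    cfg = {"entry": ["header"], "header": ["body", "exit"],
           "body": ["header"], "exit": []}
    print(filtered_cfg_lists(cfg, "entry"))
```

Computing the lists once means every later traversal sees the same filtered graph, instead of each one re-checking which edges to skip.
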

ARM: improved assembler diagnostics for missing CPU features. · 5117ef74

Jim Grosbach authored Apr 24, 2012

When an instruction match is found, but the subtarget features it
requires are not available (missing floating point unit, or thumb vs arm
mode, for example), issue a diagnostic that identifies what the feature
mismatch is.

rdar://11257547

llvm-svn: 155499


Apr 24, 2012

Fix a naughty header include that breaks "installed" builds. · 4d4b5469
Andrew Trick authored Apr 24, 2012
```
llvm-svn: 155486
```
ConstantFoldSelectInstruction swapped the operands of the select. · 450d69a5
Nadav Rotem authored Apr 24, 2012
```
Fix 12592. Patch by Matt Pharr.

llvm-svn: 155480
```
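
For context, `select cond, a, b` yields `a` where the condition is true and `b` otherwise, element by element when the condition is a vector. A minimal sketch of folding a select over constants, assuming Python lists stand in for constant vectors (`fold_select` is an illustrative name, not the LLVM function):

```python
def fold_select(cond, true_val, false_val):
    """Fold a select over constants; lists model vector operands."""
    if isinstance(cond, (list, tuple)):
        # Vector condition: choose per element, true operand first.
        return [t if c else f for c, t, f in zip(cond, true_val, false_val)]
    return true_val if cond else false_val


if __name__ == "__main__":
    print(fold_select([True, False, True], [1, 2, 3], [10, 20, 30]))
```

Swapping the two value operands would return the false-side elements where the condition is true, which is the kind of mistake the commit title describes.
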

MachineBasicBlock::SplitCriticalEdge() should follow LLVM IR variant and... · 2d14d8ac

Evan Cheng authored Apr 24, 2012

MachineBasicBlock::SplitCriticalEdge() should follow LLVM IR variant and refuse to break edge to EH landing pad. rdar://11300144

llvm-svn: 155470


Add support for llvm.arm.neon.vmull* intrinsics to InstCombine. This fixes · 84531c2b
Lang Hames authored Apr 24, 2012
```
<rdar://problem/11291436>.

llvm-svn: 155468
```

Fix a crash on valid (if UB) bitcode that is produced for some global · aacb8a58

Chandler Carruth authored Apr 24, 2012

constants in C++11 mode. I have no idea why it required such particular
circumstances to get here, the code seems clearly to rely upon unchecked
assumptions.

Specifically, when we decide to form an index into a struct type, we may
have gone through (at least one) zero-length array indexing round, which
would have left the offset un-adjusted, and thus not necessarily valid
for use when indexing the struct type.

This is just a canonicalization step, so the correct thing is to refuse
to canonicalize nonsensical GEPs of this form. Implemented, and test
case added.

Fixes PR12642. Pair debugged and coded with Richard Smith. =] I credit
him with most of the debugging, and preventing me from writing the wrong
code.

llvm-svn: 155466


ARM: Nuke remnant bogus code. · 1e75fc1f

Jim Grosbach authored Apr 24, 2012

r154362 was supposed to delete this bit, but obviously didn't.

rdar://11305594

llvm-svn: 155465


AVX: Add additional vbroadcast replacement sequences for integers. · 810734b7
Nadav Rotem authored Apr 24, 2012
```
Remove the v2f64 patterns because they do not match any vbroadcast
instruction.

llvm-svn: 155461
```
cmake: new file · 26bdff9b
Andrew Trick authored Apr 24, 2012
```
llvm-svn: 155460
```