- Apr 05, 2013
-
Alexey Samsonov authored
llvm-svn: 178852
-
Stepan Dyatkovskiy authored
Fix for PR14824: "Optimization arm_ldst_opt inserts newly generated instruction vldmia at incorrect position". The patch introduces memory operand tracking in ARMLoadStoreOpt::LoadStoreMultipleOpti: for each register it keeps the order of load operations as it was before the optimization pass. It is a deeper form of the fix proposed by Hao (http://llvm.org/bugs/show_bug.cgi?id=14824#c4), and it also tracks conflicts between different register classes (e.g. D2 and S5). For more details see the bug description: http://llvm.org/bugs/show_bug.cgi?id=14824 and the llvm-commits discussion: http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20130311/167936.html http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20130318/168688.html http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20130325/169376.html http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20130401/170238.html llvm-svn: 178851
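A loose sketch of the bookkeeping idea follows (hypothetical data structures, not the pass's actual implementation): record each register's original load position, expanding registers into the lanes they cover so that cross-class aliases such as D2 vs. S5 show up as conflicts.

```cpp
#include <cstdio>
#include <map>
#include <string>
#include <vector>

// Expand a register into the single-precision lanes it covers. The alias
// map is hard-coded for this demo; the real pass consults TableGen-generated
// register aliasing data. On ARM, D2 overlaps S4 and S5.
static std::vector<std::string> lanes(const std::string &Reg) {
  if (Reg == "D2")
    return {"S4", "S5"};
  return {Reg};
}

int main() {
  // Loads in their original program order within one block.
  std::vector<std::string> LoadOrder = {"S5", "D2", "S6"};

  std::map<std::string, int> FirstSeen; // lane -> index of its first load
  for (int Pos = 0; Pos != (int)LoadOrder.size(); ++Pos) {
    for (const std::string &L : lanes(LoadOrder[Pos])) {
      auto Ins = FirstSeen.emplace(L, Pos);
      if (Ins.second)
        printf("%s first loaded by load #%d\n", L.c_str(), Pos);
      else
        printf("%s touched again by load #%d: conflicts with load #%d\n",
               L.c_str(), Pos, Ins.first->second);
    }
  }
}
```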
-
Hal Finkel authored
llvm-svn: 178850
-
Rafael Espindola authored
llvm-svn: 178849
-
Hal Finkel authored
llvm-svn: 178848
-
Rafael Espindola authored
llvm-svn: 178847
-
Hal Finkel authored
This change fixes a bug that I introduced in r178058. After a register is scavenged using one of the available spill slots, the instruction defining the virtual register needs to be moved to after the spill code. The scavenger has already processed the defining instruction so that registers killed by that instruction are available for definition in that same instruction. Unfortunately, after this, the scavenger needs to iterate through the spill code and then visit, again, the instruction that defines the now-scavenged register. In order to avoid confusion, the register scavenger needs the ability to 'back up' through the spill code so that it can again process the instructions in the appropriate order. Prior to this fix, once the scavenger reached the just-moved instruction, it would assert if it killed any registers because, having already processed the instruction, it believed they were undefined. Unfortunately, I don't yet have a small test case. Thanks to Pranav Bhandarkar for diagnosing the problem and testing this fix. llvm-svn: 178845
-
Arnold Schwaighofer authored
descriptions for compares llvm-svn: 178844
-
Rafael Espindola authored
Sorry for so many commits, but llvm is still building on my ppc vm. llvm-svn: 178843
-
Arnold Schwaighofer authored
llvm-svn: 178842
-
Rafael Espindola authored
llvm-svn: 178841
-
Rafael Espindola authored
Looks like there is a big endian/little endian problem here. Loosen the test to try to get the bots green while llvm builds on a ppc qemu vm. The failure was in http://lab.llvm.org:8011/builders/clang-ppc64-elf-linux2/ llvm-svn: 178839
-
Rafael Espindola authored
llvm-svn: 178835
-
Rafael Espindola authored
llvm-svn: 178833
-
Rafael Espindola authored
llvm-svn: 178829
-
Jakob Stoklund Olesen authored
llvm-svn: 178828
-
Andrew Trick authored
llvm-svn: 178823
-
Andrew Trick authored
For now, just save the compile time since the ConvergingScheduler heuristics don't use this analysis. We'll probably enable it later after compile-time investigation. llvm-svn: 178822
-
Andrew Trick authored
I'm getting more serious about tuning and enabling on x86/ARM. Start by making the trace readable. llvm-svn: 178821
-
Arnold Schwaighofer authored
Pass down the fact that an operand is going to be a vector of constants. This should bring the performance of MultiSource/Benchmarks/PAQ8p/paq8p on x86 back. It had degraded to scalar performance due to my previous shift cost change, which made all shifts expensive on x86. radar://13576547 llvm-svn: 178809
-
Arnold Schwaighofer authored
SSE2 has efficient support for shifts by a scalar. My previous change, which made shifts expensive, did not take this into account and marked all shifts as expensive. This would prevent vectorization from happening where it is actually beneficial. With this change we differentiate between shifts by constants and other shifts. radar://13576547 llvm-svn: 178808
-
Arnold Schwaighofer authored
On certain architectures we can support an efficient vectorized version of an instruction if the operand value is uniform (splat) or a constant scalar. An example of this is a vector shift on x86. We can efficiently support

for (i = 0; i < ; i += 4) w[0:3] = v[0:3] << <2, 2, 2, 2>

but not

for (i = 0; i < ; i += 4) w[0:3] = v[0:3] << x[0:3]

This patch adds a parameter to getArithmeticInstrCost to further qualify operand values as uniform or uniform constant. Targets can then choose to return a different cost for instructions with such operand values. A follow-up commit will test this feature on x86. radar://13576547 llvm-svn: 178807
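A minimal sketch of the idea behind the new parameter, using hypothetical names (OperandKind, shiftCost) rather than the actual getArithmeticInstrCost signature: the cost model charges less when the shift amount is known to be uniform across lanes.

```cpp
#include <cstdio>

// Illustrative qualification of an operand's value, mirroring the idea of
// "uniform" and "uniform constant" operands described above.
enum class OperandKind {
  AnyValue,        // nothing known about the operand
  UniformValue,    // the same runtime value in every lane
  UniformConstant  // the same compile-time constant in every lane
};

// Toy x86-like cost (abstract units) for a vector shift, keyed on what we
// know about the shift amount. Pre-AVX2 SSE can shift a whole vector by
// one scalar amount cheaply, but per-lane variable shifts scalarize.
unsigned shiftCost(OperandKind ShiftAmount) {
  switch (ShiftAmount) {
  case OperandKind::UniformConstant:
  case OperandKind::UniformValue:
    return 1;
  case OperandKind::AnyValue:
    return 10;
  }
  return 10;
}

int main() {
  printf("w[0:3] = v[0:3] << <2,2,2,2> : cost %u\n",
         shiftCost(OperandKind::UniformConstant));
  printf("w[0:3] = v[0:3] << x[0:3]    : cost %u\n",
         shiftCost(OperandKind::AnyValue));
}
```

With such a query the vectorizer can keep vectorizing the first loop above while declining the second on targets where variable vector shifts are slow.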
-
Manman Ren authored
There is a difference for FORM_ref_addr between DWARF 2 and DWARF 3+: DWARF 2 defines ref_addr to be address-sized, while DWARF 3 and later define it to be offset-sized. Since Eric is against guarding DWARF 2 ref_addr with DarwinGDBCompat, we are still discussing how to handle this. The correct solution is to update our header to say version 4 instead of version 2 and to update the tool chains as well. rdar://problem/13559431 llvm-svn: 178806
-
Adrian Prantl authored
llvm-svn: 178804
-
Hal Finkel authored
BCL is normally a conditional branch-and-link instruction, but has an unconditional form (which is used in the SjLj code, for example). To make clear that this BCL instruction definition is specifically the special unconditional form (which does not meaningfully take a condition-register input), rename it to BCLalways. No functionality change intended. llvm-svn: 178803
-
Hal Finkel authored
The DAGCombine logic that recognized a/sqrt(b) and transformed it into a multiplication by the reciprocal sqrt did not handle cases where the sqrt and the division were separated by an fpext or fptrunc. llvm-svn: 178801
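A minimal C++ illustration of the kind of source that produces the previously missed pattern (assuming the usual fast-math conditions under which the reciprocal-square-root rewrite is legal): the sqrt is computed in double, so an fptrunc separates it from the float division at the IR level.

```cpp
#include <cmath>

// a / sqrt(b) with mixed precision: the IR contains
//   %s = call double @sqrt(double %b)
//   %t = fptrunc double %s to float   ; separates the sqrt from the fdiv
//   %r = fdiv float %a, %t
// and the combine now looks through the fptrunc/fpext to rewrite the
// division as a multiply by a reciprocal-sqrt estimate.
float scaledRsqrt(float a, double b) {
  return a / static_cast<float>(std::sqrt(b));
}
```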
-
- Apr 04, 2013
-
Jyotsna Verma authored
It fixes the following tests for Hexagon: CodeGen/Generic/2003-07-29-BadConstSbyte.ll, CodeGen/Generic/2005-10-21-longlonggtu.ll, CodeGen/Generic/2009-04-28-i128-cmp-crash.ll, CodeGen/Generic/MachineBranchProb.ll, CodeGen/Generic/builtin-expect.ll, CodeGen/Generic/pr12507.ll. llvm-svn: 178794
-
Benjamin Kramer authored
OpndPtrs stored pointers into the Opnd vector that became invalid when the vector grew. Store indices instead. Sadly the only testcase I have is large and only triggers under valgrind, so I didn't include it. llvm-svn: 178793
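The bug class in miniature (a generic sketch, not the actual LLVM code): a raw pointer into a std::vector dangles once a push_back reallocates the storage, whereas an index stays valid.

```cpp
#include <cstdio>
#include <vector>

struct Opnd { int Val; };

int main() {
  std::vector<Opnd> Opnds;
  Opnds.push_back({1});

  Opnd *Ptr = &Opnds[0]; // buggy: raw pointer into the vector's storage
  size_t Idx = 0;        // fixed: an index survives reallocation
  (void)Ptr;

  // Growing the vector may reallocate and move the elements.
  for (int I = 2; I <= 100; ++I)
    Opnds.push_back({I});

  // Dereferencing Ptr here would be undefined behavior; indexing through
  // the vector re-resolves against the current storage and is safe.
  printf("%d\n", Opnds[Idx].Val); // prints 1
}
```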
-
Jyotsna Verma authored
never produce a byval parameter with size < 8 bytes. llvm-svn: 178792
-
Rafael Espindola authored
It had been dropped during the switch to yaml::IO. Also add a test going from yaml2obj to llvm-readobj. It can be extended as we add more fields/formats to yaml2obj. llvm-svn: 178786
-
Richard Osborne authored
llvm-svn: 178783
-
Richard Osborne authored
At the time the XCore backend was added there were some issues with overlapping register classes, but these all seem to be fixed now. Describing the register classes correctly allows us to get rid of a codegen-only instruction (LDAWSP_lru6_RRegs) and means we can disassemble ru6 instructions that use registers above r11. llvm-svn: 178782
-
Eli Bendersky authored
llvm-svn: 178774
-
Jakob Stoklund Olesen authored
The Thumb2SizeReduction pass avoids false CPSR dependencies, except that it still aggressively creates tMOVi8 instructions because they are so common. Avoid creating false CPSR dependencies even for tMOVi8 instructions when the CPSR flags are known to have high latency; this allows integer computation to overlap floating point computation. Also process blocks in reverse post-order and propagate high-latency flags to successors. <rdar://problem/13468102> llvm-svn: 178773
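A generic sketch of the traversal (hypothetical structures, not the Thumb2SizeReduction pass itself): compute a post-order of the CFG, then walk it backwards so each block is processed before its non-back-edge successors, OR-ing the high-latency flag into each successor.

```cpp
#include <cstdio>
#include <vector>

struct Block {
  std::vector<int> Succs;
  bool HighLatencyCPSR = false; // does a slow flag-setting def reach here?
};

// Standard DFS post-order; reversing it yields a reverse post-order (RPO).
static void postOrder(int N, std::vector<Block> &CFG,
                      std::vector<bool> &Seen, std::vector<int> &Out) {
  Seen[N] = true;
  for (int S : CFG[N].Succs)
    if (!Seen[S])
      postOrder(S, CFG, Seen, Out);
  Out.push_back(N);
}

int main() {
  // Diamond CFG: 0 -> {1, 2}, 1 -> 3, 2 -> 3.
  std::vector<Block> CFG(4);
  CFG[0].Succs = {1, 2};
  CFG[1].Succs = {3};
  CFG[2].Succs = {3};
  CFG[1].HighLatencyCPSR = true; // pretend block 1 defines CPSR slowly

  std::vector<bool> Seen(CFG.size());
  std::vector<int> PO;
  postOrder(0, CFG, Seen, PO);

  // RPO visit: each block's predecessors have already pushed their state.
  for (auto It = PO.rbegin(); It != PO.rend(); ++It)
    for (int S : CFG[*It].Succs)
      CFG[S].HighLatencyCPSR |= CFG[*It].HighLatencyCPSR;

  for (size_t I = 0; I < CFG.size(); ++I)
    printf("block %zu: high-latency CPSR reaches = %d\n", I,
           (int)CFG[I].HighLatencyCPSR);
}
```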
-
Eli Bendersky authored
llvm-svn: 178771
-
Evan Cheng authored
llvm-svn: 178769
-
Stepan Dyatkovskiy authored
llvm-svn: 178765
-
Vincent Lejeune authored
llvm-svn: 178763
-
Vincent Lejeune authored
llvm-svn: 178762
-
Vincent Lejeune authored
llvm-svn: 178761
-