Commits · 6da4ca83b07cb79dabb6243d6ccea5cae1dbd8ae · Roger Ferrer / llvm-epi-0.8

Jan 21, 2011

Venkatraman Govindaraju authored Jan 21, 2011

 Rename FLUSH to FLUSHW.
 Output "ta 3" instead of a "flushw" instruction if v8 instruction set is used.

llvm-svn: 123997

ef8cf45e

Just because we have determined that an (fcmp | fcmp) is true for A < B, · a834200d

Owen Anderson authored Jan 21, 2011

A == B, and A > B, does not mean we can fold it to true.  We still need to
check for A ? B (A unordered B).

llvm-svn: 123993

a834200d

Last round of fixes for movw + movt global address codegen. · 2f2435d0

Evan Cheng authored Jan 21, 2011

1. Fixed ARM pc adjustment.
2. Fixed dynamic-no-pic codegen
3. CSE of pc-relative load of global addresses.

It's now enabled by default for Darwin.

llvm-svn: 123991

2f2435d0

Clang was not parsing target triples involving EABI and was generating wrong... · 83758d5c

Renato Golin authored Jan 21, 2011

Clang was not parsing target triples involving EABI and was generating wrong IR (wrong PCS) and passing the wrong information down llc via the target-triple printed in IR. I've fixed this by adding the parsing of EABI into LLVM's Triple class and using it to choose the correct PCS in Clang's Tools. A Clang patch is on its way to use this infrastructure.

llvm-svn: 123990

83758d5c

Handles libffi on the CMake build. · 64955953
Oscar Fuentes authored Jan 21, 2011
```
Patch by arrowdodger!

llvm-svn: 123976
```
64955953

Fix the encoding of QADD/SUB, QDADD/SUB. While qadd16, qadd8 use "rd, rn, rm", · 4bd61238

Bruno Cardoso Lopes authored Jan 21, 2011

qadd and qdadd uses "rd, rm, rn", the same applies to the 'sub' variants. This
is described in ARM manuals and matches the encoding used by the gnu assembler.

llvm-svn: 123975

4bd61238

Implement support for byval arguments in Sparc backend. · 0594789f
Venkatraman Govindaraju authored Jan 21, 2011
```
llvm-svn: 123974
```
0594789f
SCCP doesn't actually preserve the CFG. It will delete and insert terminator · ae0275e0
Nick Lewycky authored Jan 21, 2011
```
instructions.

llvm-svn: 123973
```
ae0275e0

Enable support for precise scheduling of the instruction selection · bd428ec5

Andrew Trick authored Jan 21, 2011

DAG. Disable using "-disable-sched-cycles".

For ARM, this enables a framework for modeling the cpu pipeline and
counting stalls. It also activates several heuristics to drive
scheduling based on the model. Scheduling is inherently imprecise at
this stage, and until spilling is improved it may defeat attempts to
schedule. However, this framework provides greater control over
tuning codegen.

Although the flag is not target-specific, it should have very little
affect on the default scheduler used by x86. The only two changes that
affect x86 are:
- scheduling a high-latency operation bumps the current cycle so independent
  operations can have their latency covered. i.e. two independent 4
  cycle operations can produce results in 4 cycles, not 8 cycles.
- Two operations with equal register pressure impact and no
  latency-based stalls on their uses will be prioritized by depth before height
  (height is irrelevant if no stalls occur in the schedule below this point).

llvm-svn: 123971

bd428ec5

Convert -enable-sched-cycles and -enable-sched-hazard to -disable · 47ff14b0

Andrew Trick authored Jan 21, 2011

flags. They are still not enable in this revision.

Added TargetInstrInfo::isZeroCost() to fix a fundamental problem with
the scheduler's model of operand latency in the selection DAG.

Generalized unit tests to work with sched-cycles.

llvm-svn: 123969

47ff14b0

fix PR9013, an infinite loop in instcombine. · b5e15d19
Chris Lattner authored Jan 21, 2011
```
llvm-svn: 123968
```
b5e15d19
update obsolete comment. · f4ca47bd
Chris Lattner authored Jan 21, 2011
```
llvm-svn: 123965
```
f4ca47bd

Don't try to pull vector bitcasts that change the number of elements through · 6a083cf8

Nick Lewycky authored Jan 21, 2011

a select. A vector select is pairwise on each element so we'd need a new
condition with the right number of elements to select on. Fixes PR8994.

llvm-svn: 123963

6a083cf8

Object: Fix type punned pointer issues by making DataRefImpl a union and using intptr_t. · 0324b672
Michael J. Spencer authored Jan 21, 2011
```
llvm-svn: 123962
```
0324b672

Add a constant folding of casts from zero to zero. Fixes PR9011! · 39b12c05

Nick Lewycky authored Jan 21, 2011

While here, I'd like to complain about how vector is not an aggregate type
according to llvm::Type::isAggregateType(), but they're listed under aggregate
types in the LangRef and zero vectors are stored as ConstantAggregateZero.

llvm-svn: 123956

39b12c05

Don't be overly aggressive with CSE of "ldr constantpool". If it's a pc-relative · 028ccbfc

Evan Cheng authored Jan 20, 2011

value, the "add pc" must be CSE'ed at the same time. We could follow the same
approach as T2 by adding pseudo instructions that combine the ldr + "add pc".
But the better approach is to use movw + movt (which I will enable soon), so
I'll leave this as a TODO.

llvm-svn: 123949

028ccbfc

Jan 20, 2011

Implement requiredTransitive · f07426b4

Tobias Grosser authored Jan 20, 2011

The PassManager did not implement the transitivity of requiredTransitive. This
was unnoticed since 2006.

llvm-svn: 123942

f07426b4

Fix the encoding and parsing of clrex instruction · e965f06f
Bruno Cardoso Lopes authored Jan 20, 2011
```
llvm-svn: 123936
```
e965f06f
Change instruction names for consistency · ef8cab90
Bruno Cardoso Lopes authored Jan 20, 2011
```
llvm-svn: 123930
```
ef8cab90
Add cdp/cdp2 instructions for thumb/thumb2 · d8f9b37f
Bruno Cardoso Lopes authored Jan 20, 2011
```
llvm-svn: 123929
```
d8f9b37f

- Use a more appropriate name for Owen's ARM Parser isMCR hack since the same... · 33461ecc

Bruno Cardoso Lopes authored Jan 20, 2011

- Use a more appropriate name for Owen's ARM Parser isMCR hack since the same operands can be present
 in cdp/cdp2 instructions. Also increase the hack with cdp/cdp2 instructions.
- Fix the encoding of cdp/cdp2 instructions for ARM (no thumb and thumb2 yet) and add testcases for t
hem.

llvm-svn: 123927

33461ecc

SplitKit requires that all defs are in place before calling useIntv(). · 8a46e26b

Jakob Stoklund Olesen authored Jan 20, 2011

The value mapping gets confused about which original values have multiple new
definitions so they may need phi insertions.

This could probably be simplified by letting enterIntvBefore() take a live range
to be added following the instruction. As long as the range stays inside the
same basic block, value mapping shouldn't be a problem.

llvm-svn: 123926

8a46e26b

Add LiveIntervalMap::dumpCache() to print out the cache used by the ssa update algorithm. · 04e6b3bd
Jakob Stoklund Olesen authored Jan 20, 2011
```
llvm-svn: 123925
```
04e6b3bd
Add mcr*2 and mr*c2 support to thumb2 targets · 4d4b490f
Bruno Cardoso Lopes authored Jan 20, 2011
```
llvm-svn: 123919
```
4d4b490f
Add mcr* and mr*c support to thumb targets · cf99dc7e
Bruno Cardoso Lopes authored Jan 20, 2011
```
llvm-svn: 123917
```
cf99dc7e
Allow sign-extending of i8 and i16 to i128 on SPU. · 6e5a54b3
Kalle Raiskila authored Jan 20, 2011
```
llvm-svn: 123912
```
6e5a54b3

At -O123 the early-cse pass is run before instcombine has run. According to my · 8fb2c382

Duncan Sands authored Jan 20, 2011

auto-simplier the transform most missed by early-cse is (zext X) != 0 -> X != 0.
This patch adds this transform and some related logic to InstructionSimplify
and removes some of the logic from instcombine (unfortunately not all because
there are several situations in which instcombine can improve things by making
new instructions, whereas instsimplify is not allowed to do this). At -O2 this
often results in more than 15% more simplifications by early-cse, and results in
hundreds of lines of bitcode being eliminated from the testsuite. I did see some
small negative effects in the testsuite, for example a few additional instructions
in three programs. One program, 483.xalancbmk, got an additional 35 instructions,
which seems to be due to a function getting an additional instruction and then
being inlined all over the place.

llvm-svn: 123911

8fb2c382

Refactor mcr* and mr*c instructions into classes with the same encoding. No functionality change. · 32f9b756
Bruno Cardoso Lopes authored Jan 20, 2011
```
llvm-svn: 123910
```
32f9b756
My editor's indent went crazy. Fix. · 37c4a8be
Eric Christopher authored Jan 20, 2011
```
llvm-svn: 123909
```
37c4a8be

Expand invalid return values for umulo and smulo. Handle these similarly · 785db078

Eric Christopher authored Jan 20, 2011

to add/sub by doing the normal operation and then checking for overflow
afterwards. This generally relies on the DAG handling the later invalid
operations as well.

Fixes the 64-bit part of rdar://8622122 and rdar://8774702.

llvm-svn: 123908

785db078

Correct itinerary entry for t2MOV_pic_ga_add_pc. · 7af85533
Evan Cheng authored Jan 20, 2011
```
llvm-svn: 123907
```
7af85533

Sorry, several patches in one. · b8b0ad80

Evan Cheng authored Jan 20, 2011

TargetInstrInfo:
Change produceSameValue() to take MachineRegisterInfo as an optional argument.
When in SSA form, targets can use it to make more aggressive equality analysis.

Machine LICM:
1. Eliminate isLoadFromConstantMemory, use MI.isInvariantLoad instead.
2. Fix a bug which prevent CSE of instructions which are not re-materializable.
3. Use improved form of produceSameValue.

ARM:
1. Teach ARM produceSameValue to look pass some PIC labels.
2. Look for operands from different loads of different constant pool entries
   which have same values.
3. Re-implement PIC GA materialization using movw + movt. Combine the pair with
   a "add pc" or "ldr [pc]" to form pseudo instructions. This makes it possible
   to re-materialize the instruction, allow machine LICM to hoist the set of
   instructions out of the loop and make it possible to CSE them. It's a bit
   hacky, but it significantly improve code quality.
4. Some minor bug fixes as well.

With the fixes, using movw + movt to materialize GAs significantly outperform the
load from constantpool method. 186.crafty and 255.vortex improved > 20%, 254.gap
and 176.gcc ~10%.

llvm-svn: 123905

b8b0ad80

Object: Add ELF support. · b60a18de
Michael J. Spencer authored Jan 20, 2011
```
llvm-svn: 123896
```
b60a18de
Object: Add COFF Support. · 8e90adaf
Michael J. Spencer authored Jan 20, 2011
```
llvm-svn: 123895
```
8e90adaf

Selection DAG scheduler register pressure heuristic fixes. · 2cd1f0be

Andrew Trick authored Jan 20, 2011

Added a check for already live regs before claiming HighRegPressure.
Fixed a few cases of checking the wrong number of successors.
Added some tracing until these heuristics are better understood.

llvm-svn: 123892

2cd1f0be

Check that a live range exists before shortening it. This fixes PR8989. · 4060abb4
Jakob Stoklund Olesen authored Jan 20, 2011
```
The live range may have been deleted earlier because of rematerialization.

llvm-svn: 123891
```
4060abb4
Add hidden -verify-coalescing to run the machine code verifier before and after · 145755f1
Jakob Stoklund Olesen authored Jan 20, 2011
```
register coalescing.

llvm-svn: 123890
```
145755f1
Sparc backend: Implements a delay slot filler that attempt to fill delay slots · 058e1247
Venkatraman Govindaraju authored Jan 20, 2011
```
with useful instructions.

llvm-svn: 123884
```
058e1247
Update a comment. · 050eec1d
Cameron Zwarich authored Jan 20, 2011
```
llvm-svn: 123879
```
050eec1d
Fix bug found by new clang warning. · 5acd4a64
Jakob Stoklund Olesen authored Jan 20, 2011
```
llvm-svn: 123872
```
5acd4a64