- Apr 20, 2011
  - Rafael Espindola authored (llvm-svn: 129844)
  - Daniel Dunbar authored: … triple component. (llvm-svn: 129838)
  - Johnny Chen authored (llvm-svn: 129837)
- Apr 19, 2011
  - Daniel Dunbar authored: … predicates. (llvm-svn: 129816)
  - Daniel Dunbar authored (llvm-svn: 129813)
  - Daniel Dunbar authored (llvm-svn: 129812)
  - Daniel Dunbar authored (llvm-svn: 129811)
  - Daniel Dunbar authored (llvm-svn: 129810)
  - Daniel Dunbar authored (llvm-svn: 129809)
  - Daniel Dunbar authored (llvm-svn: 129803)
  - Eric Christopher authored (llvm-svn: 129781)
  - Bob Wilson authored (rdar://8659675):
    Making use of VFP / NEON floating-point multiply-accumulate / subtract is
    difficult on current ARM implementations for a few reasons:
    1. Even though a single vmla has a latency that is one cycle shorter than a
       pair of vmul + vadd, a RAW hazard during the first few cycles (4? on
       Cortex-A8) can cause an additional pipeline stall, so it is frequently
       better to simply codegen vmul + vadd.
    2. A vmla followed by a vmul, vmadd, or vsub causes the second fp
       instruction to stall for 4 cycles; we need to schedule them apart.
    3. A vmla followed by a vmla is a special case. Obviously, issuing
       back-to-back RAW vmla + vmla is very bad, but this isn't ideal either:
           vmul
           vadd
           vmla
       Instead, we want to expand the second vmla:
           vmla
           vmul
           vadd
       Even with the 4-cycle vmul stall, the second sequence is still 2 cycles
       faster.
    Up to now, isel has simply avoided codegen'ing fp vmla / vmls. This works
    well enough, but it isn't the optimal solution. This patch attempts to make
    it possible to use vmla / vmls in the cases where it is profitable:
    A. Add the missing isel predicates that cause vmla to be codegen'ed.
    B. Make sure the fmul in (fadd (fmul)) has a single use; we don't want to
       compute both an fmul and an fmla.
    C. Add additional isel checks for vmla, avoiding cases where vmla feeds
       into fp instructions (except for the exceptional case in #3).
    D. Add an ARM hazard recognizer to model the vmla / vmls hazards.
    E. Add a special pre-regalloc case to expand vmla / vmls when it is likely
       they will trigger one of the special hazards.
    Enable these fp vmlx codegen changes for Cortex-A9. (llvm-svn: 129775)
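    A minimal C++ sketch of the single-use check described in item B, using
    SelectionDAG-style APIs; an illustration under those assumptions, not the
    code from the patch:

        #include "llvm/CodeGen/SelectionDAGNodes.h"
        using namespace llvm;

        // Only consider turning (fadd x, (fmul a, b)) into a vmla when the
        // fmul has no other users; otherwise we would end up emitting both a
        // vmul (for the other users) and a vmla.
        static bool isMulCandidateForVMLA(SDValue FMul) {
          return FMul.getOpcode() == ISD::FMUL && FMul.getNode()->hasOneUse();
        }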
  - Bob Wilson authored: … (and add a false dependency) when it isn't dependent
    on the last CPSR-defining instruction. (rdar://8928208, llvm-svn: 129773)
  - Bob Wilson authored: Add an avoidWriteAfterWrite() target hook to identify
    register classes that suffer from write-after-write hazards. For those
    register classes, try to avoid writing the same register in two consecutive
    instructions. This is currently disabled by default; we should not spill to
    avoid hazards! The command-line flag -avoid-waw-hazard can be used to
    enable WAW avoidance. (llvm-svn: 129772)
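    A hedged sketch of the kind of check a scheduler could make with such a
    hook; the helper and its parameters are illustrative, not the interface
    from the patch:

        #include <set>

        // Returns true if an instruction writing DestReg should not be issued
        // immediately after an instruction that also wrote DestReg, given
        // that the target flagged DestReg's class as WAW-hazardous.
        static bool shouldDelayForWAW(bool ClassHasWAWHazard, unsigned DestReg,
                                      const std::set<unsigned> &PrevWrites) {
          return ClassHasWAWHazard && PrevWrites.count(DestReg) != 0;
        }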
  - Bob Wilson authored: … pipelines, at least on Cortex-A9. (llvm-svn: 129771)
  - Bob Wilson authored (llvm-svn: 129770)
  - Eli Friedman authored (llvm-svn: 129765)
  - Chris Lattner authored: … en masse for C++ PODs. On my C++ test file, this
    cuts the fast isel rejects by 10x and shrinks the generated .s file by 5%.
    (llvm-svn: 129755)
  - Chris Lattner authored (llvm-svn: 129753)
  - Chris Lattner authored: … when they are a truncate from something else.
    This eliminates fully half of all the fastisel rejections on a test C++
    file I'm working with, which should make a substantial improvement for -O0
    compiles of C++ code. This fixes rdar://9297003 - fast isel bails out on
    all functions taking bools. (llvm-svn: 129752)
  - Chris Lattner authored: … Before, we would bail out on i1 arguments
    altogether; now we just bail on non-constant ones. Also, we used to emit
    extraneous code. For example, test12 was:
        movb $0, %al
        movzbl %al, %edi
        callq _test12
    and test13 was:
        movb $0, %al
        xorl %edi, %edi
        movb %al, 7(%rsp)
        callq _test13f
    Now we get:
        movl $0, %edi
        callq _test12
    and:
        movl $0, %edi
        callq _test13f
    (llvm-svn: 129751)
  - Chris Lattner authored: …
            testb $1, %al
            je LBB0_2
        ## BB#1:                ## %if.then
            movb $0, %al
    instead of:
            testb $1, %al
            jne LBB0_1
            jmp LBB0_2
        LBB0_1:                 ## %if.then
            movb $0, %al
    How 'bout that. (llvm-svn: 129749)
  - Chris Lattner authored (rdar://9297006): … a common cause of fast isel
    rejects on C++ code. (llvm-svn: 129748)
  - Evan Cheng authored: … is, it assumes addresses are 64-bit aligned (which
    should be the more common case). If the address is found not to be aligned,
    getOperandLatency() adjusts the operand latency computation by one to
    compensate for it. (rdar://9294833, llvm-svn: 129742)
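    A minimal sketch of the adjustment described above; the helper name and
    parameters are illustrative only:

        // Assume 64-bit alignment by default; charge one extra cycle of
        // operand latency when the address turns out not to be aligned.
        static unsigned adjustLoadLatency(unsigned BaseLatency,
                                          bool Is64BitAligned) {
          return Is64BitAligned ? BaseLatency : BaseLatency + 1;
        }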
  - Evan Cheng authored (llvm-svn: 129738)
- Apr 18, 2011
  - Jim Grosbach authored (llvm-svn: 129723)
  - Eric Christopher authored: … true on success and false on failure. Update
    callers. (llvm-svn: 129722)
  - Sean Callanan authored: … superclass variable is instantiated properly.
    (llvm-svn: 129713)
  - Chris Lattner authored: … the generated FastISel. X86 doesn't need to
    generate code to match ADD16ri8, since ADD16ri will do just fine. This is a
    small codesize win in the generated instruction selector. (llvm-svn: 129692)
  - Chris Lattner authored: … simplifying them and exposing more information to
    tblgen. It would be nice if other target authors adopted this as well,
    particularly ARM, since it has fastisel. (llvm-svn: 129676)
  - Chris Lattner authored: … kind of predicate: one that is specific to imm
    nodes. The predicate function specified here just checks an int64_t
    directly instead of messing around with SDNodes. The virtue of this is that
    fastisel and other things can reason about these predicates.
    (llvm-svn: 129675)
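    A hedged C++ illustration of the difference described here; the function
    name and its constraint are hypothetical, not tblgen output:

        #include <cstdint>

        // An imm-specific predicate is a plain function of the immediate
        // value, so FastISel can evaluate it on an integer it already has in
        // hand, without materializing an SDNode first.
        static bool isSExt8Imm(int64_t Imm) {
          return Imm >= -128 && Imm <= 127;
        }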
- Apr 17, 2011
  - Chris Lattner authored: … structure and fix some FIXMEs. We now have a
    TreePredicateFn class that handles all of the decoding of these things.
    This is an internal cleanup that has no impact on the code generated by
    tblgen. (llvm-svn: 129670)
  - Chris Lattner authored: …
    2. Implement rdar://9289501 - fast isel should fold trivial multiplies to
       shifts.
    3. Teach tblgen to handle shift immediates that are different sizes than
       the shifted operands, eliminating some code from the X86 fast isel
       backend.
    4. Have FastISel::SelectBinaryOp use the (poorly named) FastEmit_ri_
       function instead of FastEmit_ri to simplify code. (llvm-svn: 129666)
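    A minimal sketch of the fold in item 2, assuming the power-of-two helpers
    from llvm/Support/MathExtras.h; not the actual FastISel code:

        #include "llvm/Support/MathExtras.h"
        #include <cstdint>

        // If multiplying by C can be replaced by a left shift, return the
        // shift amount; otherwise return -1.
        static int getShiftForMul(uint64_t C) {
          return llvm::isPowerOf2_64(C) ? (int)llvm::Log2_64(C) : -1;
        }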
  - Chris Lattner authored: … when we have a global variable base and an index.
    Instead, just give up on folding the global variable. Before, we'd
    generate:
        _test:                  ## @test
        ## BB#0:
            movq _rtx_length@GOTPCREL(%rip), %rax
            leaq (%rax), %rax
            addq %rdi, %rax
            movzbl (%rax), %eax
            ret
    Now we generate:
        _test:                  ## @test
        ## BB#0:
            movq _rtx_length@GOTPCREL(%rip), %rax
            movzbl (%rax,%rdi), %eax
            ret
    The difference is even more significant when there is a scale involved.
    This fixes rdar://9289558 - total fail with addr mode formation at
    -O0/x86-64. (llvm-svn: 129664)
  - Chris Lattner authored: … less trivial things) into a dummy lea. Before, we
    generated:
        _test:                  ## @test
            movq _G@GOTPCREL(%rip), %rax
            leaq (%rax), %rax
            ret
    Now we produce:
        _test:                  ## @test
            movq _G@GOTPCREL(%rip), %rax
            ret
    This is part of rdar://9289558. (llvm-svn: 129662)
  - Chris Lattner authored (llvm-svn: 129661)
  - Eli Friedman authored (llvm-svn: 129654)
- Apr 16, 2011
  - Francois Pichet authored (llvm-svn: 129640)
  - Rafael Espindola authored: … error in foo.o; no .eh_frame_hdr table will be
    created. (llvm-svn: 129635)