Commits · aeb17df8027debc3e0f6ca139d493cec56c2b8d6 · Roger Ferrer / llvm-epi-0.8

Dec 12, 2012

LoopVectorizer: When -Os is used, vectorize only loops that dont require a... · aeb17df8

Nadav Rotem authored Dec 12, 2012

LoopVectorizer: When -Os is used, vectorize only loops that dont require a tail loop. There is no testcase because I dont know of a way to initialize the loop vectorizer pass without adding an additional hidden flag. 

llvm-svn: 169950

aeb17df8

Avoid using lossy load / stores for memcpy / memset expansion. e.g. · 04e55187
Evan Cheng authored Dec 12, 2012
```
f64 load / store on non-SSE2 x86 targets.

llvm-svn: 169944
```
04e55187
Have SimplifyBinOp call the new FAdd/FSub/FMul helpers, with fast-math flags off · d2b05e59
Michael Ilseman authored Dec 12, 2012
```
llvm-svn: 169943
```
d2b05e59
- Fix a problematic way in creating all-the-1 APInt. · 81b36785
Shuxin Yang authored Dec 12, 2012
```
- Propagate "exact" bit of [l|a]shr instruction.

llvm-svn: 169942
```
81b36785
Remove redunant optimizations from InstCombine, instead call the appropriate... · d5787be5
Michael Ilseman authored Dec 12, 2012
```
Remove redunant optimizations from InstCombine, instead call the appropriate functions from SimplifyInstruction

llvm-svn: 169941
```
d5787be5

Added a slew of SimplifyInstruction floating-point optimizations, many of... · bb6f691b

Michael Ilseman authored Dec 12, 2012

Added a slew of SimplifyInstruction floating-point optimizations, many of which take advantage of fast-math flags. Test cases included.

  fsub X, +0 ==> X
  fsub X, -0 ==> X, when we know X is not -0
  fsub +/-0.0, (fsub -0.0, X) ==> X
  fsub nsz +/-0.0, (fsub +/-0.0, X) ==> X
  fsub nnan ninf X, X ==> 0.0
  fadd nsz X, 0 ==> X
  fadd [nnan ninf] X, (fsub [nnan ninf] 0, X) ==> 0
    where nnan and ninf have to occur at least once somewhere in this expression
  fmul X, 1.0 ==> X

llvm-svn: 169940

bb6f691b

Trim unneeded header #include. · 647c7027
Jim Grosbach authored Dec 11, 2012
```
llvm-svn: 169933
```
647c7027

ARM: Remove old testing option. · 0ddedcc5

Jim Grosbach authored Dec 11, 2012

Pre-regalloc frame allocation and referencing has been on by default
for ages. No need for the testing option that disables it.

llvm-svn: 169931

0ddedcc5

ARM: Remove old testing options. · 1197889c
Jim Grosbach authored Dec 11, 2012
```
Base pointer referencing has been enabled for ages.

llvm-svn: 169930
```
1197889c

Replace TargetLowering::isIntImmLegal() with · eb54240d

Evan Cheng authored Dec 11, 2012

ScalarTargetTransformInfo::getIntImmCost() instead. "Legal" is a poorly defined
term for something like integer immediate materialization. It is always possible
to materialize an integer immediate. Whether to use it for memcpy expansion is
more a "cost" conceern.

llvm-svn: 169929

eb54240d

Dec 11, 2012

PR14574. Fix a bug in the code that calculates the mask the converted PHIs in if-conversion. · f707bf4c
Nadav Rotem authored Dec 11, 2012
```
llvm-svn: 169916
```
f707bf4c

Add R600 backend · 75aadc28

Tom Stellard authored Dec 11, 2012

A new backend supporting AMD GPUs: Radeon HD2XXX - HD7XXX

llvm-svn: 169915

75aadc28

This patch implements the general dynamic TLS model for 64-bit PowerPC. · c56f1d34

Bill Schmidt authored Dec 11, 2012

Given a thread-local symbol x with global-dynamic access, the generated
code to obtain x's address is:

     Instruction                            Relocation            Symbol
  addis ra,r2,x@got@tlsgd@ha           R_PPC64_GOT_TLSGD16_HA       x
  addi  r3,ra,x@got@tlsgd@l            R_PPC64_GOT_TLSGD16_L        x
  bl __tls_get_addr(x@tlsgd)           R_PPC64_TLSGD                x
                                       R_PPC64_REL24           __tls_get_addr
  nop
  <use address in r3>

The implementation borrows from the medium code model work for introducing
special forms of ADDIS and ADDI into the DAG representation.  This is made
slightly more complicated by having to introduce a call to the external
function __tls_get_addr.  Using the full call machinery is overkill and,
more importantly, makes it difficult to add a special relocation.  So I've
introduced another opcode GET_TLS_ADDR to represent the function call, and
surrounded it with register copies to set up the parameter and return value.

Most of the code is pretty straightforward.  I ran into one peculiarity
when I introduced a new PPC opcode BL8_NOP_ELF_TLSGD, which is just like
BL8_NOP_ELF except that it takes another parameter to represent the symbol
("x" above) that requires a relocation on the call.  Something in the 
TblGen machinery causes BL8_NOP_ELF and BL8_NOP_ELF_TLSGD to be treated
identically during the emit phase, so this second operand was never
visited to generate relocations.  This is the reason for the slightly
messy workaround in PPCMCCodeEmitter.cpp:getDirectBrEncoding().

Two new tests are included to demonstrate correct external assembly and
correct generation of relocations using the integrated assembler.

Comments welcome!

Thanks,
Bill

llvm-svn: 169910

c56f1d34

Update some comments. · d692c1db
Eric Christopher authored Dec 11, 2012
```
llvm-svn: 169907
```
d692c1db

Loop Vectorize: optimize the vectorization of trunc(induction_var). The... · e266efb7

Nadav Rotem authored Dec 11, 2012

Loop Vectorize: optimize the vectorization of trunc(induction_var). The truncation is now done on scalars.

llvm-svn: 169904

e266efb7

Remove the RelaxAll overrule in MCAssembler::fixupNeedsRelaxation, · 0f74f173

Eli Bendersky authored Dec 11, 2012

because that method is only getting called for MCInstFragment. These
fragments aren't even generated when RelaxAll is set, which is why the
flag reference here is superfluous. Removing it simplifies the code
with no harmful effects.

An assertion is added higher up to make sure this path is never
reached.

llvm-svn: 169886

0f74f173

Use an ArrayRef instead of a std::vector&. · a92da5b3
Rafael Espindola authored Dec 11, 2012
```
llvm-svn: 169881
```
a92da5b3
Add comment for load folding · 24e440d0
Joel Jones authored Dec 11, 2012
```
llvm-svn: 169880
```
24e440d0

[msan] Use explicitely aligned stores and loads with function argument shadow. · d2bd319a

Evgeniy Stepanov authored Dec 11, 2012

Use explicitely aligned store and load instructions to deal with argument and
retval shadow. This matters when an argument's alignment is higher than
__msan_param_tls alignment (which is the case with __m128i).

llvm-svn: 169859

d2bd319a

Revert EVT->MVT changes, r169836-169851, due to buildbot failures. · e98b7a03
Patrik Hagglund authored Dec 11, 2012
```
llvm-svn: 169854
```
e98b7a03

Holding my nose and moving the accumulation routine to GEPOperator · 7ec41c78

Chandler Carruth authored Dec 11, 2012

instead of the instruction. I've left a forwarding wrapper for the
instruction so users with the instruction don't need to create
a GEPOperator themselves.

This lets us remove the copy of this code in instsimplify.

I've looked at most of the other copies of similar code, and this is the
only one I've found that is actually exactly the same. The one in
InlineCost is very close, but it requires re-mapping non-constant
indices through the cost analysis value simplification map. I could add
direct support for this to the generic routine, but it seems overly
specific.

llvm-svn: 169853

7ec41c78

Hoist the GEP constant address offset computation to a common home on · 1e14053d

Chandler Carruth authored Dec 11, 2012

the GEP instruction class.

This is part of the continued refactoring and cleaning of the
infrastructure used by SROA. This particular operation is also done in
a few other places which I'll try to refactor to share this
implementation.

llvm-svn: 169852

1e14053d

Change RegVT in BitTestBlock and RegsForValue, to contain MVTs, · b31465b0
Patrik Hagglund authored Dec 11, 2012
```
instead of EVTs.

llvm-svn: 169851
```
b31465b0
Change TargetLowering::getTypeForExtArgOrReturn to take and return · ad432a8e
Patrik Hagglund authored Dec 11, 2012
```
MVTs, instead of EVTs.

Accordingly, add bitsLT (and similar) to MVT.

llvm-svn: 169850
```
ad432a8e
Change a parameter of TargetLowering::getVectorTypeBreakdown to MVT, · d3433749
Patrik Hagglund authored Dec 11, 2012
```
from EVT.

llvm-svn: 169849
```
d3433749
Change TargetLowering::RegisterTypeForVT to contain MVTs, instead of · 03e9628c
Patrik Hagglund authored Dec 11, 2012
```
EVTs.

llvm-svn: 169848
```
03e9628c
Change TargetLowering::TransformToType to contain MVTs, instead of · c50489e2
Patrik Hagglund authored Dec 11, 2012
```
EVTs.

llvm-svn: 169847
```
c50489e2
Change TargetLowering::findRepresentativeClass to take an MVT, instead · 8d2e7cf5
Patrik Hagglund authored Dec 11, 2012
```
of EVT.

llvm-svn: 169845
```
8d2e7cf5
Change TargetLowering::getTypeToPromoteTo to take and return MVTs, · ffb60f7c
Patrik Hagglund authored Dec 11, 2012
```
instead of EVTs.

llvm-svn: 169844
```
ffb60f7c
Change TargetLowering::isCondCodeLegal to take an MVT, instead of EVT. · a9702811
Patrik Hagglund authored Dec 11, 2012
```
llvm-svn: 169843
```
a9702811
Change TargetLowering::getCondCodeAction to take an MVT, instead of · e3bec636
Patrik Hagglund authored Dec 11, 2012
```
EVT.

llvm-svn: 169842
```
e3bec636
Change TargetLowering::getTruncStoreAction to take MVTs, instead of EVTs. · 7ffcd226
Patrik Hagglund authored Dec 11, 2012
```
llvm-svn: 169841
```
7ffcd226
Change TargetLowering::getLoadExtAction to take an MVT, instead of EVT. · cbc9d4d0
Patrik Hagglund authored Dec 11, 2012
```
llvm-svn: 169840
```
cbc9d4d0
Change TargetLowering::setTypeAction to take an MVT, instead fo EVT. · 40e1afe9
Patrik Hagglund authored Dec 11, 2012
```
llvm-svn: 169839
```
40e1afe9
Change TargetLowering::getRepRegClassFor to take an MVT, instead of · 57b1694d
Patrik Hagglund authored Dec 11, 2012
```
EVT.

Accordingly, change RegDefIter to contain MVTs instead of EVTs.

llvm-svn: 169838
```
57b1694d

Change TargetLowering::getRegClassFor to take an MVT, instead of EVT. · 3708e548

Patrik Hagglund authored Dec 11, 2012

Accordingly, add helper funtions getSimpleValueType (in parallel to
getValueType) in SDValue, SDNode, and TargetLowering.

This is the first, in a series of patches.

llvm-svn: 169837

3708e548

[CMake] Remove dependencies to intrinsics_gen I introduced in r169724. · 99feb75c
NAKAMURA Takumi authored Dec 11, 2012
```
llvm-svn: 169819
```
99feb75c
Use multiclass for new-value store instructions with MEMri operand. · 92e71918
Jyotsna Verma authored Dec 11, 2012
```
llvm-svn: 169814
```
92e71918
Fix PR14565. Don't if-convert loops that have switch statements in them. · dbb33281
Nadav Rotem authored Dec 11, 2012
```
llvm-svn: 169813
```
dbb33281
Stylistic tweak. · c2bd620f
Evan Cheng authored Dec 11, 2012
```
llvm-svn: 169811
```
c2bd620f