Commits · f60a9279ea0497bd371e23501bcedb7c4cd02330 · Roger Ferrer / llvm-epi-0.8

Dec 12, 2012

Initial implementation of a utility for converting native data · f60a9279

Nick Kledzik authored Dec 12, 2012

structures to and from YAML using traits.  The first client will
be the test suite of lld.  The documentation will show up at:

   http://llvm.org/docs/YamlIO.html

llvm-svn: 170019

f60a9279

Fix a logic bug in inline expansion of memcpy / memset with an overlapping · b7d3d03b

Evan Cheng authored Dec 12, 2012

load / store pair. It's not legal to use a wider load than the size of
the remaining bytes if it's the first pair of load / store.

llvm-svn: 170018

b7d3d03b
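
The rule in the commit above can be illustrated with a small planning routine. This is only a sketch under stated assumptions, not the SelectionDAG code: `plan_copy` and its `widths` parameter are hypothetical names, and the model merely picks (offset, width) pairs for a copy of `size` bytes. An overlapping access, one that backs up into bytes that were already copied, is chosen only once an earlier load / store pair exists.

```python
def plan_copy(size, widths=(8, 4, 2, 1)):
    """Return (offset, width) pairs that together cover `size` bytes.

    Hypothetical model: `widths` lists the access sizes the target supports.
    """
    ops = []
    offset = 0
    while offset < size:
        remaining = size - offset
        # Widths that would finish the copy with one access without
        # reaching outside the buffer.
        covering = [w for w in widths if remaining <= w <= size]
        # The `ops` check mirrors the rule from the commit message: the
        # first pair is never an overlapping (wider than remaining) access.
        if ops and covering:
            w = min(covering)
            ops.append((size - w, w))  # back up so the access ends at `size`
            break
        # Otherwise use the widest access that fits in the remaining bytes.
        w = max(w for w in widths if w <= remaining)
        ops.append((offset, w))
        offset += w
    return ops
```

For a 7-byte copy this model yields a 4-byte access at offset 0 followed by an overlapping 4-byte access at offset 3, rather than a single 8-byte access that would read past the end.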

[mips] Fix a memory leak bug reported by NAKAMURA Takumi. · 7bc144c3
Akira Hatanaka authored Dec 12, 2012
```
llvm-svn: 170012
```
7bc144c3
Make naming consistent, add comments and sanity asserts · e11ab3aa
Eli Bendersky authored Dec 12, 2012
```
llvm-svn: 170007
```
e11ab3aa
Fix indentation. · 6027bdf8
Nadav Rotem authored Dec 12, 2012
```
llvm-svn: 170005
```
6027bdf8
LoopVectorizer: Use the "optsize" attribute to decide if we are allowed to... · d0bb22bb
Nadav Rotem authored Dec 12, 2012
```
LoopVectorizer: Use the "optsize" attribute to decide if we are allowed to increase the function size.

llvm-svn: 170004
```
d0bb22bb

This patch implements local-dynamic TLS model support for the 64-bit · 24b8dd6e

Bill Schmidt authored Dec 12, 2012

PowerPC target.  This is the last of the four models, so we now have 
full TLS support.

This is mostly a straightforward extension of the general dynamic model.
I had to use an additional Chain operand to tie ADDIS_DTPREL_HA to the
register copy following ADDI_TLSLD_L; otherwise everything above the
ADDIS_DTPREL_HA appeared dead and was removed.

As before, there are new test cases to test the assembly generation, and
the relocations output during integrated assembly.  The expected code
gen sequence can be read in test/CodeGen/PowerPC/tls-ld.ll.

There are a couple of things I think can be done more efficiently in the
overall TLS code, so there will likely be a clean-up patch forthcoming;
but for now I want to be sure the functionality is in place.

Bill

llvm-svn: 170003

24b8dd6e

Kerning. · 623d8ceb
Bill Wendling authored Dec 12, 2012
```
llvm-svn: 170002
```
623d8ceb

The TargetData is not used for the isPowerOfTwo determination. It has never · e4023806

Rafael Espindola authored Dec 12, 2012

been used in the first place.  It simply was passed to the function and to the
recursive invocations.  Simply drop the parameter and update the callers for the
new signature.

Patch by Saleem Abdulrasool!

llvm-svn: 169988

e4023806

Improve debug info generated with enabled AddressSanitizer. · 3d43b63a

Alexey Samsonov authored Dec 12, 2012

When ASan replaces <alloca instruction> with
<offset into a common large alloca>, it should also patch
llvm.dbg.declare calls and replace debug info descriptors to mark
that we've replaced alloca with a value that stores an address
of the user variable, not the user variable itself.

See PR11818 for more context.

llvm-svn: 169984

3d43b63a

Add ARM NONE and PREL31 relocation types. · 4dd14fb5

Logan Chien authored Dec 12, 2012

Add R_ARM_NONE and R_ARM_PREL31 relocation types
to MCExpr.  Both of them will be used while
generating .ARM.extab and .ARM.exidx sections.

llvm-svn: 169965

4dd14fb5

Remove some dead code. · 07cc8487
Rafael Espindola authored Dec 12, 2012
```
llvm-svn: 169963
```
07cc8487
[CMake] Fixup R600. · 85292a13
NAKAMURA Takumi authored Dec 12, 2012
```
llvm-svn: 169962
```
85292a13

Sorry about the churn. One more change to getOptimalMemOpType() hook. Did I · 962711ee

Evan Cheng authored Dec 12, 2012

mention the inline memcpy / memset expansion code is a mess?

This patch splits the ZeroOrLdSrc argument into two: IsMemset and ZeroMemset.
The first indicates whether it is expanding a memset or a memcpy / memmove.
The latter indicates whether the memset is a memset of zero. It's totally possible
(likely even) that targets may want to do different things for memcpy and
memset of zero.

llvm-svn: 169959

962711ee

Fix the ascii drawing that was ruined when I split the H and CPP · 6798a04b
Nadav Rotem authored Dec 12, 2012
```
llvm-svn: 169955
```
6798a04b

- Rename isLegalMemOpType to isSafeMemOpType. "Legal" is a very overloaded term. · c3d1aca6

Evan Cheng authored Dec 12, 2012

Also added more comments to explain why it is generally ok to return true.
- Rename getOptimalMemOpType argument IsZeroVal to ZeroOrLdSrc. It's meant to
be true for loaded source (memcpy) or zero constants (memset). The poor name
choice is probably some kind of legacy issue.

llvm-svn: 169954

c3d1aca6

fix a typo. · 4fa2e3d5
Nadav Rotem authored Dec 12, 2012
```
llvm-svn: 169953
```
4fa2e3d5
DAGCombine: clamp hi bit in APInt::getBitsSet to avoid assertion · 82751a10
Manman Ren authored Dec 12, 2012
```
rdar://12838504

llvm-svn: 169951
```
82751a10
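
As a rough illustration of the clamp described above, here is a Python model of a bit-range helper. It is a sketch under assumptions, not the APInt implementation: `get_bits_set` is a hypothetical stand-in that sets the half-open bit range [lo_bit, hi_bit) and asserts that the range fits in the given width, and `clamped_bits_set` is likewise a made-up caller-side wrapper.

```python
def get_bits_set(num_bits, lo_bit, hi_bit):
    """Hypothetical model: set bits lo_bit..hi_bit-1 in a num_bits-wide value."""
    assert lo_bit <= hi_bit <= num_bits, "bit range exceeds the value width"
    return ((1 << hi_bit) - 1) & ~((1 << lo_bit) - 1)


def clamped_bits_set(num_bits, lo_bit, hi_bit):
    """Clamp the high bit to the width before calling the asserting helper."""
    return get_bits_set(num_bits, lo_bit, min(hi_bit, num_bits))
```

With an 8-bit width, asking for bits 2 up to 10 would trip the assertion in the plain helper, while the clamped call returns the mask for bits 2 through 7.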

LoopVectorizer: When -Os is used, vectorize only loops that don't require a... · aeb17df8

Nadav Rotem authored Dec 12, 2012

LoopVectorizer: When -Os is used, vectorize only loops that don't require a tail loop. There is no testcase because I don't know of a way to initialize the loop vectorizer pass without adding an additional hidden flag.

llvm-svn: 169950

aeb17df8
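
The decision described above can be sketched as a small predicate. This is a simplified model under stated assumptions, not the LoopVectorizer logic: `should_vectorize` is a hypothetical name, a tail loop is assumed to be needed whenever the trip count is unknown or not a multiple of the vectorization factor `vf`, and the cost model is ignored entirely.

```python
def should_vectorize(trip_count, vf, opt_for_size):
    """Return True if the loop may be vectorized in this simplified model.

    trip_count is an int, or None when it is not known at compile time.
    """
    if not opt_for_size:
        return True  # size is not a concern, so a tail loop is acceptable
    if trip_count is None:
        return False  # unknown trip count: a scalar tail loop would be needed
    # Under -Os, accept only loops whose iterations divide evenly into vectors.
    return trip_count % vf == 0
```

A loop with 16 iterations and a factor of 4 is accepted under -Os in this model, while one with 18 iterations is left scalar because the two leftover iterations would need a tail loop that increases code size.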

Avoid using lossy load / stores for memcpy / memset expansion. e.g. · 04e55187
Evan Cheng authored Dec 12, 2012
```
f64 load / store on non-SSE2 x86 targets.

llvm-svn: 169944
```
04e55187
Have SimplifyBinOp call the new FAdd/FSub/FMul helpers, with fast-math flags off · d2b05e59
Michael Ilseman authored Dec 12, 2012
```
llvm-svn: 169943
```
d2b05e59
- Fix a problematic way in creating all-the-1 APInt. · 81b36785
Shuxin Yang authored Dec 12, 2012
```
- Propagate "exact" bit of [l|a]shr instruction.

llvm-svn: 169942
```
81b36785
Remove redundant optimizations from InstCombine, instead call the appropriate... · d5787be5
Michael Ilseman authored Dec 12, 2012
```
Remove redundant optimizations from InstCombine, instead call the appropriate functions from SimplifyInstruction

llvm-svn: 169941
```
d5787be5

Added a slew of SimplifyInstruction floating-point optimizations, many of... · bb6f691b

Michael Ilseman authored Dec 12, 2012

Added a slew of SimplifyInstruction floating-point optimizations, many of which take advantage of fast-math flags. Test cases included.

  fsub X, +0 ==> X
  fsub X, -0 ==> X, when we know X is not -0
  fsub +/-0.0, (fsub -0.0, X) ==> X
  fsub nsz +/-0.0, (fsub +/-0.0, X) ==> X
  fsub nnan ninf X, X ==> 0.0
  fadd nsz X, 0 ==> X
  fadd [nnan ninf] X, (fsub [nnan ninf] 0, X) ==> 0
    where nnan and ninf have to occur at least once somewhere in this expression
  fmul X, 1.0 ==> X

llvm-svn: 169940

bb6f691b
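
Several of the conditions in the list above follow from IEEE 754 behavior for signed zeros, NaN and infinity, which can be checked with ordinary Python floats. The snippet is only an illustration of that arithmetic, not of the LLVM code; `same_float` and the result names are made up for this sketch.

```python
import math


def same_float(a, b):
    """Equality that distinguishes +0.0 from -0.0 and treats NaN as equal to NaN."""
    if math.isnan(a) or math.isnan(b):
        return math.isnan(a) and math.isnan(b)
    return a == b and math.copysign(1.0, a) == math.copysign(1.0, b)


NEG_ZERO = -0.0
INF = float("inf")
NAN = float("nan")
SAMPLES = [0.0, NEG_ZERO, 1.5, -2.0, INF, -INF, NAN]

# fsub X, +0 ==> X is safe for every sample, including X = -0.0.
fsub_pos_zero_ok = all(same_float(x - 0.0, x) for x in SAMPLES)

# fsub X, -0 ==> X fails for X = -0.0, because (-0.0) - (-0.0) is +0.0.
fsub_neg_zero_breaks = not same_float(NEG_ZERO - NEG_ZERO, NEG_ZERO)

# fadd X, 0 ==> X needs nsz, because (-0.0) + 0.0 is +0.0.
fadd_zero_needs_nsz = not same_float(NEG_ZERO + 0.0, NEG_ZERO)

# fsub X, X ==> 0.0 needs nnan and ninf: both cases below produce NaN.
fsub_self_needs_flags = math.isnan(INF - INF) and math.isnan(NAN - NAN)

# fmul X, 1.0 ==> X is safe for every sample.
fmul_one_ok = all(same_float(x * 1.0, x) for x in SAMPLES)
```

All five names evaluate to True, which matches the flags attached to the corresponding rules in the commit message.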

Trim unneeded header #include. · 647c7027
Jim Grosbach authored Dec 11, 2012
```
llvm-svn: 169933
```
647c7027

ARM: Remove old testing option. · 0ddedcc5

Jim Grosbach authored Dec 11, 2012

Pre-regalloc frame allocation and referencing has been on by default
for ages. No need for the testing option that disables it.

llvm-svn: 169931

0ddedcc5

ARM: Remove old testing options. · 1197889c
Jim Grosbach authored Dec 11, 2012
```
Base pointer referencing has been enabled for ages.

llvm-svn: 169930
```
1197889c

Replace TargetLowering::isIntImmLegal() with · eb54240d

Evan Cheng authored Dec 11, 2012

ScalarTargetTransformInfo::getIntImmCost() instead. "Legal" is a poorly defined
term for something like integer immediate materialization. It is always possible
to materialize an integer immediate. Whether to use it for memcpy expansion is
more a "cost" concern.

llvm-svn: 169929

eb54240d

Dec 11, 2012

PR14574. Fix a bug in the code that calculates the mask of the converted PHIs in if-conversion. · f707bf4c
Nadav Rotem authored Dec 11, 2012
```
llvm-svn: 169916
```
f707bf4c

Add R600 backend · 75aadc28

Tom Stellard authored Dec 11, 2012

A new backend supporting AMD GPUs: Radeon HD2XXX - HD7XXX

llvm-svn: 169915

75aadc28

This patch implements the general dynamic TLS model for 64-bit PowerPC. · c56f1d34

Bill Schmidt authored Dec 11, 2012

Given a thread-local symbol x with global-dynamic access, the generated
code to obtain x's address is:

     Instruction                            Relocation            Symbol
  addis ra,r2,x@got@tlsgd@ha           R_PPC64_GOT_TLSGD16_HA       x
  addi  r3,ra,x@got@tlsgd@l            R_PPC64_GOT_TLSGD16_L        x
  bl __tls_get_addr(x@tlsgd)           R_PPC64_TLSGD                x
                                       R_PPC64_REL24           __tls_get_addr
  nop
  <use address in r3>

The implementation borrows from the medium code model work for introducing
special forms of ADDIS and ADDI into the DAG representation.  This is made
slightly more complicated by having to introduce a call to the external
function __tls_get_addr.  Using the full call machinery is overkill and,
more importantly, makes it difficult to add a special relocation.  So I've
introduced another opcode GET_TLS_ADDR to represent the function call, and
surrounded it with register copies to set up the parameter and return value.

Most of the code is pretty straightforward.  I ran into one peculiarity
when I introduced a new PPC opcode BL8_NOP_ELF_TLSGD, which is just like
BL8_NOP_ELF except that it takes another parameter to represent the symbol
("x" above) that requires a relocation on the call.  Something in the 
TblGen machinery causes BL8_NOP_ELF and BL8_NOP_ELF_TLSGD to be treated
identically during the emit phase, so this second operand was never
visited to generate relocations.  This is the reason for the slightly
messy workaround in PPCMCCodeEmitter.cpp:getDirectBrEncoding().

Two new tests are included to demonstrate correct external assembly and
correct generation of relocations using the integrated assembler.

Comments welcome!

Thanks,
Bill

llvm-svn: 169910

c56f1d34

Update some comments. · d692c1db
Eric Christopher authored Dec 11, 2012
```
llvm-svn: 169907
```
d692c1db

Loop Vectorize: optimize the vectorization of trunc(induction_var). The... · e266efb7

Nadav Rotem authored Dec 11, 2012

Loop Vectorize: optimize the vectorization of trunc(induction_var). The truncation is now done on scalars.

llvm-svn: 169904

e266efb7
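
Why the truncation can move to the scalar side follows from modular arithmetic: truncating after adding the lane index gives the same lanes as truncating the scalar first and adding in the narrow type. The sketch below models this with plain integers. It assumes a unit-stride induction variable and is not the vectorizer code; all function names are hypothetical.

```python
def trunc(value, bits):
    """Keep only the low `bits` bits, like an integer truncation."""
    return value & ((1 << bits) - 1)


def widen_then_trunc(i, vf, bits):
    """Build the wide induction vector first, then truncate every lane."""
    return [trunc(i + lane, bits) for lane in range(vf)]


def trunc_then_widen(i, vf, bits):
    """Truncate the scalar once, then build the vector with narrow wrapping adds."""
    base = trunc(i, bits)
    return [trunc(base + lane, bits) for lane in range(vf)]
```

Both functions return the same lanes for any starting value, so the per-lane truncation of a wide vector can be replaced by a single scalar truncation.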

Remove the RelaxAll overrule in MCAssembler::fixupNeedsRelaxation, · 0f74f173

Eli Bendersky authored Dec 11, 2012

because that method is only getting called for MCInstFragment. These
fragments aren't even generated when RelaxAll is set, which is why the
flag reference here is superfluous. Removing it simplifies the code
with no harmful effects.

An assertion is added higher up to make sure this path is never
reached.

llvm-svn: 169886

0f74f173

Use an ArrayRef instead of a std::vector&. · a92da5b3
Rafael Espindola authored Dec 11, 2012
```
llvm-svn: 169881
```
a92da5b3
Add comment for load folding · 24e440d0
Joel Jones authored Dec 11, 2012
```
llvm-svn: 169880
```
24e440d0

[msan] Use explicitly aligned stores and loads with function argument shadow. · d2bd319a

Evgeniy Stepanov authored Dec 11, 2012

Use explicitly aligned store and load instructions to deal with argument and
retval shadow. This matters when an argument's alignment is higher than
__msan_param_tls alignment (which is the case with __m128i).

llvm-svn: 169859

d2bd319a

Revert EVT->MVT changes, r169836-169851, due to buildbot failures. · e98b7a03
Patrik Hagglund authored Dec 11, 2012
```
llvm-svn: 169854
```
e98b7a03

Holding my nose and moving the accumulation routine to GEPOperator · 7ec41c78

Chandler Carruth authored Dec 11, 2012

instead of the instruction. I've left a forwarding wrapper for the
instruction so users with the instruction don't need to create
a GEPOperator themselves.

This lets us remove the copy of this code in instsimplify.

I've looked at most of the other copies of similar code, and this is the
only one I've found that is actually exactly the same. The one in
InlineCost is very close, but it requires re-mapping non-constant
indices through the cost analysis value simplification map. I could add
direct support for this to the generic routine, but it seems overly
specific.

llvm-svn: 169853

7ec41c78

Hoist the GEP constant address offset computation to a common home on · 1e14053d

Chandler Carruth authored Dec 11, 2012

the GEP instruction class.

This is part of the continued refactoring and cleaning of the
infrastructure used by SROA. This particular operation is also done in
a few other places which I'll try to refactor to share this
implementation.

llvm-svn: 169852

1e14053d
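
The kind of computation being hoisted in the two commits above, accumulating a constant byte offset from GEP indices, can be sketched roughly as follows. This is a toy model under stated assumptions, not the LLVM routine: types are hypothetical tuples with explicit sizes and field offsets instead of a real data layout, and any index that is not a Python int stands for a non-constant operand.

```python
# Hypothetical type encoding used only in this sketch:
#   ("int", size_in_bytes)
#   ("array", element_type, element_count)
#   ("struct", size_in_bytes, [(field_offset, field_type), ...])

def size_of(ty):
    """Allocation size in bytes of a toy type."""
    if ty[0] == "array":
        return ty[2] * size_of(ty[1])
    return ty[1]  # "int" and "struct" carry their size directly


def accumulate_constant_offset(pointee, indices):
    """Return the byte offset for the indices, or None if any is non-constant."""
    if not all(isinstance(i, int) for i in indices):
        return None
    offset = 0
    ty = pointee
    for pos, idx in enumerate(indices):
        if pos == 0:
            # The first index steps over whole objects of the pointee type.
            offset += idx * size_of(ty)
        elif ty[0] == "struct":
            field_offset, ty = ty[2][idx]
            offset += field_offset
        elif ty[0] == "array":
            ty = ty[1]
            offset += idx * size_of(ty)
        else:
            raise TypeError("cannot index into a scalar type")
    return offset


I32 = ("int", 4)
ARR = ("array", I32, 10)
PAIR = ("struct", 48, [(0, I32), (8, ARR)])
```

In this model, indexing element 3 of the array field of the second `PAIR` object gives 48 + 8 + 12 = 68 bytes, while a symbolic index makes the routine give up. That second case is where a caller such as the InlineCost analysis mentioned above would need its own handling.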