Commits · ef2284fbad9ff0621bad1b6adeae4dc9708a673f · Roger Ferrer / llvm-epi-0.8

Aug 15, 2012

[PathV2] Add mapped_file_region. Implementation for Windows and POSIX. · ef2284fb
Michael J. Spencer authored Aug 15, 2012
```
llvm-svn: 161976
```
ef2284fb

Fix another roundToIntegral bug where very large values could become infinity.... · 352dfff4

Owen Anderson authored Aug 15, 2012

Fix another roundToIntegral bug where very large values could become infinity.  Problem and solution identified by Steve Canon.

llvm-svn: 161969

352dfff4

Use vld1/vst1 to load/store f64 if alignment is < 4 and the target allows... · eec6bc62

Evan Cheng authored Aug 15, 2012

Use vld1/vst1 to load/store f64 if alignment is < 4 and the target allows unaligned access. rdar://12091029

llvm-svn: 161962

eec6bc62

Fix typo in comment. · be7e297b
Owen Anderson authored Aug 15, 2012
```
llvm-svn: 161956
```
be7e297b

Add missing Rfalse operand to the predicated pseudo-instructions. · 2ec0c41e

Jakob Stoklund Olesen authored Aug 15, 2012

When predicating this instruction:

  Rd = ADD Rn, Rm

We need an extra operand to represent the value given to Rd when the
predicate is false:

  Rd = ADDCC Rfalse, Rn, Rm, pred

The Rd and Rfalse operands are different registers while in SSA form.
Rfalse is tied to Rd to make sure they get the same register during
register allocation.

Previously, Rd and Rn were tied, but that is not required.

Compare to MOVCC:

  Rd = MOVCC Rfalse, Rtrue, pred

llvm-svn: 161955

2ec0c41e

Set the branch probability of branching to the 'normal' destination of an invoke · e1c54262

Bill Wendling authored Aug 15, 2012

instruction to something absurdly high, while setting the probability of
branching to the 'unwind' destination to the bare minimum. This should set cause
the normal destination's invoke blocks to be moved closer to the invoke.

PR13612

llvm-svn: 161944

e1c54262

[asan] implement --asan-always-slow-path, which is a part of the improvement... · 1e575ab8

Kostya Serebryany authored Aug 15, 2012

[asan] implement --asan-always-slow-path, which is a part of the improvement to handle unaligned partially OOB accesses. See http://code.google.com/p/address-sanitizer/issues/detail?id=100

llvm-svn: 161937

1e575ab8

Fix a problem with APFloat::roundToIntegral where it would return incorrect... · 1ff74b0d

Owen Anderson authored Aug 15, 2012

Fix a problem with APFloat::roundToIntegral where it would return incorrect results for negative inputs to trunc.  Add unit tests to verify this behavior.

llvm-svn: 161929

1ff74b0d

fix infinite loop in instcombine with more than 4GB memcpy · 69e172a6

Michael Liao authored Aug 15, 2012

- memcpy size is wrongly truncated into 32-bit and treat 8GB memcpy is
  0-sized memcpy
- as 0-sized memcpy/memset is already removed before SimplifyMemTransfer
  and SimplifyMemSet in visitCallInst, replace 0 checking with
  assertions.
- replace getZExtValue() with getLimitedValue() according to
  Eli Friedman

llvm-svn: 161923

69e172a6

Fix a typo that led to a failure to correctly verify bitcast instructions. · 58564d5a
Nick Lewycky authored Aug 15, 2012
```
Patch by Stephen Hines!

llvm-svn: 161921
```
58564d5a
Fix undefined behavior: don't perform array indexing through a potentially null · 8f3447c0
Richard Smith authored Aug 15, 2012
```
pointer.

llvm-svn: 161919
```
8f3447c0

The names of VFP variants of half-to-float conversion instructions were · c6d945b1

Anton Korobeynikov authored Aug 14, 2012

reversed. This leads to wrong codegen for float-to-half conversion
intrinsics which are used to support storage-only fp16 type.
NEON variants of same instructions are fine.

llvm-svn: 161907

c6d945b1

This needs braces. Spotted by Bill. · 5f61a749
Eric Christopher authored Aug 14, 2012
```
llvm-svn: 161906
```
5f61a749
minor fix of X86ISD::VSEXT_MOVL dump · 06f6fe87
Michael Liao authored Aug 14, 2012
```
llvm-svn: 161902
```
06f6fe87

Aug 14, 2012

fix PR11334 · 34107b91

Michael Liao authored Aug 14, 2012

- FP_EXTEND only support extending from vectors with matching elements.
  This results in the scalarization of extending to v2f64 from v2f32,
  which will be legalized to v4f32 not matching with v2f64.
- add X86-specific VFPEXT supproting extending from v4f32 to v2f64.
- add BUILD_VECTOR lowering helper to recover back the original
  extending from v4f32 to v2f64.
- test case is enhanced to include different vector width.

llvm-svn: 161894

34107b91

Switch the fixed-length disassembler to be table-driven. · ecaef49f

Jim Grosbach authored Aug 14, 2012

Refactor the TableGen'erated fixed length disassemblmer to use a
table-driven state machine rather than a massive set of nested
switch() statements.

As a result, the ARM Disassembler (ARMDisassembler.cpp) builds much more
quickly and generates a smaller end result. For a Release+Asserts build on
a 16GB 3.4GHz i7 iMac w/ SSD:

Time to compile at -O2 (averaged w/ hot caches):
  Previous: 35.5s
  New:       8.9s

TEXT size:
  Previous: 447,251
  New:      297,661

Builds in 25% of the time previously required and generates code 66% of
the size.

Execution time of the disassembler is only slightly slower (7% disassembling
10 million ARM instructions, 19.6s vs 21.0s). The new implementation has
not yet been tuned, however, so the performance should almost certainly
be recoverable should it become a concern.

llvm-svn: 161888

ecaef49f

Fix the construction of the magic constant for roundToIntegral to be 64-bit... · 0b357225

Owen Anderson authored Aug 14, 2012

Fix the construction of the magic constant for roundToIntegral to be 64-bit safe.  Fixes c-torture/execute/990826-0.c

llvm-svn: 161885

0b357225

[asan] insert crash basic blocks inline as opposed to inserting them at the... · fda7a138

Kostya Serebryany authored Aug 14, 2012

[asan] insert crash basic blocks inline as opposed to inserting them at the end of the function. This doesn't seem to fix or break anything, but is considered to be more friendly to downstream passes

llvm-svn: 161870

fda7a138

Factor duplicate calls to getUNDEF in several functions. · 925a281b
Craig Topper authored Aug 14, 2012
```
llvm-svn: 161860
```
925a281b

Re-factor intrinsic lowering to combine common parts of similar intrinsics.... · d0d4b11f

Craig Topper authored Aug 14, 2012

Re-factor intrinsic lowering to combine common parts of similar intrinsics. Reduces compiled code size a little bit.

llvm-svn: 161859

d0d4b11f

Change greater than to greater than or equal so that an identical sized store... · 2a40418a

Craig Topper authored Aug 14, 2012

Change greater than to greater than or equal so that an identical sized store to the same offset is treated as completing overwriting.

llvm-svn: 161857

2a40418a

Fix undefined behavior: binding null pointer to reference. No functionality change. · 0ff8f0ea
Richard Smith authored Aug 14, 2012
```
llvm-svn: 161853
```
0ff8f0ea

During the CodeGenPrepare we often lower intrinsics (such as objsize) · 70409991

Nadav Rotem authored Aug 14, 2012

and allow some optimizations to turn conditional branches into unconditional.
This commit adds a simple control-flow optimization which merges two consecutive
basic blocks which are connected by a single edge. This allows the codegen to
operate on larger basic blocks.

rdar://11973998

llvm-svn: 161852

70409991

Grammar. · 160522c2
Eric Christopher authored Aug 14, 2012
```
llvm-svn: 161851
```
160522c2
Typo. · 97f6ea9f
Eric Christopher authored Aug 14, 2012
```
llvm-svn: 161826
```
97f6ea9f

Add a roundToIntegral method to APFloat, which can be parameterized over... · a40319b7

Owen Anderson authored Aug 13, 2012

Add a roundToIntegral method to APFloat, which can be parameterized over various rounding modes.  Use this to implement SelectionDAG constant folding of FFLOOR, FCEIL, and FTRUNC.

llvm-svn: 161807

a40319b7

Transfer weights in transferSuccessorsAndUpdatePHIs(). · 396b595b
Jakob Stoklund Olesen authored Aug 13, 2012
```
llvm-svn: 161805
```
396b595b
Print out MachineBasicBlock successor weights when available. · 1dc107a8
Jakob Stoklund Olesen authored Aug 13, 2012
```
llvm-svn: 161804
```
1dc107a8

LICM uses AliasSet information to hoist and sink instructions. However, other... · 8d804520

Nadav Rotem authored Aug 13, 2012

LICM uses AliasSet information to hoist and sink instructions. However, other passes, such as LoopRotate
may invalidate its AliasSet because SSAUpdater does not update the AliasSet properly.
This patch teaches SSAUpdater to notify AliasSet that it made changes.
The testcase in PR12901 is too big to be useful and I could not reduce it to a normal size.

rdar://11872059 PR12901

llvm-svn: 161803

8d804520

MemoryDependenceAnalysis attempts to find the first memory dependency for function calls. · 5d4e2058

Nadav Rotem authored Aug 13, 2012

Currently, if GetLocation reports that it did not find a valid pointer (this is the case for volatile load/stores),
we ignore the result. This patch adds code to handle the cases where we did not obtain a valid pointer.

rdar://11872864 PR12899

llvm-svn: 161802

5d4e2058

Aug 13, 2012

Remove the TII::scheduleTwoAddrSource() hook. · 702bcc3b

Jakob Stoklund Olesen authored Aug 13, 2012

It never does anything when running 'make check', and it get's in the
way of updating live intervals in 2-addr.

The hook was originally added to help form IT blocks in Thumb2 code
before register allocation, but the pass ordering has changed since
then, and we run if-conversion after register allocation now.

When the MI scheduler is enabled, there will be no less than two
schedulers between 2-addr and Thumb2ITBlockPass, so this hook is
unlikely to help anything.

llvm-svn: 161794

702bcc3b

ARM: enable struct byval for AAPCS-VFP. · d6c8270e
Manman Ren authored Aug 13, 2012
```
This change is to be enabled in clang.

rdar://9877866

llvm-svn: 161789
```
d6c8270e
Whitespace cleanup. · 49aeb5cc
Bill Wendling authored Aug 13, 2012
```
llvm-svn: 161788
```
49aeb5cc
Count triangles and diamonds in early if-conversion. · d0af1d96
Jakob Stoklund Olesen authored Aug 13, 2012
```
llvm-svn: 161783
```
d0af1d96
Delete dead typedef. · 62a097d1
Jakob Stoklund Olesen authored Aug 13, 2012
```
llvm-svn: 161782
```
62a097d1

Handle extra Tail predecessors in if-conversion. · 83a927d8

Jakob Stoklund Olesen authored Aug 13, 2012

It is still possible to if-convert if the tail block has extra
predecessors, but the tail phis must be rewritten instead of being
removed.

llvm-svn: 161781

83a927d8

[Hexagon] Don't mark callee saved registers as clobbered by a tail call · 0bb7f23c

Arnold Schwaighofer authored Aug 13, 2012

This was causing unnecessary spills/restores of callee saved registers.

Fixes PR13572.

Patch by Pranav Bhandarkar!

llvm-svn: 161778

0bb7f23c

Do not optimize (or (and X,Y), Z) into BFI and other sequences if the AND... · 3a94c545

Nadav Rotem authored Aug 13, 2012

Do not optimize (or (and X,Y), Z) into BFI and other sequences if the AND ISDNode has more than one user. 

rdar://11876519

llvm-svn: 161775

3a94c545

X86: move Int_CVTSD2SSrr, Int_CVTSI2SSrr, Int_CVTSI2SDrr, Int_CVTSS2SDrr from · 959acb10

Manman Ren authored Aug 13, 2012

OpTbl1 to OpTbl2 since they have 3 operands and the last operand can be changed
to a memory operand.

PR13576

llvm-svn: 161769

959acb10

Add support for the %H output modifier. · 7d8b53c1
Eric Christopher authored Aug 13, 2012
```
Patch by Weiming Zhao.

llvm-svn: 161768
```
7d8b53c1