Commits · 612d70b19d3f46bf293daa2952127ba8bcb1edf6 · Roger Ferrer / llvm-epi-0.8

Nov 20, 2011

Refactor code to use new attribute getters on CallSite for NoCapture and ByVal. · 612d70b1
Nick Lewycky authored Nov 20, 2011
```
Suggested in code review by Eli.

That code in InstCombine looks kinda suspicious.

llvm-svn: 145013
```
612d70b1

The logic for breaking the CFG in the presence of hot successors didn't · 18dfac38

Chandler Carruth authored Nov 20, 2011

properly account for the *global* probability of the edge being taken.
This manifested as a very large number of unconditional branches to
blocks being merged against the CFG even though they weren't
particularly hot within the CFG.

The fix is to check whether the edge being merged is both locally hot
relative to other successors for the source block, and globally hot
compared to other (unmerged) predecessors of the destination block.

This introduces a new crasher on GCC single-source, but it's currently
behind a flag, and Ben has offered to work on the reduction. =]

llvm-svn: 145010

18dfac38

SCEV: Actually set overflow flags on add expressions. · b5ba2eef
Benjamin Kramer authored Nov 20, 2011
```
setFlags doesn't modify its arguments.

llvm-svn: 145007
```
b5ba2eef

Add code for lowering v32i8 shifts by a splat to AVX2 immediate shift... · e79761df

Craig Topper authored Nov 20, 2011

Add code for lowering v32i8 shifts by a splat to AVX2 immediate shift instructions. Remove 256-bit splat handling from LowerShift as it was already handled by PerformShiftCombine.

llvm-svn: 145005

e79761df

Nov 19, 2011

Use 256-bit vcmpeqd for creating an all ones vector when AVX2 is enabled. · a3a65836
Craig Topper authored Nov 19, 2011
```
llvm-svn: 145004
```
a3a65836

Remove some of the special classes that worked around an old tablegen... · bac86038

Craig Topper authored Nov 19, 2011

Remove some of the special classes that worked around an old tablegen limitation of not being able to remove redundant bitconverts from patterns.

llvm-svn: 145003

bac86038

Custom lower AVX2 variable shift intrinsics to shl/srl/sra nodes and remove the intrinsic patterns. · 3af6ae08
Craig Topper authored Nov 19, 2011
```
llvm-svn: 144999
```
3af6ae08

Move the handling of unanalyzable branches out of the loop-driven chain · f3dc9eff

Chandler Carruth authored Nov 19, 2011

formation phase and into the initial walk of the basic blocks. We
essentially pre-merge all blocks where unanalyzable fallthrough exists,
as we won't be able to update the terminators effectively after any
reorderings. This is quite a bit more principled as there may be CFGs
where the second half of the unanalyzable pair has some analyzable
predecessor that gets placed first. Then it may get placed next,
implicitly breaking the unanalyzable branch even though we never even
looked at the part that isn't analyzable. I've included a test case that
triggers this (thanks Benjamin yet again!), and I'm hoping to synthesize
some more general ones as I dig into related issues.

Also, to make this new scheme work we have to be able to handle branches
into the middle of a chain, so add this check. We always fallback on the
incoming ordering.

Finally, this starts to really underscore a known limitation of the
current implementation -- we don't consider broken predecessors when
merging successors. This can caused major missed opportunities, and is
something I'm planning on looking at next (modulo more bug reports).

llvm-svn: 144994

f3dc9eff

Synthesize SSSE3/AVX 128-bit horizontal integer add/sub instructions from... · f984efbf

Craig Topper authored Nov 19, 2011

Synthesize SSSE3/AVX 128-bit horizontal integer add/sub instructions from add/sub of appropriate shuffle vectors.

llvm-svn: 144989

f984efbf

Collapse X86 PSIGNB/PSIGNW/PSIGND node types. · 81390be0
Craig Topper authored Nov 19, 2011
```
llvm-svn: 144988
```
81390be0
Extend VPBLENDVB and VPSIGN lowering to work for AVX2. · de6b73bb
Craig Topper authored Nov 19, 2011
```
llvm-svn: 144987
```
de6b73bb
Remove unused parameters from the AVX maskmov classes. · 66e2b5a6
Craig Topper authored Nov 19, 2011
```
llvm-svn: 144985
```
66e2b5a6

Nov 18, 2011

Fix a corner case in updating LoopInfo after fully unrolling an outer loop. · 6b4d578f

Andrew Trick authored Nov 18, 2011

The loop tree's inclusive block lists are painful and expensive to
update. (I have no idea why they're inclusive). The design was
supposed to handle this case but the implementation missed it and my
unit tests weren't thorough enough.

Fixes PR11335: loop unroll update.

llvm-svn: 144970

6b4d578f

Add AVX2 vpbroadcast support · 1ec141d0
Nadav Rotem authored Nov 18, 2011
```
llvm-svn: 144967
```
1ec141d0
[asan] workaround for reg alloc bug 11395: don't instrument functions with... · 1cdc6e95
Kostya Serebryany authored Nov 18, 2011
```
[asan] workaround for reg alloc bug 11395: don't instrument functions with large chunks of inline assembler

llvm-svn: 144962
```
1cdc6e95
Guard call to getRegForValue with isTypeLegal check to avoid unnecessary work/dead code. · ee93ff73
Chad Rosier authored Nov 18, 2011
```
llvm-svn: 144959
```
ee93ff73

DISubrange supports unsigned lower/upper array bounds, so let's not fake it in... · 107e8ec3

Devang Patel authored Nov 17, 2011

DISubrange supports unsigned lower/upper array bounds, so let's not fake it in the end while emitting DWARF. If a FE needs to encode signed lower/upper array bounds then we need to extend DISubrange or ad DISignedSubrange. 

llvm-svn: 144937

107e8ec3

quick fix: remove GlobalVariable::GlobalVariable mistakenly commited at... · a6edf4c2

Kostya Serebryany authored Nov 17, 2011

quick fix: remove GlobalVariable::GlobalVariable mistakenly commited at r144933. For some reason this compiles on linux 

llvm-svn: 144936

a6edf4c2

Fix an overly general check in SimplifyIndvar to handle useless phi cycles. · 94904586

Andrew Trick authored Nov 17, 2011

The right way to check for a binary operation is
cast<BinaryOperator>. The original check: cast<Instruction> &&
numOperands() == 2 would match phi "instructions", leading to an
infinite loop in extreme corner case: a useless phi with operands
[self, constant] that prior optimization passes failed to remove,
being used in the loop by another useless phi, in turn being used by an
lshr or udiv.

Fixes PR11350: runaway iteration assertion.

llvm-svn: 144935

94904586

fall back to explicit list of allowed linkages when instrumenting globals in... · 65e2211b

Kostya Serebryany authored Nov 17, 2011

fall back to explicit list of allowed linkages when instrumenting globals in asan; add a test check that asan does not touch linkonce_odr

llvm-svn: 144933

65e2211b

Nov 17, 2011

Add TODO comment. · 0eff3e5c
Chad Rosier authored Nov 17, 2011
```
llvm-svn: 144920
```
0eff3e5c

Fix SSE/AVX integer comparison patterns to understand that all integer vector... · f41e1d02

Craig Topper authored Nov 17, 2011

Fix SSE/AVX integer comparison patterns to understand that all integer vector loads are promoted to i64 vector loads so patterns need a bitconvert. Also slightly simplify the AVX2 variable shift patterns by using the predefined bitconvert pattern fragments.

llvm-svn: 144896

f41e1d02

Dead code. · 15b2498e
Chad Rosier authored Nov 17, 2011
```
llvm-svn: 144888
```
15b2498e

When fast iseling a GEP, accumulate the offset rather than emitting a series of · f83ab704

Chad Rosier authored Nov 17, 2011

ADDs.  MaxOffs is used as a threshold to limit the size of the offset. Tradeoffs
being: (1) If we can't materialize the large constant then we'll cause fast-isel
to bail. (2) Too large of an offset can't be directly encoded in the ADD
resulting in a MOV+ADD.  Generally not a bad thing because otherwise we would
have had ADD+ADD, but on Thumb this turns into a MOVS+MOVT+ADD. Working on a fix
for that. (3) Conversely, too low of a threshold we'll miss opportunities to 
coalesce ADDs.
rdar://10412592

llvm-svn: 144886

f83ab704

Remove seemingly unnecessary duplicate VROUND definitions. · f17b6005
Craig Topper authored Nov 17, 2011
```
llvm-svn: 144885
```
f17b6005

Add support for custom names for library functions in TargetLibraryInfo. Add... · 489c0ff4

Eli Friedman authored Nov 17, 2011

Add support for custom names for library functions in TargetLibraryInfo.  Add a custom name for fwrite and fputs on x86-32 OSX.  Make SimplifyLibCalls honor the custom
names for fwrite and fputs.

Fixes <rdar://problem/9815881>.

llvm-svn: 144876

489c0ff4

Don't unconditionally set the kill flag. · ce619ddf
Chad Rosier authored Nov 17, 2011
```
rdar://10456186

llvm-svn: 144872
```
ce619ddf

Turn on vzeroupper insertion on call boundaries for AVX; it works as far as I... · 20439a42

Eli Friedman authored Nov 17, 2011

Turn on vzeroupper insertion on call boundaries for AVX; it works as far as I know, and I'd like to see wider testing.

llvm-svn: 144867

20439a42

Make sure to replace the chain properly when DAGCombining a... · ff1eaa75

Eli Friedman authored Nov 16, 2011

Make sure to replace the chain properly when DAGCombining a LOAD+EXTRACT_VECTOR_ELT into a single LOAD.  Fixes PR10747/PR11393.

llvm-svn: 144863

ff1eaa75

Object/COFF: Support common symbols. · d27d51fb
Michael J. Spencer authored Nov 16, 2011
```
llvm-svn: 144861
```
d27d51fb

Nov 16, 2011
- Generalize the fixup info for ARM mode. · d3f02cbc
  Jim Grosbach authored Nov 16, 2011
```
We don't (yet) have the granularity in the fixups to be specific about which
bitranges are affected. That's a future cleanup, but we're not there yet.

llvm-svn: 144852
```
  d3f02cbc
- Lower 64-bit constant pool node. · b31abde0
  Akira Hatanaka authored Nov 16, 2011
```
llvm-svn: 144849
```
  b31abde0
- Lower 64-bit block address. · eb420717
  Akira Hatanaka authored Nov 16, 2011
```
llvm-svn: 144847
```
  eb420717
- Fix encoding of NOP used for padding in ARM mode .align. · 7ccdb7c0
  Jim Grosbach authored Nov 16, 2011
```
llvm-svn: 144842
```
  7ccdb7c0
- Add patterns for 64-bit tglobaladdr, tblockaddress, tjumptable and tconstpool · 7b8547c4
  Akira Hatanaka authored Nov 16, 2011
```
nodes.

llvm-svn: 144841
```
  7b8547c4
- 64-bit jump register instruction. · 6d617cec
  Akira Hatanaka authored Nov 16, 2011
```
llvm-svn: 144840
```
  6d617cec
- Another missing X86ISD::MOVLPD pattern. rdar://10450317 · 011538dc
  Evan Cheng authored Nov 16, 2011
```
llvm-svn: 144839
```
  011538dc
- ARM assembly parsing for shifted register operands for MOV instruction. · bfe5c5c9
  Jim Grosbach authored Nov 16, 2011
```
llvm-svn: 144837
```
  bfe5c5c9
- Clean up debug printing of ARM shifted operands. · 01e04392
  Jim Grosbach authored Nov 16, 2011
```
llvm-svn: 144836
```
  01e04392
- Add fast-isel stats to determine who's doing all the work, the · ff40b1e1
  Chad Rosier authored Nov 16, 2011
```
target-independent selector or the target-specific selector.

llvm-svn: 144833
```
  ff40b1e1