Commits · 030b0d77ed950ebe11c79daec7c8c393cb8c7e5c · Roger Ferrer / llvm-epi-0.8

Jan 17, 2012

LSR fix: broaden the check for loop preheaders. · 12728f04

Andrew Trick authored Jan 17, 2012

It's becoming clear that LoopSimplify needs to unconditionally create loop preheaders. But that is a bigger fix. For now, continuing to hack LSR.
Fixes rdar://10701050 "Cannot split an edge from an IndirectBrInst" assert.

llvm-svn: 148288

12728f04

Remove unreachable code. (replace with llvm_unreachable to help GCC where necessary) · b48ed1a4
David Blaikie authored Jan 17, 2012
```
llvm-svn: 148284
```
b48ed1a4

Jan 16, 2012
- Fixed comment in loop-unswitch. · 2931a59e
  Stepan Dyatkovskiy authored Jan 16, 2012
```
llvm-svn: 148252
```
  2931a59e
Jan 15, 2012

Cosmetic patch for r148215. · 7ec12e43
Stepan Dyatkovskiy authored Jan 15, 2012
```
llvm-svn: 148216
```
7ec12e43

Fixup for r148132. Type replacement for LoopsProperties: from DenseMap to... · cb2adbac

Stepan Dyatkovskiy authored Jan 15, 2012

Fixup for r148132. Type replacement for LoopsProperties: from DenseMap to std::map, since we need to keep a valid pointer to properties of current loop.

Message for r148132:
LoopUnswitch: All helper data that is collected during loop-unswitch iterations was moved to separated class (LUAnalysisCache).

llvm-svn: 148215

cb2adbac

Jan 14, 2012
- Fix an unused variable warning that Chad noticed. · 4cf362ac
  Dan Gohman authored Jan 14, 2012
```
llvm-svn: 148164
```
  4cf362ac
Jan 13, 2012
- Speculatively revert r148132+r148133 to try and fix a buildbot failure. · d476fdc3
  Eli Friedman authored Jan 13, 2012
```
llvm-svn: 148149
```
  d476fdc3
- Cosmetic patch for r148132. · 0a920fa2
  Stepan Dyatkovskiy authored Jan 13, 2012
```
llvm-svn: 148133
```
  0a920fa2
- LoopUnswitch: All helper data that is collected during loop-unswitch... · cbcbdb23
  Stepan Dyatkovskiy authored Jan 13, 2012
```
LoopUnswitch: All helper data that is collected during loop-unswitch iterations was moved to separated class (LUAnalysisCache). 

llvm-svn: 148132
```
  cbcbdb23
- Implement proper ObjC ARC objc_retainBlock "escape" analysis, so that · 728db499
  Dan Gohman authored Jan 13, 2012
```
the optimizer doesn't eliminate objc_retainBlock calls which are needed
for their side effect of copying blocks onto the heap.
This implements rdar://10361249.

llvm-svn: 148076
```
  728db499
Jan 11, 2012

Re-fix the issue Bill fixed in r147899 in a slightly different way, which... · b31c627b

Eli Friedman authored Jan 11, 2012

Re-fix the issue Bill fixed in r147899 in a slightly different way, which doesn't abuse the semantics of linker_private.  We don't really want to merge any string constant with a weak_odr global.

llvm-svn: 147971

b31c627b

[asan] extend the workaround for http://llvm.org/bugs/show_bug.cgi?id=11395 :... · 687d0781

Kostya Serebryany authored Jan 11, 2012

[asan] extend the workaround for http://llvm.org/bugs/show_bug.cgi?id=11395: don't instrument the function at all on x86_32 if it has a large asm blob

llvm-svn: 147953

687d0781

Improved compile time: · 82165698

Stepan Dyatkovskiy authored Jan 11, 2012

1. Size heuristics changed. Now we calculate number of unswitching
branches only once per loop.
2. Some checks was moved from UnswitchIfProfitable to
processCurrentLoop, since it is not changed during processCurrentLoop
iteration. It allows decide to skip some loops at an early stage.
Extended statistics:
- Added total number of instructions analyzed.

llvm-svn: 147935

82165698

If the global variable is removed by the linker, then don't constant merge it · c7915519

Bill Wendling authored Jan 11, 2012

with other symbols.

An object in the __cfstring section is suppoed to be filled with CFString
objects, which have a pointer to ___CFConstantStringClassReference followed by a
pointer to a __cstring. If we allow the object in the __cstring section to be
merged with another global, then it could end up in any section. Because the
linker is going to remove these symbols in the final executable, we shouldn't
bother to merge them.
<rdar://problem/10564621>

llvm-svn: 147899

c7915519

Jan 10, 2012

Enable LSR IV Chains with sufficient heuristics. · d5d2db9a

Andrew Trick authored Jan 10, 2012

These heuristics are sufficient for enabling IV chains by
default. Performance analysis has been done for i386, x86_64, and
thumbv7. The optimization is rarely important, but can significantly
speed up certain cases by eliminating spill code within the
loop. Unrolled loops are prime candidates for IV chains. In many
cases, the final code could still be improved with more target
specific optimization following LSR. The goal of this feature is for
LSR to make the best choice of induction variables.

Instruction selection may not completely take advantage of this
feature yet. As a result, there could be cases of slight code size
increase.

Code size can be worse on x86 because it doesn't support postincrement
addressing. In fact, when chains are formed, you may see redundant
address plus stride addition in the addressing mode. GenerateIVChains
tries to compensate for the common cases.

On ARM, code size increase can be mitigated by using postincrement
addressing, but downstream codegen currently misses some opportunities.

llvm-svn: 147826

d5d2db9a

Jan 09, 2012

Adding IV chain generation to LSR. · 248d410e

Andrew Trick authored Jan 09, 2012

After collecting chains, check if any should be materialized. If so,
hide the chained IV users from the LSR solver. LSR will only solve for
the head of the chain. GenerateIVChains will then materialize the
chained IV users by computing the IV relative to its previous value in
the chain.

In theory, chained IV users could be exposed to LSR's solver. This
would be considerably complicated to implement and I'm not aware of a
case where we need it. In practice it's more important to
intelligently prune the search space of nontrivial loops before
running the solver, otherwise the solver is often forced to prune the
most optimal solutions. Hiding the chained users does this well, so
that LSR is more likely to find the best IV for the chain as a whole.

llvm-svn: 147801

248d410e

Adding collection of IV chains to LSR. · 29fe5f03

Andrew Trick authored Jan 09, 2012

This collects a set of IV uses within the loop whose values can be
computed relative to each other in a sequence. Following checkins will
make use of this information.

llvm-svn: 147797

29fe5f03

"Minor LSR debugging stuff" · 4dc3eff5
Andrew Trick authored Jan 09, 2012
```
llvm-svn: 147785
```
4dc3eff5
Move assert to the right place. · f7fe24f4
Benjamin Kramer authored Jan 09, 2012
```
llvm-svn: 147779
```
f7fe24f4
InstCombine: Teach foldLogOpOfMaskedICmpsHelper that sign bit tests are bit tests. · f9d0cc01
Benjamin Kramer authored Jan 09, 2012
```
This subsumes several other transforms while enabling us to catch more cases.

llvm-svn: 147777
```
f9d0cc01

Jan 08, 2012

Tweak my last commit to be less conservative about uses. · 6609f741

Benjamin Kramer authored Jan 08, 2012

We still save an instruction when just the "and" part is replaced.
Also change the code to match comments more closely.

llvm-svn: 147753

6609f741

InstCombine: If we have a bit test and a sign test anded/ored together, merge... · da37e153

Benjamin Kramer authored Jan 08, 2012

InstCombine: If we have a bit test and a sign test anded/ored together, merge the sign bit into the bit test.

This is common in bit field code, e.g. checking if the first or the last bit of a bit field is set.

llvm-svn: 147749

da37e153

Jan 07, 2012

Enable redundant phi elimination after LSR. · 06f6c05d

Andrew Trick authored Jan 07, 2012

This will be more important as we extend the LSR pass in ways that don't rely on the formula solver. In particular, we need it for constructing IV chains.

llvm-svn: 147724

06f6c05d

LSR: Don't optimize loops if an outer loop has no preheader. · 732ad80d

Andrew Trick authored Jan 07, 2012

LoopSimplify may not run on some outer loops, e.g. because of indirect
branches. SCEVExpander simply cannot handle outer loops with no preheaders.
Fixes rdar://10655343 SCEVExpander segfault.

llvm-svn: 147718

732ad80d

LSR: run DeleteDeadPhis before replaceCongruentPhis. · 2ec61a89
Andrew Trick authored Jan 07, 2012
```
llvm-svn: 147711
```
2ec61a89
Extended replaceCongruentPhis to handle mixed phi types. · 5adedf5d
Andrew Trick authored Jan 07, 2012
```
llvm-svn: 147707
```
5adedf5d

Jan 06, 2012

[asan] cleanup: remove the SIGILL-related code (compiler part) · 3411f2ea
Kostya Serebryany authored Jan 06, 2012
```
llvm-svn: 147667
```
3411f2ea

Fix SpeculativelyExecuteBB to either speculate all or none of the phis · 5ab9c0a9

Dan Gohman authored Jan 05, 2012

present in the bottom of the CFG triangle, as the transformation isn't
ever valuable if the branch can't be eliminated.

Also, unify some heuristics between SimplifyCFG's multiple
if-converters, for consistency.

This fixes rdar://10627242.

llvm-svn: 147630

5ab9c0a9

PR11705, part 2: globalopt shouldn't put inttoptr/ptrtoint operations into... · 55fa49f3

Eli Friedman authored Jan 05, 2012

PR11705, part 2: globalopt shouldn't put inttoptr/ptrtoint operations into global initializers if there's an implied extension or truncation.

llvm-svn: 147625

55fa49f3

Jan 05, 2012

Revert r56315. When the instruction to speculate is a load, this · 52672118

Dan Gohman authored Jan 05, 2012

code can incorrectly move the load across a store. This never
happens in practice today, but only because the current
heuristics accidentally preclude it.

llvm-svn: 147623

52672118

SCCCaptured is trivially false on entry to this loop and not modified inside it. · f740db31
Nick Lewycky authored Jan 05, 2012
```
Eliminate the dead test for it on each loop iteration. No functionality change.

llvm-svn: 147616
```
f740db31

Jan 04, 2012
- Remove pointless asserts. · 6d1d4bb6
  Nick Lewycky authored Jan 04, 2012
```
llvm-svn: 147529
```
  6d1d4bb6
- Teach instcombine all sorts of great stuff about shifts that have exact, nuw or · 0c48afa0
  Nick Lewycky authored Jan 04, 2012
```
nsw bits on them.

llvm-svn: 147528
```
  0c48afa0
Dec 31, 2011
- Make use of the exact bit when optimizing '(X >>exact 3) << 1' to eliminate the · b59008c6
  Nick Lewycky authored Dec 31, 2011
```
'and' that would zero out the trailing bits, and to produce an exact shift
ourselves.

llvm-svn: 147391
```
  b59008c6
Dec 29, 2011

Change CaptureTracking to pass a Use* instead of a Value* when a value is · 4c378a44

Nick Lewycky authored Dec 28, 2011

captured. This allows the tracker to look at the specific use, which may be
especially interesting for function calls.

Use this to fix 'nocapture' deduction in FunctionAttrs. The existing one does
not iterate until a fixpoint and does not guarantee that it produces the same
result regardless of iteration order. The new implementation builds up a graph
of how arguments are passed from function to function, and uses a bottom-up walk
on the argument-SCCs to assign nocapture. This gets us nocapture more often, and
does so rather efficiently and independent of iteration order.

llvm-svn: 147327

4c378a44

Dec 28, 2011
- Demystify this comment. · 8640fdf0
  Nick Lewycky authored Dec 28, 2011
```
llvm-svn: 147307
```
  8640fdf0
Dec 27, 2011
- Use false not zero, as a bool. · 398255e7
  Nick Lewycky authored Dec 27, 2011
```
llvm-svn: 147292
```
  398255e7
- Turn cos(-x) into cos(x). Patch by Alexander Malyshev! · a8e84fb5
  Nick Lewycky authored Dec 27, 2011
```
llvm-svn: 147291
```
  a8e84fb5
- Teach simplifycfg to recompute branch weights when merging some branches, and · c554a9b5
  Nick Lewycky authored Dec 27, 2011
```
to discard weights when appropriate. Still more to do (and a new TODO), but
it's a start!

llvm-svn: 147286
```
  c554a9b5
- Fix warning. · 2b14b80b
  Rafael Espindola authored Dec 26, 2011
```
llvm-svn: 147284
```
  2b14b80b