Commits · 6892c5507f9bd3b55864b9fb99f3b9b0404f98dd · Roger Ferrer / llvm-epi-0.8

May 29, 2008
- convert more operand loops to iterator formulation · 3a9fba5a
  Gabor Greif authored May 29, 2008
```
llvm-svn: 51663
```
  3a9fba5a
May 28, 2008
- Implement PR2370: memmove(x,x,size) -> noop. · ecdefb5d
  Chris Lattner authored May 28, 2008
```
llvm-svn: 51636
```
  ecdefb5d
May 27, 2008
- Fix some constructs that gcc-4.4 warns about. · 698348df
  Duncan Sands authored May 27, 2008
```
llvm-svn: 51591
```
  698348df
- InequalityGraph::node() can create new nodes, invalidating iterators across · 3ebe82b5
  Nick Lewycky authored May 27, 2008
```
the set of nodes. Fix makeEqual to handle this by creating the new node first
then iterating across them second.

llvm-svn: 51573
```
  3ebe82b5
- Grammaro. · 6be65d2a
  Nick Lewycky authored May 26, 2008
```
llvm-svn: 51572
```
  6be65d2a
May 26, 2008

Factor code to copy global value attributes like · dd7daee8

Duncan Sands authored May 26, 2008

the section or the visibility from one global
value to another: copyAttributesFrom.  This is
particularly useful for duplicating functions:
previously this was done by explicitly copying
each attribute in turn at each place where a
new function was created out of an old one, with
the result that obscure attributes were regularly
forgotten (like the collector or the section).
Hopefully now everything is uniform and nothing
is forgotten.

llvm-svn: 51567

dd7daee8

Use a DenseMap instead of an std::map, speeding up the testcase in PR2368 by about a third. · d3f21d16
Owen Anderson authored May 26, 2008
```
llvm-svn: 51565
```
d3f21d16

May 25, 2008

"ret (constexpr)" can't be folded into a Constant. Add a method to · f6ccd258

Nick Lewycky authored May 25, 2008

Analysis/ConstantFolding to fold ConstantExpr's, then make instcombine use it
to try to use targetdata to fold constant expressions on void instructions.

Also extend the icmp(inttoptr, inttoptr) folding to handle the case where
int size != ptr size.

llvm-svn: 51559

f6ccd258

May 24, 2008
- Fix a serious brain-o. Obviously no-one reviewed my patch :( · 87a099a0
  Chris Lattner authored May 24, 2008
```
This fixes PR2359

llvm-svn: 51536
```
  87a099a0
- Fix PR2358 by resolving calls with undef arguments to overdefined. · 5c207c83
  Chris Lattner authored May 24, 2008
```
llvm-svn: 51535
```
  5c207c83
- Remove x86.sse2.loadh.pd and x86.sse2.loadl.pd. These will be lowered into... · 02912418
  Evan Cheng authored May 24, 2008
```
Remove x86.sse2.loadh.pd and x86.sse2.loadl.pd. These will be lowered into load and shuffle instructions.

llvm-svn: 51521
```
  02912418
May 23, 2008

Tidy up BasicBlock::getFirstNonPHI, and change a bunch of places to · f96e1371
Dan Gohman authored May 23, 2008
```
use it instead of duplicating its functionality.

llvm-svn: 51499
```
f96e1371
Replace some weird usage of UserOp1 introduced in r49492 by a plain if. · f52b23c0
Matthijs Kooijman authored May 23, 2008
```
llvm-svn: 51482
```
f52b23c0

Restucture a part of the SimplifyCFG pass and include a testcase. · aef2b819

Matthijs Kooijman authored May 23, 2008

The SimplifyCFG pass looks at basic blocks that contain only phi nodes,
followed by an unconditional branch. In a lot of cases, such a block (BB) can
be merged into their successor (Succ).

This merging is performed by TryToSimplifyUncondBranchFromEmptyBlock. It does
this by taking all phi nodes in the succesor block Succ and expanding them to
include the predecessors of BB. Furthermore, any phi nodes in BB are moved to
Succ and expanded to include the predecessors of Succ as well.

Before attempting this merge, CanPropagatePredecessorsForPHIs checks to see if
all phi nodes can be properly merged. All functional changes are made to
this function, only comments were updated in
TryToSimplifyUncondBranchFromEmptyBlock.

In the original code, CanPropagatePredecessorsForPHIs looks quite convoluted
and more like stack of checks added to handle different kinds of situations
than a comprehensive check. In particular the first check in the function did
some value checking for the case that BB and Succ have a common predecessor,
while the last check in the function simply rejected all cases where BB and
Succ have a common predecessor. The first check was still useful in the case
that BB did not contain any phi nodes at all, though, so it was not completely
useless.

Now, CanPropagatePredecessorsForPHIs is restructured to to look a lot more
similar to the code that actually performs the merge. Both functions now look
at the same phi nodes in about the same order. Any conflicts (phi nodes with
different values for the same source) that could arise from merging or moving
phi nodes are detected. If no conflicts are found, the merge can happen.

Apart from only restructuring the checks, two main changes in functionality
happened.

Firstly, the old code rejected blocks with common predecessors in most cases.
The new code performs some extra checks so common predecessors can be handled
in a lot of cases. Wherever common predecessors still pose problems, the
blocks are left untouched.

Secondly, the old code rejected the merge when values (phi nodes) from BB were
used in any other place than Succ. However, it does not seem that there is any
situation that would require this check. Even more, this can be proven.

Consider that BB is a block containing of a single phi node "%a" and a branch
to Succ. Now, since the definition of %a will dominate all of its uses, BB
will dominate all blocks that use %a. Furthermore, since the branch from BB to
Succ is unconditional, Succ will also dominate all uses of %a.

Now, assume that one predecessor of Succ is not dominated by BB (and thus not
dominated by Succ). Since at least one use of %a (but in reality all of them)
is reachable from Succ, you could end up at a use of %a without passing
through it's definition in BB (by coming from X through Succ). This is a
contradiction, meaning that our original assumption is wrong. Thus, all
predecessors of Succ must also be dominated by BB (and thus also by Succ).

This means that moving the phi node %a from BB to Succ does not pose any
problems when the two blocks are merged, and any use checks are not needed.

llvm-svn: 51478

aef2b819

Indent fix. · f399bbf9
Matthijs Kooijman authored May 23, 2008
```
llvm-svn: 51477
```
f399bbf9
Constant integer vectors may also be negated. · 3bf5512d
Nick Lewycky authored May 23, 2008
```
llvm-svn: 51476
```
3bf5512d
Typo. · 8f3127c5
Nick Lewycky authored May 23, 2008
```
llvm-svn: 51475
```
8f3127c5
Revert X + X --> X * 2 optz'n which pessimizes heavily on x86. · 4f3d8785
Nick Lewycky authored May 23, 2008
```
llvm-svn: 51474
```
4f3d8785
Implement X + X for vectors. · 452fb329
Nick Lewycky authored May 23, 2008
```
llvm-svn: 51472
```
452fb329
Fix a recently added optimization to not crash on vectors. · 2ec9a011
Nick Lewycky authored May 23, 2008
```
llvm-svn: 51471
```
2ec9a011

Generalize the new code in instcombine's ComputeNumSignBits for handling · 6d5f120c

Dan Gohman authored May 23, 2008

and/or to handle more cases (such as this add-sitofp.ll testcase), and
port it to selectiondag's ComputeNumSignBits.

llvm-svn: 51469

6d5f120c

Use isSingleValueType instead of isFirstClassType to · 53b26985
Dan Gohman authored May 23, 2008
```
exclude struct and array types.

llvm-svn: 51467
```
53b26985
Allow for switch with no cases. Was causing fault · fecb8824
Dale Johannesen authored May 23, 2008
```
in gcc.dg/pr27531-1.c.

llvm-svn: 51464
```
fecb8824
Use isSingleValueType instead of isFirstClassType to · 30ab45d0
Dan Gohman authored May 23, 2008
```
exclude struct and array types.

llvm-svn: 51459
```
30ab45d0
Use isSingleValueType instead of isFirstClassType to · 7a0566b9
Dan Gohman authored May 23, 2008
```
exclude struct and array types.

llvm-svn: 51456
```
7a0566b9

May 22, 2008
- rewrite the validity checking for memory promotion to be simpler, · c5ec1e19
  Chris Lattner authored May 22, 2008
```
more aggressive, and more correct.  Verify that we only attempt to
promote loads and stores.

llvm-svn: 51406
```
  c5ec1e19
- Use 'continue' to reduce nesting in this loop. No functionality change. · f12c08dc
  Chris Lattner authored May 22, 2008
```
llvm-svn: 51399
```
  f12c08dc
May 21, 2008

When LSR is replacing an instruction, call · e62632e0

Dan Gohman authored May 21, 2008

ScalarEvolution::deleteValueFromRecords on it before doing the
replaceAllUsesWith, because ScalarEvolution looks at the instruction's
users to find SCEV references to the instruction's SCEV object in its
internal maps.

Move all of LSR's loop-related state clearing after processing the loop
and before cleaning up dead PHI nodes. This eliminates all of LSR's SCEV
references just before the calls to ScalarEvolution::deleteValueFromRecords
so that when ScalarEvolution drops its own SCEV references, the reference
counts will reach zero and the SCEVs will be deleted immediately.

These changes fix some compiler aborts involving ScalarEvolution holding
onto and reusing SCEV objects for instructions that have been deleted.
No regression test unfortunately; because the symptoms were due to
dangling pointers, reduced testcases ended up being fairly arbitrary.

llvm-svn: 51359

e62632e0

May 20, 2008

Port SelectionDAG's ComputeNumSignBits-using code to instcombine, · 81ab753b
Dan Gohman authored May 20, 2008
```
now that instcombine also has ComputeNumSignBits.

llvm-svn: 51350
```
81ab753b
Fix typo. · 5148a4ba
Matthijs Kooijman authored May 20, 2008
```
llvm-svn: 51303
```
5148a4ba

Teach instcombine 4 new xforms: · 7ac943ff

Chris Lattner authored May 20, 2008

  (add (sext x), cst) --> (sext (add x, cst'))
  (add (sext x), (sext y)) --> (sext (add int x, y))
  (add double (sitofp x), fpcst) --> (sitofp (add int x, intcst))
  (add double (sitofp x), (sitofp y)) --> (sitofp (add int x, y))

This generally reduces conversions.  For example MiBench/telecomm-gsm
gets these simplifications:

HACK2: 	%tmp67.i142.i.i = sext i16 %tmp6.i141.i.i to i32		; <i32> [#uses=1]
	%tmp23.i139.i.i = sext i16 %tmp2.i138.i.i to i32		; <i32> [#uses=1]
	%tmp8.i143.i.i = add i32 %tmp67.i142.i.i, %tmp23.i139.i.i		; <i32> [#uses=3]
HACK2: 	%tmp67.i121.i.i = sext i16 %tmp6.i120.i.i to i32		; <i32> [#uses=1]
	%tmp23.i118.i.i = sext i16 %tmp2.i117.i.i to i32		; <i32> [#uses=1]
	%tmp8.i122.i.i = add i32 %tmp67.i121.i.i, %tmp23.i118.i.i		; <i32> [#uses=3]
HACK2: 	%tmp67.i.i190.i = sext i16 %tmp6.i.i189.i to i32		; <i32> [#uses=1]
	%tmp23.i.i187.i = sext i16 %tmp2.i.i186.i to i32		; <i32> [#uses=1]
	%tmp8.i.i191.i = add i32 %tmp67.i.i190.i, %tmp23.i.i187.i		; <i32> [#uses=3]
HACK2: 	%tmp67.i173.i.i.i = sext i16 %tmp6.i172.i.i.i to i32		; <i32> [#uses=1]
	%tmp23.i170.i.i.i = sext i16 %tmp2.i169.i.i.i to i32		; <i32> [#uses=1]
	%tmp8.i174.i.i.i = add i32 %tmp67.i173.i.i.i, %tmp23.i170.i.i.i		; <i32> [#uses=3]
HACK2: 	%tmp67.i152.i.i.i = sext i16 %tmp6.i151.i.i.i to i32		; <i32> [#uses=1]
	%tmp23.i149.i.i.i = sext i16 %tmp2.i148.i.i.i to i32		; <i32> [#uses=1]
	%tmp8.i153.i.i.i = add i32 %tmp67.i152.i.i.i, %tmp23.i149.i.i.i		; <i32> [#uses=3]
HACK2: 	%tmp67.i.i.i.i = sext i16 %tmp6.i.i.i.i to i32		; <i32> [#uses=1]
	%tmp23.i.i5.i.i = sext i16 %tmp2.i.i.i.i to i32		; <i32> [#uses=1]
	%tmp8.i.i7.i.i = add i32 %tmp67.i.i.i.i, %tmp23.i.i5.i.i		; <i32> [#uses=3]


This also fixes a bug in ComputeNumSignBits handling select and
makes it more aggressive with and/or.

llvm-svn: 51302

7ac943ff

fix two issues Neil noticed, thanks! · 9c27f96d
Chris Lattner authored May 20, 2008
```
llvm-svn: 51296
```
9c27f96d

Refine the fix in r51169 to only apply when the operand val being · e5572706

Dan Gohman authored May 20, 2008

replaced is a PHI. This prevents it from inserting uses before defs
in the case that it isn't a PHI and it depends on other instructions
later in the block. This fixes the 447.dealII regression on x86-64.

llvm-svn: 51292

e5572706

Make AssociativeOpt static. · d717761a
Dan Gohman authored May 20, 2008
```
llvm-svn: 51290
```
d717761a
Do not erase induction variable increment if it is used outside the loop. · ee7bf41c
Devang Patel authored May 19, 2008
```
llvm-svn: 51280
```
ee7bf41c
Add a ComputeNumSignBits function for use by instcombine, based on the · 123438cc
Dan Gohman authored May 19, 2008
```
code in SelectionDAG.

llvm-svn: 51279
```
123438cc

May 19, 2008

switch to Type::getFPMantissaWidth instead of reinventing it. · b4271228
Chris Lattner authored May 19, 2008
```
llvm-svn: 51275
```
b4271228
minor cleanups, teach instcombine that sitofp/uitofp cannot · ba9acbe6
Chris Lattner authored May 19, 2008
```
produce a negative zero.

llvm-svn: 51272
```
ba9acbe6

convert fptosi(sitofp x) -> x if the fp value has enough bits in its mantissa · e35fe0f1

Chris Lattner authored May 19, 2008

to accurately represent the integer.  This triggers 9 times in 471.omnetpp,
though 8 of those seem to be inlined from the same place.

llvm-svn: 51271

e35fe0f1

Fold FP comparisons where one operand is converted from an integer · 5920a780

Chris Lattner authored May 19, 2008

type and the other operand is a constant into integer comparisons.
This happens surprisingly frequently (e.g. 10 times in 471.omnetpp),
which are things like this:

	%tmp8283 = sitofp i32 %tmp82 to double	
	%tmp1013 = fcmp ult double %tmp8283, 0.0

Clearly comparing tmp82 against i32 0 is cheaper here.

this also triggers 8 times in gobmk, including this one:

	%tmp375376 = sitofp i32 %tmp375 to double
	%tmp377 = fcmp ogt double %tmp375376, 8.150000e+01

which is comparing an integer against 81.5 :).

llvm-svn: 51268

5920a780