Commits · 1b706dd680745f4f7d6e6e62cf4777dc161abd9a · Roger Ferrer / llvm-epi-0.8

Jan 28, 2008

Fix PR1938 by forcing the code that uses an undefined value to branch one · 1b706dd6

Chris Lattner authored Jan 28, 2008

way or the other.  Rewriting the code itself prevents subsequent analysis
passes from making contradictory conclusions about the code that could 
cause an infeasible path to be made feasible.

llvm-svn: 46427

1b706dd6

Jan 27, 2008
- Be more careful modifying the use_list while also iterating through it. · efb16f70
  Nick Lewycky authored Jan 27, 2008
```
llvm-svn: 46417
```
  efb16f70
- The CorrelatedExpressionElimination pass is known to be buggy. Remove it. · 60361a16
  Bill Wendling authored Jan 27, 2008
```
This fixes PR1769.

llvm-svn: 46408
```
  60361a16
- Fold fptrunc(add (fpextend x), (fpextend y)) -> add(x,y), as GCC does. · fa1e7eef
  Chris Lattner authored Jan 27, 2008
```
llvm-svn: 46406
```
  fa1e7eef
Jan 26, 2008

If there are no machine instructions emitted for a function, then insert · 50794839

Bill Wendling authored Jan 26, 2008

a "nop" instruction so that we don't have the function's label associated
with something that it's not supposed to be associated with.

llvm-svn: 46394

50794839

If we have a function like this: · 0862e342

Bill Wendling authored Jan 26, 2008

void bork() {
  int *address = 0;
  *address = 0;
}

It's compiled into LLVM code that looks like this:

define void @bork() noreturn nounwind  {
entry:
        unreachable
}

This is bad on some platforms (like PPC) because it will generate the label for
the function but no body. The label could end up being associated with some
non-code-related data, like a section. This patch emits a "trap" instruction
when the SimplifyCFG pass has removed all code from the function, leaving only
one "unreachable" instruction.

llvm-svn: 46387

0862e342

Jan 25, 2008

DeadStoreElimination can treat byval parameters as if they were allocas for... · 6af19fd1

Owen Anderson authored Jan 25, 2008

DeadStoreElimination can treat byval parameters as if they were allocas for the purpose of removing end-of-function stores.

llvm-svn: 46351

6af19fd1

Jan 22, 2008
- Enable the fix I just checked in, silly me. · f0692641
  Nick Lewycky authored Jan 22, 2008
```
llvm-svn: 46247
```
  f0692641
- Multiply can be evaluated in a different type, so long as the target type has · 78712e5b
  Nick Lewycky authored Jan 22, 2008
```
a smaller bitwidth.

llvm-svn: 46244
```
  78712e5b
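
The multiply commit above (78712e5b) relies on a property of modular arithmetic: the low n bits of a product depend only on the low n bits of the operands, so the multiply may be done in the narrower type. A minimal sketch of that property follows. The helper names (`trunc`, `mul_wide_then_trunc`, `mul_narrow`, `check_equivalence`) are hypothetical and written only for this illustration; they are not LLVM functions.

```python
import random


def trunc(value: int, bits: int) -> int:
    """Keep only the low `bits` bits, modelling an integer truncation."""
    return value & ((1 << bits) - 1)


def mul_wide_then_trunc(a: int, b: int, wide: int, narrow: int) -> int:
    """Multiply in the wide type, then truncate the result."""
    return trunc(trunc(a * b, wide), narrow)


def mul_narrow(a: int, b: int, narrow: int) -> int:
    """Truncate the operands first, then multiply in the narrow type."""
    return trunc(trunc(a, narrow) * trunc(b, narrow), narrow)


def check_equivalence(samples: int = 1000, wide: int = 64,
                      narrow: int = 32, seed: int = 0) -> bool:
    """Compare both evaluation orders on random operands."""
    rng = random.Random(seed)
    for _ in range(samples):
        a = rng.getrandbits(wide)
        b = rng.getrandbits(wide)
        if mul_wide_then_trunc(a, b, wide, narrow) != mul_narrow(a, b, narrow):
            return False
    return True
```

The same argument does not carry over to a wider target type, which matches the commit's restriction to a smaller bitwidth.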
Jan 20, 2008
- Make sure the caller doesn't use freed memory. · afa84da4
  Duncan Sands authored Jan 20, 2008
```
Fixes PR1935.

llvm-svn: 46203
```
  afa84da4
- Initializing an unsigned with ~0UL causes the compiler · fe3bef09
  Duncan Sands authored Jan 20, 2008
```
to complain on x86-64 (gcc 4.1).  Use ~0U instead.

llvm-svn: 46197
```
  fe3bef09
Jan 14, 2008

I noticed that the trampoline straightening transformation could · b5ca2e9f

Duncan Sands authored Jan 14, 2008

drop attributes on varargs call arguments.  Also, it could generate
invalid IR if the transformed call already had the 'nest' attribute
somewhere (this can never happen for code coming from llvm-gcc,
but it's a theoretical possibility).  Fix both problems.

llvm-svn: 45973

b5ca2e9f

Turn a memcpy from a double* into a load/store of double instead of · 92bd7853

Chris Lattner authored Jan 14, 2008

a load/store of i64.  The latter prevents promotion/scalarrepl of the
source and dest in many cases.

This fixes the 300% performance regression of the byval stuff on 
stepanov_v1p2.

llvm-svn: 45945

92bd7853

factor memcpy/memmove simplification out to its own SimplifyMemTransfer · 57974c8d
Chris Lattner authored Jan 13, 2008
```
method, no functionality change.

llvm-svn: 45944
```
57974c8d

Jan 13, 2008
- simplify some code. If we can infer alignment for source and dest that are · 8c5cdddf
  Chris Lattner authored Jan 13, 2008
```
greater than memcpy alignment, and if we lower to load/store, use the best 
alignment info we have.

llvm-svn: 45943
```
  8c5cdddf
- simplify some code by adding an InsertBitCastBefore method, · 5a86612d
  Chris Lattner authored Jan 13, 2008
```
make memmove->memcpy conversion a bit simpler.

llvm-svn: 45942
```
  5a86612d
- Fix PR1907, a nasty miscompilation because instcombine didn't · 5bc253c8
  Chris Lattner authored Jan 13, 2008
```
realize that ne & sgt  was a signed comparison (it was only 
looking at whether the left compare was signed).

llvm-svn: 45937
```
  5bc253c8
- When turning a call to a bitcast function into a direct call, · 781f6549
  Duncan Sands authored Jan 13, 2008
```
if this becomes a varargs call then deal correctly with any
parameter attributes on the newly vararg call arguments.

llvm-svn: 45931
```
  781f6549
Jan 08, 2008
- Implement PR1795, an instcombine hack for forming GEPs with integer pointer arithmetic. · 2940c5c5
  Chris Lattner authored Jan 08, 2008
```
llvm-svn: 45745
```
  2940c5c5
Jan 07, 2008

Small cleanup for handling of type/parameter attribute · b18c30ac
Duncan Sands authored Jan 07, 2008
```
incompatibility.

llvm-svn: 45704
```
b18c30ac
Deleting an empty file. Thanks, /usr/bin/patch! · efb08802
Gordon Henriksen authored Jan 07, 2008
```
llvm-svn: 45675
```
efb08802

With this patch, the LowerGC transformation becomes the · 6047b6e1

Gordon Henriksen authored Jan 07, 2008

ShadowStackCollector, which additionally has reduced overhead with
no sacrifice in portability.

Considering a function @fun with 8 loop-local roots,
ShadowStackCollector introduces the following overhead
(x86):

; shadowstack prologue
        movl    L_llvm_gc_root_chain$non_lazy_ptr, %eax
        movl    (%eax), %ecx
        movl    $___gc_fun, 20(%esp)
        movl    $0, 24(%esp)
        movl    $0, 28(%esp)
        movl    $0, 32(%esp)
        movl    $0, 36(%esp)
        movl    $0, 40(%esp)
        movl    $0, 44(%esp)
        movl    $0, 48(%esp)
        movl    $0, 52(%esp)
        movl    %ecx, 16(%esp)
        leal    16(%esp), %ecx
        movl    %ecx, (%eax)

; shadowstack loop overhead
        (none)

; shadowstack epilogue
        movl    48(%esp), %edx
        movl    %edx, (%ecx)

; shadowstack metadata
        .align  3
___gc_fun:                              # __gc_fun
        .long   8
        .space  4

In comparison to LowerGC:

; lowergc prologue
        movl    L_llvm_gc_root_chain$non_lazy_ptr, %eax
        movl    (%eax), %ecx
        movl    %ecx, 48(%esp)
        movl    $8, 52(%esp)
        movl    $0, 60(%esp)
        movl    $0, 56(%esp)
        movl    $0, 68(%esp)
        movl    $0, 64(%esp)
        movl    $0, 76(%esp)
        movl    $0, 72(%esp)
        movl    $0, 84(%esp)
        movl    $0, 80(%esp)
        movl    $0, 92(%esp)
        movl    $0, 88(%esp)
        movl    $0, 100(%esp)
        movl    $0, 96(%esp)
        movl    $0, 108(%esp)
        movl    $0, 104(%esp)
        movl    $0, 116(%esp)
        movl    $0, 112(%esp)

; lowergc loop overhead
        leal    44(%esp), %eax
        movl    %eax, 56(%esp)
        leal    40(%esp), %eax
        movl    %eax, 64(%esp)
        leal    36(%esp), %eax
        movl    %eax, 72(%esp)
        leal    32(%esp), %eax
        movl    %eax, 80(%esp)
        leal    28(%esp), %eax
        movl    %eax, 88(%esp)
        leal    24(%esp), %eax
        movl    %eax, 96(%esp)
        leal    20(%esp), %eax
        movl    %eax, 104(%esp)
        leal    16(%esp), %eax
        movl    %eax, 112(%esp)

; lowergc epilogue
        movl    48(%esp), %edx
        movl    %edx, (%ecx)

; lowergc metadata
        (none)

llvm-svn: 45670

6047b6e1
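
The prologue and epilogue listed above can be read as pushing and popping a frame on a linked chain. The following is a rough model of that data structure, not the pass itself. The class names (`FrameMap`, `StackEntry`, `ShadowStack`) are hypothetical, and reading the `.long 8` metadata word as the root count is an assumption based on the "8 loop-local roots" in the message.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class FrameMap:
    """Static per-function metadata (assumed: a root count, as in `.long 8`)."""
    num_roots: int


@dataclass
class StackEntry:
    """One shadow-stack frame: link to the caller's frame, map, root slots."""
    next: Optional["StackEntry"]
    frame_map: FrameMap
    roots: List[Optional[object]] = field(default_factory=list)


class ShadowStack:
    def __init__(self) -> None:
        # Models the global chain head (llvm_gc_root_chain in the listing).
        self.head: Optional[StackEntry] = None

    def push(self, frame_map: FrameMap) -> StackEntry:
        """Prologue: zero the root slots, link to the old head, publish."""
        entry = StackEntry(self.head, frame_map, [None] * frame_map.num_roots)
        self.head = entry
        return entry

    def pop(self, entry: StackEntry) -> None:
        """Epilogue: restore the previous chain head."""
        self.head = entry.next

    def live_roots(self) -> List[object]:
        """Walk the chain, as a collector would, and gather non-null roots."""
        found = []
        entry = self.head
        while entry is not None:
            found.extend(r for r in entry.roots if r is not None)
            entry = entry.next
        return found
```

In this model the root slots live inside the frame itself, so nothing needs to be re-registered inside a loop, which is consistent with the "(none)" loop overhead reported for ShadowStackCollector.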

Jan 06, 2008

The transform that tries to turn calls to bitcast functions into · 404eb052

Duncan Sands authored Jan 06, 2008

direct calls bails out unless caller and callee have essentially
equivalent parameter attributes.  This is illogical - the callee's
attributes should be of no relevance here.  Rework the logic, which
incidentally fixes a crash when removed arguments have attributes.

llvm-svn: 45658

404eb052

When transforming a call to a bitcast function into · 55e5090f

Duncan Sands authored Jan 06, 2008

a direct call with cast parameters and cast return
value (if any), instcombine was prepared to cast any
non-void return value into any other, whether castable
or not.  Add a new predicate for testing whether casting
is valid, and check it both for the return value and
(as a cleanup) for the parameters.

llvm-svn: 45657

55e5090f

Jan 05, 2008
- remove a couple more unsafe xforms in the face of overflow. · e666bc27
  Chris Lattner authored Jan 05, 2008
```
llvm-svn: 45613
```
  e666bc27
- remove the (x-y) < 0 comparison xform, it miscompiles · db026d70
  Chris Lattner authored Jan 05, 2008
```
things that are not equality comparisons, for example:
   (2147479553+4096)-2147479553 < 0    !=   (2147479553+4096) < 2147479553

llvm-svn: 45612
```
  db026d70
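
The counterexample in db026d70 can be replayed with explicit 32-bit wraparound. This is a minimal sketch assuming two's-complement i32 arithmetic; `wrap_i32`, `sub_lt_zero` and `direct_lt` are hypothetical helpers written for this illustration.

```python
def wrap_i32(value: int) -> int:
    """Reduce an integer to a signed 32-bit two's-complement value."""
    value &= 0xFFFFFFFF
    return value - (1 << 32) if value & 0x80000000 else value


def sub_lt_zero(x: int, y: int) -> bool:
    """Original form: (x - y) < 0, with the subtraction wrapping at 32 bits."""
    return wrap_i32(wrap_i32(x) - wrap_i32(y)) < 0


def direct_lt(x: int, y: int) -> bool:
    """Transformed form: x < y as a signed 32-bit comparison."""
    return wrap_i32(x) < wrap_i32(y)


# Values from the commit message. The sum overflows i32 and wraps negative.
x = wrap_i32(2147479553 + 4096)
y = 2147479553

original = sub_lt_zero(x, y)     # the difference wraps back to 4096
transformed = direct_lt(x, y)    # the wrapped x is negative, y is positive
```

The two forms disagree for these operands, which is why replacing one with the other is unsafe once overflow is possible.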
Jan 04, 2008
- fix typo · 30e43456
  Wojciech Matyjewicz authored Jan 04, 2008
```
llvm-svn: 45594
```
  30e43456
Dec 29, 2007
- Remove attribution from file headers, per discussion on llvmdev. · f3ebc3f3
  Chris Lattner authored Dec 29, 2007
```
llvm-svn: 45418
```
  f3ebc3f3
- remove attribution from lib Makefiles. · a087a8d2
  Chris Lattner authored Dec 29, 2007
```
llvm-svn: 45415
```
  a087a8d2
- Disable null pointer folding transforms for non-generic address spaces. This... · b053b80b
  Christopher Lamb authored Dec 29, 2007
```
Disable null pointer folding transforms for non-generic address spaces. This should probably be a target-specific predicate based on address space. That way for targets where this isn't applicable the predicate can be optimized away.

llvm-svn: 45403
```
  b053b80b
Dec 28, 2007

Repair a transform that Chris noticed a bug in. Thanks to Nicholas for... · 7363914e

Owen Anderson authored Dec 28, 2007

Repair a transform that Chris noticed a bug in.  Thanks to Nicholas for pointing out my stupid mistakes when writing this patch. :-)

llvm-svn: 45384

7363914e

disable this instcombine xform, it miscompiles: · 5179819b

Chris Lattner authored Dec 28, 2007

define i32 @main() {
entry:
	%z = alloca i32		; <i32*> [#uses=2]
	store i32 0, i32* %z
	%tmp = load i32* %z		; <i32> [#uses=1]
	%sub = sub i32 %tmp, 1		; <i32> [#uses=1]
	%cmp = icmp ult i32 %sub, 0		; <i1> [#uses=1]
	%retval = select i1 %cmp, i32 1, i32 0		; <i32> [#uses=1]
	ret i32 %retval
}

into ret 1, instead of ret 0.

Christopher, please investigate.

llvm-svn: 45383

5179819b
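
The expected result of the IR above can be checked by evaluating it by hand with 32-bit unsigned semantics. A minimal sketch follows; `sub_i32`, `icmp_ult` and `main_result` are hypothetical names that mirror the instructions in the test case.

```python
MASK32 = 0xFFFFFFFF


def sub_i32(a: int, b: int) -> int:
    """32-bit subtraction with wraparound, like `sub i32`."""
    return (a - b) & MASK32


def icmp_ult(a: int, b: int) -> bool:
    """Unsigned less-than on 32-bit values, like `icmp ult i32`."""
    return (a & MASK32) < (b & MASK32)


def main_result() -> int:
    """Straight-line evaluation of @main from the commit message."""
    z = 0                      # store i32 0, i32* %z
    tmp = z                    # %tmp = load i32* %z
    sub = sub_i32(tmp, 1)      # %sub = sub i32 %tmp, 1
    cmp = icmp_ult(sub, 0)     # %cmp = icmp ult i32 %sub, 0
    return 1 if cmp else 0     # select i1 %cmp, i32 1, i32 0
```

No unsigned value is less than zero, so the comparison is false and the function returns 0, as the message states.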

Dec 25, 2007

Don't break critical edges for single-bb loops, this helps with PR1877, though · ef1bbfc7

Chris Lattner authored Dec 25, 2007

it is only a partial fix. This change is noise for most programs, but
speeds up Shootout-C++/matrix by 20%, Ptrdist/ks by 24%, smg2000 by 8%,
hexxagon by 9%, bzip2 by 9% (not sure I trust this), ackerman by 13%, etc.

OTOH, it slows down Shootout/fib2 by 40% (I'll update PR1877 with this info).

llvm-svn: 45354

ef1bbfc7

Dec 24, 2007
- add a -backedge-hack llc-beta option to codegenprepare. · 62a806d5
  Chris Lattner authored Dec 24, 2007
```
When specified, don't split backedges of single-bb loops.
This helps address PR1877.

llvm-svn: 45344
```
  62a806d5
Dec 22, 2007

implement InstCombine/shift-trunc-shift.ll. This allows · 74b2ab59

Chris Lattner authored Dec 22, 2007

us to compile:
#include <math.h>
int t1(double d) { return signbit(d); }

into:

_t1:
	movd	%xmm0, %rax
	shrq	$63, %rax
	ret

instead of:

_t1:
	movd	%xmm0, %rax
	shrq	$32, %rax
	shrl	$31, %eax
	ret

on x86-64.

llvm-svn: 45311

74b2ab59
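
The two instruction sequences above compute the same bit: shifting right by 32, truncating to 32 bits, then shifting right by 31 selects bit 63, exactly as a single shift by 63 does. A minimal sketch of that equivalence follows, assuming IEEE 754 doubles; the function names are hypothetical and only model the assembly.

```python
import struct


def double_bits(d: float) -> int:
    """Reinterpret a double as a 64-bit unsigned integer (like the movd)."""
    return struct.unpack("<Q", struct.pack("<d", d))[0]


def signbit_two_shifts(d: float) -> int:
    """Older sequence: shrq $32, truncate to 32 bits, then shrl $31."""
    high = (double_bits(d) >> 32) & 0xFFFFFFFF
    return high >> 31


def signbit_one_shift(d: float) -> int:
    """Folded sequence: a single shrq $63."""
    return double_bits(d) >> 63


def sequences_agree(values) -> bool:
    """Check that both sequences give the same sign bit for every input."""
    return all(signbit_two_shifts(v) == signbit_one_shift(v) for v in values)
```

For example, `sequences_agree([0.0, -0.0, 1.5, -2.0])` compares both forms on a few doubles, including negative zero, whose sign bit is set.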

Dec 20, 2007
- Implement review feedback, including additional transforms · 7d82bc46
  Christopher Lamb authored Dec 20, 2007
```
(icmp slt (sub A B) 1) -> (icmp sle A B)
(icmp sgt (sub A B) -1) -> (icmp sge A B)

and add testcase.

llvm-svn: 45256
```
  7d82bc46
- Clean up previous patch: PHI uses should not prevent iv reuse if all other... · 26ee54eb
  Evan Cheng authored Dec 20, 2007
```
Clean up previous patch: PHI uses should not prevent iv reuse if all other uses are addresses. This trades a constant multiply for one fewer iv.

llvm-svn: 45251
```
  26ee54eb
- simplify this code with the new m_Zero() pattern. Make sure the select only · 16a51da0
  Chris Lattner authored Dec 20, 2007
```
has a single use, and generalize it to not require N to be a constant.

llvm-svn: 45250
```
  16a51da0
- Allow iv reuse if the user is a PHI node which is in turn used as addresses. · e2a8ba7f
  Evan Cheng authored Dec 19, 2007
```
llvm-svn: 45230
```
  e2a8ba7f
Dec 19, 2007

When inlining through an 'nounwind' call, mark inlined · aa31b925

Duncan Sands authored Dec 19, 2007

calls 'nounwind'.  It is important for correct C++
exception handling that nounwind markings do not get
lost, so this transformation is actually needed for
correctness.

llvm-svn: 45218

aa31b925