  Nov 05, 2005
    • Add support for alignment of allocation instructions. · 848622f8
      Nate Begeman authored
      Add support for specifying alignment and size of setjmp jmpbufs.
      
      No targets currently do anything with this information, nor is it preserved
      in the bytecode representation.  That's coming up next.
      
      llvm-svn: 24196
    • Implement Transforms/TailCallElim/return-undef.ll, a trivial case · 16b29e95
      Chris Lattner authored
      that has been sitting in my inbox since May 18. :)
      
      llvm-svn: 24194
    • Turn sdiv into udiv if both operands have a clear sign bit. This occurs · dd0c1740
      Chris Lattner authored
      a few times in crafty:
      
      OLD:    %tmp.36 = div int %tmp.35, 8            ; <int> [#uses=1]
      NEW:    %tmp.36 = div uint %tmp.35, 8           ; <uint> [#uses=0]
      OLD:    %tmp.19 = div int %tmp.18, 8            ; <int> [#uses=1]
      NEW:    %tmp.19 = div uint %tmp.18, 8           ; <uint> [#uses=0]
      OLD:    %tmp.117 = div int %tmp.116, 8          ; <int> [#uses=1]
      NEW:    %tmp.117 = div uint %tmp.116, 8         ; <uint> [#uses=0]
      OLD:    %tmp.92 = div int %tmp.91, 8            ; <int> [#uses=1]
      NEW:    %tmp.92 = div uint %tmp.91, 8           ; <uint> [#uses=0]
      
      Which all turn into shrs.
      
      llvm-svn: 24190
    • Turn srem -> urem when neither input has its sign bit set. This triggers · e9ff0eaf
      Chris Lattner authored
      8 times in vortex, allowing the srems to be turned into shrs:
      
      OLD:    %tmp.104 = rem int %tmp.5.i37, 16               ; <int> [#uses=1]
      NEW:    %tmp.104 = rem uint %tmp.5.i37, 16              ; <uint> [#uses=0]
      OLD:    %tmp.98 = rem int %tmp.5.i24, 16                ; <int> [#uses=1]
      NEW:    %tmp.98 = rem uint %tmp.5.i24, 16               ; <uint> [#uses=0]
      OLD:    %tmp.91 = rem int %tmp.5.i19, 8         ; <int> [#uses=1]
      NEW:    %tmp.91 = rem uint %tmp.5.i19, 8                ; <uint> [#uses=0]
      OLD:    %tmp.88 = rem int %tmp.5.i14, 8         ; <int> [#uses=1]
      NEW:    %tmp.88 = rem uint %tmp.5.i14, 8                ; <uint> [#uses=0]
      OLD:    %tmp.85 = rem int %tmp.5.i9, 1024               ; <int> [#uses=2]
      NEW:    %tmp.85 = rem uint %tmp.5.i9, 1024              ; <uint> [#uses=0]
      OLD:    %tmp.82 = rem int %tmp.5.i, 512         ; <int> [#uses=2]
      NEW:    %tmp.82 = rem uint %tmp.5.i1, 512               ; <uint> [#uses=0]
      OLD:    %tmp.48.i = rem int %tmp.5.i.i161, 4            ; <int> [#uses=1]
      NEW:    %tmp.48.i = rem uint %tmp.5.i.i161, 4           ; <uint> [#uses=0]
      OLD:    %tmp.20.i2 = rem int %tmp.5.i.i, 4              ; <int> [#uses=1]
      NEW:    %tmp.20.i2 = rem uint %tmp.5.i.i, 4             ; <uint> [#uses=0]
      
      It also occurs 9 times in gcc, but with odd constant divisors (1009 and 61),
      so the payoff isn't as great.
      
      llvm-svn: 24189
  Oct 23, 2005
    • When a function takes a variable number of pointer arguments, with a zero · 11e26b52
      Jeff Cohen authored
      pointer marking the end of the list, the zero *must* be cast to the pointer
      type.  An un-cast zero is a 32-bit int, and at least on x86_64, gcc will
      not extend the zero to 64 bits, thus allowing the upper 32 bits to be
      random junk.
      
      The new END_WITH_NULL macro may be used to annotate such a function
      so that GCC (version 4 or newer) will detect the use of an un-cast zero
      at compile time.
      
      llvm-svn: 23888
  Oct 20, 2005
    • Do NOT touch FP ops with LSR. This fixes a testcase Nate sent me from an · 0c0b38bb
      Chris Lattner authored
      inner loop like this:
      
      LBB_RateConvertMono8AltiVec_2:  ; no_exit
              lis r2, ha16(.CPI_RateConvertMono8AltiVec_0)
              lfs f3, lo16(.CPI_RateConvertMono8AltiVec_0)(r2)
              fmr f3, f3
              fadd f0, f2, f0
              fadd f3, f0, f3
              fcmpu cr0, f3, f1
              bge cr0, LBB_RateConvertMono8AltiVec_2  ; no_exit
      
      to an inner loop like this:
      
      LBB_RateConvertMono8AltiVec_1:  ; no_exit
              fsub f2, f2, f1
              fcmpu cr0, f2, f1
              fmr f0, f2
              bge cr0, LBB_RateConvertMono8AltiVec_1  ; no_exit
      
      Doh! good catch!
      
      llvm-svn: 23838
  Oct 03, 2005
    • Make IVUseShouldUsePostIncValue more aggressive when the use is a PHI. In · f07a587c
      Chris Lattner authored
      particular, it should realize that PHIs use their values in the pred block,
      not in the phi block itself.  This change turns our em3d loop from this:
      
      _test:
              cmpwi cr0, r4, 0
              bgt cr0, LBB_test_2     ; entry.no_exit_crit_edge
      LBB_test_1:     ; entry.loopexit_crit_edge
              li r2, 0
              b LBB_test_6    ; loopexit
      LBB_test_2:     ; entry.no_exit_crit_edge
              li r6, 0
      LBB_test_3:     ; no_exit
              or r2, r6, r6
              lwz r6, 0(r3)
              cmpw cr0, r6, r5
              beq cr0, LBB_test_6     ; loopexit
      LBB_test_4:     ; endif
              addi r3, r3, 4
              addi r6, r2, 1
              cmpw cr0, r6, r4
              blt cr0, LBB_test_3     ; no_exit
      LBB_test_5:     ; endif.loopexit.loopexit_crit_edge
              addi r3, r2, 1
              blr
      LBB_test_6:     ; loopexit
              or r3, r2, r2
              blr
      
      into:
      
      _test:
              cmpwi cr0, r4, 0
              bgt cr0, LBB_test_2     ; entry.no_exit_crit_edge
      LBB_test_1:     ; entry.loopexit_crit_edge
              li r2, 0
              b LBB_test_5    ; loopexit
      LBB_test_2:     ; entry.no_exit_crit_edge
              li r6, 0
      LBB_test_3:     ; no_exit
              lwz r2, 0(r3)
              cmpw cr0, r2, r5
              or r2, r6, r6
              beq cr0, LBB_test_5     ; loopexit
      LBB_test_4:     ; endif
              addi r3, r3, 4
              addi r6, r6, 1
              cmpw cr0, r6, r4
              or r2, r6, r6
              blt cr0, LBB_test_3     ; no_exit
      LBB_test_5:     ; loopexit
              or r3, r2, r2
              blr
      
      
      Unfortunately, this is actually worse code, because the register coalescer
      is getting confused somehow.  If it were doing its job right, it could turn the
      code into this:
      
      _test:
              cmpwi cr0, r4, 0
              bgt cr0, LBB_test_2     ; entry.no_exit_crit_edge
      LBB_test_1:     ; entry.loopexit_crit_edge
              li r6, 0
              b LBB_test_5    ; loopexit
      LBB_test_2:     ; entry.no_exit_crit_edge
              li r6, 0
      LBB_test_3:     ; no_exit
              lwz r2, 0(r3)
              cmpw cr0, r2, r5
              beq cr0, LBB_test_5     ; loopexit
      LBB_test_4:     ; endif
              addi r3, r3, 4
              addi r6, r6, 1
              cmpw cr0, r6, r4
              blt cr0, LBB_test_3     ; no_exit
      LBB_test_5:     ; loopexit
              or r3, r6, r6
              blr
      
      ... which I'll work on next. :)
      
      llvm-svn: 23604