Commits · 8d6da9142c4f05c533fa5c086fd971913bad5a61 · Roger Ferrer / llvm-epi-0.8

Oct 14, 2007
- When coalescing an EXTRACT_SUBREG and the dst register is a physical register, · 8d6da914
  Evan Cheng authored Oct 14, 2007
```
the source register will be coalesced to the super register of the LHS. Properly
merge in the live ranges of the resulting coalesced interval that were part of
the original source interval to the live interval of the super-register.

llvm-svn: 42961
```
  8d6da914
- Revert 42908 for now. · cdf36091
  Evan Cheng authored Oct 14, 2007
```
llvm-svn: 42960
```
  cdf36091
- Fix type mismatch error in PPC Altivec (only causes · 2f6b6d6f
  Dale Johannesen authored Oct 14, 2007
```
a problem when asserts are on).  From vecLib.

llvm-svn: 42959
```
  2f6b6d6f
- Disable some compile-time optimizations on PPC · 19db093b
  Dale Johannesen authored Oct 14, 2007
```
long double.

llvm-svn: 42958
```
  19db093b
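The live-range merge described in 8d6da914 above can be pictured with a toy model. This is only a sketch under stated assumptions, not LLVM's live interval API: `LiveRange` and `mergeRanges` are hypothetical names, and a range is simply a half-open span of instruction indices.

```cpp
#include <algorithm>
#include <cassert>
#include <vector>

// Toy model (assumption): a live range is a half-open [start, end) span
// of instruction indices.
struct LiveRange {
    unsigned start;
    unsigned end;
};

// Hypothetical helper, not an LLVM function: merge the ranges in `extra`
// (standing in for the ranges that came from the original source interval)
// into `interval` (standing in for the super-register's live interval).
// The result is sorted, and ranges that overlap or touch are fused.
std::vector<LiveRange> mergeRanges(std::vector<LiveRange> interval,
                                   const std::vector<LiveRange> &extra) {
    interval.insert(interval.end(), extra.begin(), extra.end());
    std::sort(interval.begin(), interval.end(),
              [](const LiveRange &a, const LiveRange &b) {
                  return a.start < b.start;
              });

    std::vector<LiveRange> merged;
    for (const LiveRange &r : interval) {
        if (!merged.empty() && r.start <= merged.back().end)
            merged.back().end = std::max(merged.back().end, r.end);
        else
            merged.push_back(r);
    }
    return merged;
}
```

Merging `{[0,4), [10,14)}` with `{[4,8), [12,20)}` in this model yields two ranges, `[0,8)` and `[10,20)`.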
Oct 13, 2007

- Clarify that fastcc has a problem with nested function · 29af26f1
  Duncan Sands authored Oct 13, 2007
```
trampolines, rather than with nested functions themselves.

llvm-svn: 42955
```
  29af26f1

Enhance the truncstore optimization code to handle shifted · f47e3062

Chris Lattner authored Oct 13, 2007

values and propagate demanded bits through them in simple cases.

This allows this code:
void foo(char *P) {
   strcpy(P, "abc");
}
to compile to:

_foo:
        ldrb r3, [r1]
        ldrb r2, [r1, #+1]
        ldrb r12, [r1, #+2]!
        ldrb r1, [r1, #+1]
        strb r1, [r0, #+3]
        strb r2, [r0, #+1]
        strb r12, [r0, #+2]
        strb r3, [r0]
        bx lr

instead of:

_foo:
        ldrb r3, [r1, #+3]
        ldrb r2, [r1, #+2]
        orr r3, r2, r3, lsl #8
        ldrb r2, [r1, #+1]
        ldrb r1, [r1]
        orr r2, r1, r2, lsl #8
        orr r3, r2, r3, lsl #16
        strb r3, [r0]
        mov r2, r3, lsr #24
        strb r2, [r0, #+3]
        mov r2, r3, lsr #16
        strb r2, [r0, #+2]
        mov r3, r3, lsr #8
        strb r3, [r0, #+1]
        bx lr

testcase here: test/CodeGen/ARM/truncstore-dag-combine.ll

This also helps occasionally for X86 and other cases not involving 
unaligned load/stores.

llvm-svn: 42954

f47e3062
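Why the shorter ARM sequence in f47e3062 is equivalent can be checked with a small model. The sketch below is an illustration only, not the DAG combiner code: `combine`, `storeViaWord`, and `storeDirect` are hypothetical names. It assumes the four loaded bytes are combined into a little-endian 32-bit word, as the `orr`/`lsl` sequence above does.

```cpp
#include <cassert>
#include <cstdint>

// Build one 32-bit word from four bytes, mirroring the orr/lsl chain in
// the unoptimized sequence (byte 0 ends up in the low 8 bits).
uint32_t combine(const uint8_t b[4]) {
    return uint32_t(b[0]) | (uint32_t(b[1]) << 8) |
           (uint32_t(b[2]) << 16) | (uint32_t(b[3]) << 24);
}

// Unoptimized shape: each byte store truncates a shifted copy of the word.
void storeViaWord(uint8_t *dst, const uint8_t src[4]) {
    uint32_t w = combine(src);
    for (int i = 0; i < 4; ++i)
        dst[i] = static_cast<uint8_t>(w >> (8 * i));
}

// Optimized shape: the truncating store of (w >> 8*i) only demands bits
// [8*i, 8*i + 8) of w, and those bits come from src[i] alone, so the
// combine and the shifts can be bypassed.
void storeDirect(uint8_t *dst, const uint8_t src[4]) {
    for (int i = 0; i < 4; ++i)
        dst[i] = src[i];
}
```

Both functions write the same four bytes for any input; the second simply never materializes the 32-bit word.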

Add a simple optimization to simplify the input to · 5e6fe054

Chris Lattner authored Oct 13, 2007

truncate and truncstore instructions, based on the 
knowledge that they don't demand the top bits.

llvm-svn: 42952

5e6fe054
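The idea behind 5e6fe054, that a truncate does not demand the top bits of its input, can be shown in a few lines. This is a minimal sketch with hypothetical function names (`trunc8`, `before`, `after`), not the code from the commit.

```cpp
#include <cassert>
#include <cstdint>

// Truncate a 32-bit value to 8 bits. Only bits 0..7 of the input are
// "demanded" by this operation.
uint8_t trunc8(uint32_t v) { return static_cast<uint8_t>(v); }

// Before simplification: the truncate's input ORs in a term that can only
// affect bits 8 and above.
uint8_t before(uint32_t lo, uint32_t hi) { return trunc8(lo | (hi << 8)); }

// After simplification: (hi << 8) contributes nothing to bits 0..7, so the
// OR can be dropped from the truncate's input.
uint8_t after(uint32_t lo, uint32_t /*hi*/) { return trunc8(lo); }
```

For every pair of inputs `before` and `after` return the same byte, which is what licenses removing the extra operations.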

If the power of 5 is exact, and the reciprocal exact, the error is zero not... · c799fe9e

Neil Booth authored Oct 13, 2007

If the power of 5 is exact, and the reciprocal is exact, the error is zero, not one half-ulp.  This prevents an infinite loop in rare cases.

llvm-svn: 42950

c799fe9e

Local spiller optimization: · b6307650

Evan Cheng authored Oct 13, 2007

Turn this:
movswl  %ax, %eax
movl    %eax, -36(%ebp)
xorl    %edi, -36(%ebp)
into
movswl  %ax, %eax
xorl    %edi, %eax
movl    %eax, -36(%ebp)
by unfolding the load / store xorl into an xorl and a store when we know the
value in the spill slot is available in a register. This doesn't change the
number of instructions but reduces the number of times memory is accessed.

Also unfold some load-folding instructions and reuse the value when a similar
situation presents itself.

llvm-svn: 42947

b6307650
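The effect of the unfolding in b6307650 can be modeled with a toy machine. Everything in this sketch is an assumption made for illustration: `Machine`, `runFolded`, and `runUnfolded` are hypothetical names, `slot` stands in for `-36(%ebp)`, and a read-modify-write instruction is counted as two memory accesses.

```cpp
#include <cassert>
#include <cstdint>

// Toy machine state (assumption): two registers, one spill slot, and a
// counter of memory accesses.
struct Machine {
    int32_t eax = 0;
    int32_t edi = 0;
    int32_t slot = 0;     // stands in for -36(%ebp)
    int memAccesses = 0;
};

// movswl %ax, %eax: sign-extend the low 16 bits of eax into eax.
void movswl(Machine &m) {
    m.eax = static_cast<int16_t>(static_cast<uint16_t>(m.eax));
}

// Original sequence: store eax to the slot, then xor edi into the slot.
void runFolded(Machine &m) {
    movswl(m);
    m.slot = m.eax;       // movl %eax, -36(%ebp): one store
    m.memAccesses += 1;
    m.slot ^= m.edi;      // xorl %edi, -36(%ebp): one load and one store
    m.memAccesses += 2;
}

// Unfolded sequence: xor in the register, then store once.
void runUnfolded(Machine &m) {
    movswl(m);
    m.eax ^= m.edi;       // xorl %edi, %eax: no memory access
    m.slot = m.eax;       // movl %eax, -36(%ebp): one store
    m.memAccesses += 1;
}
```

Both sequences leave the same value in the slot and use three instructions, but in this model the unfolded form touches memory once instead of three times. The model ignores that `%eax` ends up holding a different value in the two sequences.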

Change unfoldMemoryOperand(). User is now responsible for passing in the · 7082dcf6

Evan Cheng authored Oct 13, 2007

register used by the unfolded instructions. User can also specify whether to
unfold the load, the store, or both.

llvm-svn: 42946

7082dcf6

- Optionally create a MachineInstr without default implicit operands. · 9490e0d0
  Evan Cheng authored Oct 13, 2007
```
llvm-svn: 42945
```
  9490e0d0

Oct 12, 2007
- Correcting the corrections. Bad bad baaad emacs! · e8d0bf26
  Arnold Schwaighofer authored Oct 12, 2007
```
llvm-svn: 42935
```
  e8d0bf26
- Corrected many typing errors. And removed 'nest' parameter handling · 1f0da1fe
  Arnold Schwaighofer authored Oct 12, 2007
```
for fastcc from X86CallingConv.td.  This means that nested functions
are not supported for calling convention 'fastcc'.

llvm-svn: 42934
```
  1f0da1fe
- Dest type is always i8 *. This allows some simplification. · 371e6ca6
  Devang Patel authored Oct 12, 2007
```
Do not filter memmove.

llvm-svn: 42930
```
  371e6ca6
- Due to the new tail call optimization, trampolines can no · a6286bd5
  Duncan Sands authored Oct 12, 2007
```
longer be created for fastcc functions.

llvm-svn: 42925
```
  a6286bd5
- ppc long double. Implement fabs and fneg. · 61c574fc
  Dale Johannesen authored Oct 12, 2007
```
llvm-svn: 42924
```
  61c574fc
- Update. · 409fa443
  Evan Cheng authored Oct 12, 2007
```
llvm-svn: 42922
```
  409fa443
- Fix a bug in my patch last night that broke InstCombine/2007-10-12-Crash.ll · ad618f66
  Chris Lattner authored Oct 12, 2007
```
llvm-svn: 42920
```
  ad618f66
- Implement i64->ppcf128 conversions. · a1a4a9eb
  Dale Johannesen authored Oct 12, 2007
```
llvm-svn: 42919
```
  a1a4a9eb
- Did mean to leave this in. INSERT_SUBREG isn't being coalesced yet. · 1410b851
  Evan Cheng authored Oct 12, 2007
```
llvm-svn: 42916
```
  1410b851
- Remove duplicate comment. · d502a820
  Neil Booth authored Oct 12, 2007
```
llvm-svn: 42913
```
  d502a820
- Implement correctly-rounded decimal->binary conversion, i.e. conversion · b93d90e9
  Neil Booth authored Oct 12, 2007
```
from user input strings.

Such conversions are more intricate and subtle than they may appear;
it is unlikely I have got it completely right first time.  I would
appreciate being informed of any bugs and incorrect roundings you
might discover.

llvm-svn: 42912
```
  b93d90e9
- Remove a field that was never used. · e9dbe094
  Neil Booth authored Oct 12, 2007
```
llvm-svn: 42911
```
  e9dbe094
- If we're trying to be arbitrary precision, unsigned char clearly won't cut it.... · 146fdb3e
  Neil Booth authored Oct 12, 2007
```
If we're trying to be arbitrary precision, unsigned char clearly won't cut it.  Needed for dec->bin conversions.

llvm-svn: 42910
```
  146fdb3e
- Don't attempt to mask no bits · 7e74b17a
  Neil Booth authored Oct 12, 2007
```
llvm-svn: 42909
```
  7e74b17a
- Change the names used for internal labels to use the current · dc35bd79
  Dan Gohman authored Oct 12, 2007
```
function symbol name instead of a codegen-assigned function
number.

Thanks Evan! :-)

llvm-svn: 42908
```
  dc35bd79
- Fix some corner cases with vectors in copyToRegs and copyFromRegs. · e3583817
  Dan Gohman authored Oct 12, 2007
```
llvm-svn: 42907
```
  e3583817
- Add support to SplitVectorOp for powi, where the second operand · 4f056f3c
  Dan Gohman authored Oct 12, 2007
```
is a scalar integer.

llvm-svn: 42906
```
  4f056f3c
- Mark vector ctpop, cttz, and ctlz as Expand on x86. · 8d978da3
  Dan Gohman authored Oct 12, 2007
```
llvm-svn: 42905
```
  8d978da3
- Mark vector pow, ctpop, cttz, and ctlz as Expand on PowerPC. · 9013eaff
  Dan Gohman authored Oct 12, 2007
```
llvm-svn: 42904
```
  9013eaff
- Restrict EXTRACT_SUBREG coalescing to avoid negative performance impact. · 11330f75
  Evan Cheng authored Oct 12, 2007
```
llvm-svn: 42903
```
  11330f75
- EXTRACT_SUBREG coalescing support. The coalescer now treats EXTRACT_SUBREG like · aa2d6ef8
  Evan Cheng authored Oct 12, 2007
```
(almost) a register copy. However, it always coalesces to the register of the
RHS (the super-register). All uses of the result of an EXTRACT_SUBREG are
sub-register uses, which adds subtle complications to load folding, spiller
rewrite, etc.

llvm-svn: 42899
```
  aa2d6ef8
- Some clean up. · 89d59169
  Evan Cheng authored Oct 12, 2007
```
llvm-svn: 42898
```
  89d59169
- Fold load / store into MOV32to32_ and MOV16to16_. · 09c0fe0a
  Evan Cheng authored Oct 12, 2007
```
llvm-svn: 42895
```
  09c0fe0a
- Flag MOV32to32_ with EXTRACT_SUBREG. They should not be scheduled apart. · f8c23f07
  Evan Cheng authored Oct 12, 2007
```
llvm-svn: 42894
```
  f8c23f07
- eliminate warning · 5d8f7e0c
  Gabor Greif authored Oct 12, 2007
```
llvm-svn: 42892
```
  5d8f7e0c
- Fix some 80 column violations. · d8675e49
  Chris Lattner authored Oct 12, 2007
```
Fix DecomposeSimpleLinearExpr to handle simple constants better.
Don't nuke gep(bitcast(allocation)) if the bitcast(allocation) will
fold the allocation.  This fixes PR1728 and Instcombine/malloc3.ll

llvm-svn: 42891
```
  d8675e49
- PPC long double. Implement a couple more conversions. · 05ff9e8c
  Dale Johannesen authored Oct 12, 2007
```
llvm-svn: 42888
```
  05ff9e8c
- Add intrinsics for sin, cos, and pow. These use llvm_anyfloat_ty, and so · be37007e
  Dan Gohman authored Oct 12, 2007
```
may be overloaded with vector types. And add a testcase for codegen for
these.

llvm-svn: 42885
```
  be37007e
- Codegen support for vector intrinsics. · 2a7de416
  Dan Gohman authored Oct 11, 2007
```
Factor out the code that expands the "nasty scalar code" for unrolling
vectors into a separate routine, teach it how to handle mixed
vector/scalar operands, as seen in powi, and use it for several operators,
including sin, cos, powi, and pow.

Add support in SplitVectorOp for fpow, fpowi and for several unary
operators.

llvm-svn: 42884
```
  2a7de416
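The unrolling described in 2a7de416 above, including the mixed vector/scalar operands seen in powi, can be sketched as follows. This illustrates the general idea only and is not the legalizer code: `unrollUnary` and `unrollPowi` are hypothetical names, and `std::array<double, N>` stands in for a vector value.

```cpp
#include <array>
#include <cassert>
#include <cmath>
#include <cstddef>

// Unroll a unary vector operation (for example sin or cos) into one
// scalar operation per lane.
template <std::size_t N, typename Op>
std::array<double, N> unrollUnary(const std::array<double, N> &v, Op op) {
    std::array<double, N> out{};
    for (std::size_t i = 0; i < N; ++i)
        out[i] = op(v[i]);
    return out;
}

// Mixed vector/scalar case, as in powi: the first operand is a vector,
// the second is a single scalar integer that every lane shares.
template <std::size_t N>
std::array<double, N> unrollPowi(const std::array<double, N> &v, int exp) {
    std::array<double, N> out{};
    for (std::size_t i = 0; i < N; ++i)
        out[i] = std::pow(v[i], exp);
    return out;
}
```

The point of the second function is that the scalar operand is not split or indexed per lane; it is passed unchanged to each scalar operation.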