Commits · f359fed9f972ae2c825c86895751611d1fa05ded · Roger Ferrer / llvm-epi-0.8

Nov 13, 2007

Unify CALLSEQ_{START,END}. They take 4 parameters: the chain, two stack · f359fed9

Bill Wendling authored Nov 13, 2007

adjustment fields, and an optional flag. If there is a "dynamic_stackalloc" in
the code, make sure that it's bracketed by CALLSEQ_START and CALLSEQ_END. If
not, then there is the potential for the stack to be changed while the stack's
being used by another instruction (like a call).

This can only result in tears...

llvm-svn: 44037

f359fed9

Nov 12, 2007

Add a flag for indirect branch instructions. · 933b5b7e

Owen Anderson authored Nov 12, 2007

Target maintainers: please check that the instructions for your target are correctly marked.

llvm-svn: 44012

933b5b7e

Nov 11, 2007

Use TableGen to emit information for dwarf register numbers. · 4edfea43

Anton Korobeynikov authored Nov 11, 2007

This makes DwarfRegNum to accept list of numbers instead.
Added three different "flavours", but only slightly tested on x86-32/linux.
Please check another subtargets if possible,

llvm-svn: 43997

4edfea43

Nov 10, 2007
- Add CCAssignToStackABISizeAlign for convenience in · b988e7e8
  Dale Johannesen authored Nov 10, 2007
```
dealing with types whose size & alignment are
different on different subtargets.  Use it for x86 f80.

llvm-svn: 43988
```
  b988e7e8
- Update tailcall code to include inline attribute operand for memcpy. · d2c16ff9
  Arnold Schwaighofer authored Nov 10, 2007
```
llvm-svn: 43978
```
  d2c16ff9
Nov 09, 2007

Unbreak x86-64 jumptable. · fb13fd6f
Evan Cheng authored Nov 09, 2007
```
llvm-svn: 43955
```
fb13fd6f
Revert previous rewrite per chris's comments. · dfb85c78
Dale Johannesen authored Nov 09, 2007
```
llvm-svn: 43950
```
dfb85c78

Much improved pic jumptable codegen: · 797d56ff

Evan Cheng authored Nov 09, 2007

Then:
        call    "L1$pb"
"L1$pb":
        popl    %eax
		...
LBB1_1: # entry
        imull   $4, %ecx, %ecx
        leal    LJTI1_0-"L1$pb"(%eax), %edx
        addl    LJTI1_0-"L1$pb"(%ecx,%eax), %edx
        jmpl    *%edx

        .align  2
        .set L1_0_set_3,LBB1_3-LJTI1_0
        .set L1_0_set_2,LBB1_2-LJTI1_0
        .set L1_0_set_5,LBB1_5-LJTI1_0
        .set L1_0_set_4,LBB1_4-LJTI1_0
LJTI1_0:
        .long    L1_0_set_3
        .long    L1_0_set_2

Now:
        call    "L1$pb"
"L1$pb":
        popl    %eax
		...
LBB1_1: # entry
        addl    LJTI1_0-"L1$pb"(%eax,%ecx,4), %eax
        jmpl    *%eax

		.align  2
		.set L1_0_set_3,LBB1_3-"L1$pb"
		.set L1_0_set_2,LBB1_2-"L1$pb"
		.set L1_0_set_5,LBB1_5-"L1$pb"
		.set L1_0_set_4,LBB1_4-"L1$pb"
LJTI1_0:
        .long    L1_0_set_3
        .long    L1_0_set_2

llvm-svn: 43924

797d56ff

Rewrite Dwarf number handling per review comments. · 04fd8208
Dale Johannesen authored Nov 09, 2007
```
llvm-svn: 43918
```
04fd8208

Nov 07, 2007
- Complete conditionalization of Dwarf reg numbers. · 1b9de4dd
  Dale Johannesen authored Nov 07, 2007
```
Would somebody not on Darwin please make sure this
doesn't break anything.  Exception handling failures
would be the most likely symptom.

llvm-svn: 43844
```
  1b9de4dd
- Interchange Dwarf numbers of ESP and EBP on x86 Darwin. · fbe69d2c
  Dale Johannesen authored Nov 07, 2007
```
Much improvement in exception handling.

llvm-svn: 43794
```
  fbe69d2c
Nov 06, 2007
- Move the LowerMEMCPY and LowerMEMCPYCall to a common place. · fa0df55b
  Rafael Espindola authored Nov 05, 2007
```
Thanks for the suggestions Bill :-)

llvm-svn: 43742
```
  fa0df55b
Nov 05, 2007

Use movups to spill / restore SSE registers on targets where stacks alignment is · 9337929a
Evan Cheng authored Nov 05, 2007
```
less than 16. This is a temporary solution until dynamic stack alignment is
implemented.

llvm-svn: 43703
```
9337929a

Eliminate the remaining uses of getTypeSize. This · 283207a7

Duncan Sands authored Nov 05, 2007

should only effect x86 when using long double.  Now
12/16 bytes are output for long double globals (the
exact amount depends on the alignment).  This brings
globals in line with the rest of LLVM: the space
reserved for an object is now always the ABI size.
One tricky point is that only 10 bytes should be
output for long double if it is a field in a packed
struct, which is the reason for the additional
argument to EmitGlobalConstant.

llvm-svn: 43688

283207a7

Nov 04, 2007
- Fix PR1761 by not printing (rip) suffix when in -static mode. · 9329e780
  Chris Lattner authored Nov 04, 2007
```
Evan, please review this.

llvm-svn: 43680
```
  9329e780
- Fix PR1763 by allowing the 'q' constraint to work with 64-bit · 296160d4
  Chris Lattner authored Nov 04, 2007
```
regs on x86-64.

llvm-svn: 43669
```
  296160d4
Nov 02, 2007
- Unbreak tailcall opt. · 2b93a20b
  Evan Cheng authored Nov 02, 2007
```
llvm-svn: 43646
```
  2b93a20b
- add a note · 389d430c
  Chris Lattner authored Nov 02, 2007
```
llvm-svn: 43642
```
  389d430c
- Missing a getNumOperands check. · e453ff49
  Evan Cheng authored Nov 02, 2007
```
llvm-svn: 43630
```
  e453ff49
Nov 01, 2007
- Silence, accersed warning · b7cabbe2
  Bill Wendling authored Nov 01, 2007
```
llvm-svn: 43609
```
  b7cabbe2
Oct 31, 2007
- Make ARM and X86 LowerMEMCPY identical by moving the isThumb check into getMaxInlineSizeThreshold · 419b6d7c
  Rafael Espindola authored Oct 31, 2007
```
and by restructuring the X86 version.

New I just have to move this to a common place :-)

llvm-svn: 43554
```
  419b6d7c
- Make ARM an X86 memcpy expansion more similar to each other. · 063f1773
  Rafael Espindola authored Oct 31, 2007
```
Now both subtarget define getMaxInlineSizeThreshold and the expansion uses it.

This should not change generated code.

llvm-svn: 43552
```
  063f1773
- Make i64=expand_vector_elt(v2i64) work in 32-bit mode. · b066c1f2
  Dale Johannesen authored Oct 31, 2007
```
llvm-svn: 43535
```
  b066c1f2
Oct 30, 2007
- Add missing SSE builtins: CVTPD2PI, CVTPS2PI, · d50c8bce
  Dale Johannesen authored Oct 30, 2007
```
CVTTPD2PI, CVTTPS2PI, CVTPI2PD, CVTPI2PS.

llvm-svn: 43523
```
  d50c8bce
- Fix for visibility warnings generated by gcc-4.2. · b508c53c
  Duncan Sands authored Oct 30, 2007
```
llvm-svn: 43500
```
  b508c53c
- Add missing MMX PSUBQ. · 6aa304e5
  Dale Johannesen authored Oct 30, 2007
```
llvm-svn: 43488
```
  6aa304e5
Oct 29, 2007
- Enable more fold (sext (load x)) -> (sext (truncate (sextload x))) · e106e2f1
  Evan Cheng authored Oct 29, 2007
```
transformation. Previously, it's restricted by ensuring the number of load uses
is one. Now the restriction is loosened up by allowing setcc uses to be
"extended" (e.g. setcc x, c, eq -> setcc sext(x), sext(c), eq).

llvm-svn: 43465
```
  e106e2f1
- Avoid doing something dumb like rewriting using a 64-bit iv in 32-bit mode. · 7b3f7fea
  Evan Cheng authored Oct 29, 2007
```
llvm-svn: 43446
```
  7b3f7fea
- add a note. · 909a54cc
  Chris Lattner authored Oct 29, 2007
```
llvm-svn: 43444
```
  909a54cc
- Add support for the x86-64 'q' regigster modifier, and add support for the · 5e99fd8c
  Chris Lattner authored Oct 29, 2007
```
b/h/w/k/q inline asm memory modifiers, which are just ignored.  This fixes
PR1748 and CodeGen/X86/2007-10-28-inlineasm-q-modifier.ll

llvm-svn: 43430
```
  5e99fd8c
Oct 28, 2007
- New entry. · c826ac53
  Evan Cheng authored Oct 28, 2007
```
llvm-svn: 43420
```
  c826ac53
Oct 26, 2007

Fix off-by-one stack offset computations (dwarf information) for callee-saved · d07d6a41

Anton Korobeynikov authored Oct 26, 2007

registers in case, when FP pointer was eliminated. This should fixes misc. random
EH-related crahses, when stuff is compiled with -fomit-frame-pointer.
Thanks Duncan for nailing this bug!

llvm-svn: 43381

d07d6a41

Loosen up iv reuse to allow reuse of the same stride but a larger type when... · 7f3d0247

Evan Cheng authored Oct 26, 2007

Loosen up iv reuse to allow reuse of the same stride but a larger type when truncating from the larger type to smaller type is free.
e.g.
Turns this loop:
LBB1_1: # entry.bb_crit_edge
        xorl    %ecx, %ecx
        xorw    %dx, %dx
        movw    %dx, %si
LBB1_2: # bb
        movl    L_X$non_lazy_ptr, %edi
        movw    %si, (%edi)
        movl    L_Y$non_lazy_ptr, %edi
        movw    %dx, (%edi)
		addw    $4, %dx
		incw    %si
		incl    %ecx
		cmpl    %eax, %ecx
		jne     LBB1_2  # bb
	
into

LBB1_1: # entry.bb_crit_edge
        xorl    %ecx, %ecx
        xorw    %dx, %dx
LBB1_2: # bb
        movl    L_X$non_lazy_ptr, %esi
        movw    %cx, (%esi)
        movl    L_Y$non_lazy_ptr, %esi
        movw    %dx, (%esi)
        addw    $4, %dx
		incl    %ecx
        cmpl    %eax, %ecx
        jne     LBB1_2  # bb

llvm-svn: 43375

7f3d0247

Oct 22, 2007
- Fix the folding of multiplication into addresses on x86, which was broken · bf474959
  Dan Gohman authored Oct 22, 2007
```
by the recent {U,S}MUL_LOHI changes.

llvm-svn: 43230
```
  bf474959
- Fix an unfolding bug. · c92446af
  Evan Cheng authored Oct 22, 2007
```
llvm-svn: 43212
```
  c92446af
Oct 21, 2007
- Allow for copysign having f80 second argument. · 8ee70112
  Dale Johannesen authored Oct 21, 2007
```
Fixes 5550319.

llvm-svn: 43205
```
  8ee70112
Oct 20, 2007
- Resolve unfold tables ambiguity. · 45e096c7
  Evan Cheng authored Oct 19, 2007
```
llvm-svn: 43194
```
  45e096c7
Oct 19, 2007

Local spiller optimization: · 35ff7937

Evan Cheng authored Oct 19, 2007

Turn a store folding instruction into a load folding instruction. e.g.
     xorl  %edi, %eax
     movl  %eax, -32(%ebp)
     movl  -36(%ebp), %eax
     orl   %eax, -32(%ebp)
=>
     xorl  %edi, %eax
     orl   -36(%ebp), %eax
     mov   %eax, -32(%ebp)
This enables the unfolding optimization for a subsequent instruction which will
also eliminate the newly introduced store instruction.

llvm-svn: 43192

35ff7937

Add support for byval function whose argument is not 32 bit aligned. · 846c19dd

Rafael Espindola authored Oct 19, 2007

To do this it is necessary to add a "always inline" argument to the
memcpy node. For completeness I have also added this node to memmove
and memset.  I have also added getMem* functions, because the extra
argument makes it cumbersome to use getNode and because I get confused
by it :-)

llvm-svn: 43172

846c19dd

- Added getOpcodeAfterMemoryUnfold(). It doesn't unfold an instruction, but... · 463e2ab0

Evan Cheng authored Oct 18, 2007

- Added getOpcodeAfterMemoryUnfold(). It doesn't unfold an instruction, but only returns the opcode of the instruction post unfolding.
- Fix some copy+paste bugs.

llvm-svn: 43153

463e2ab0