- Mar 20, 2011
-
Jakob Stoklund Olesen authored
llvm-svn: 127960
-
Jakob Stoklund Olesen authored
llvm-svn: 127959
-
- Mar 19, 2011
-
Daniel Dunbar authored
Revert r127953 ("... to canonicalize IR"); it broke a lot of things. llvm-svn: 127954
-
Evan Cheng authored
SimplifyCFG has stopped duplicating returns into predecessors to canonicalize IR to have a single return block (at least getting there) for optimizations. This is general goodness, but it would prevent some tail-call optimizations. One specific case is code like this:

  int f1(void); int f2(void); int f3(void);
  int f4(void); int f5(void); int f6(void);
  int foo(int x) {
    switch(x) {
      case 1: return f1();
      case 2: return f2();
      case 3: return f3();
      case 4: return f4();
      case 5: return f5();
      case 6: return f6();
    }
  }

=>

  LBB0_2: ## %sw.bb
    callq _f1
    popq %rbp
    ret
  LBB0_3: ## %sw.bb1
    callq _f2
    popq %rbp
    ret
  LBB0_4: ## %sw.bb3
    callq _f3
    popq %rbp
    ret

This patch teaches codegenprep to duplicate returns when the return value is a phi and where the phi operands are produced by tail calls followed by an unconditional branch:

  sw.bb7:                                  ; preds = %entry
    %call8 = tail call i32 @f5() nounwind
    br label %return
  sw.bb9:                                  ; preds = %entry
    %call10 = tail call i32 @f6() nounwind
    br label %return
  return:
    %retval.0 = phi i32 [ %call10, %sw.bb9 ], [ %call8, %sw.bb7 ], ... [ 0, %entry ]
    ret i32 %retval.0

This allows codegen to generate better code like this:

  LBB0_2: ## %sw.bb
    jmp _f1  ## TAILCALL
  LBB0_3: ## %sw.bb1
    jmp _f2  ## TAILCALL
  LBB0_4: ## %sw.bb3
    jmp _f3  ## TAILCALL

rdar://9147433

llvm-svn: 127953
-
Evan Cheng authored
llvm-svn: 127952
-
Nadav Rotem authored
Legalize vector INT_TO_FP on targets that do not have native support for this operation (such as X86). The legalized code uses two vector INT_TO_FP operations and is faster than scalarizing. llvm-svn: 127951
-
Johnny Chen authored
The relevant instruction table entries were changed some time ago to no longer take <Rt2> as an operand. Modify ARMDisassemblerCore.cpp to accommodate the change and add a test case. llvm-svn: 127935
-
Ted Kremenek authored
Tweak CrashRecoveryContextCleanup to provide an easy method for clients to select between 'delete' and 'destructor' cleanups, and allow the destructor of CrashRecoveryContextCleanupRegister to be pseudo re-entrant. llvm-svn: 127929
-
Ted Kremenek authored
Tweak CrashRecoveryContext::GetCurrent() to return quickly if 'gCrashRecoveryEnabled' is false. This avoids needing to go to thread-local storage in the performance-sensitive case where we are compiling code. llvm-svn: 127928
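
A minimal C sketch of the fast-path idea, using hypothetical names (gRecoveryEnabled, CurrentContext, GetCurrentContext) rather than LLVM's actual symbols; the point is only that a plain global is tested before any thread-local lookup:

  #include <stdbool.h>
  #include <stddef.h>

  /* Hypothetical names, not LLVM's actual code. */
  static bool gRecoveryEnabled = false;              /* cheap plain global      */
  static _Thread_local void *CurrentContext = NULL;  /* TLS lookup is the cost  */

  void *GetCurrentContext(void) {
      /* Fast path: recovery disabled (the common case while compiling code),
         so return immediately without touching thread-local storage. */
      if (!gRecoveryEnabled)
          return NULL;
      return CurrentContext;
  }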
-
Devang Patel authored
If an AllocaInst referred to by a DbgDeclareInst is used by a LoadInst, then the LoadInst should also get a corresponding llvm.dbg.value intrinsic. llvm-svn: 127924
-
Devang Patel authored
llvm-svn: 127923
-
Devang Patel authored
llvm-svn: 127922
-
- Mar 18, 2011
-
Jim Grosbach authored
llvm-svn: 127918
-
Owen Anderson authored
Add support to the ARM asm parser for the register-shifted-register forms of basic instructions like ADD. More work left to be done to support other instances of shifter ops in the ISA. llvm-svn: 127917
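
For illustration, a small C function (hypothetical name add_shifted) that a register-shifted-register operand can cover; the ARM form in the comment is an assumption about what such matching can emit, not verified output of this change:

  /* Register-shifted-register operand: the shift amount lives in a register,
     so the add and the shift can fold into one instruction, e.g.
         add r0, r0, r1, lsl r2
     (illustrative; actual codegen depends on target mode and optimization level). */
  int add_shifted(int a, int b, int amount) {
      return a + (b << amount);
  }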
-
Jim Grosbach authored
Proof-of-concept code that code-gens a module to an in-memory MachO object. This will be hooked up to a run-time dynamic linker library (see: llvm-rtdyld for similarly conceptual work for that part) which will take the compiled object and link it together with the rest of the system, providing back to the JIT a table of available symbols which will be used to respond to the getPointerTo*() queries. llvm-svn: 127916
-
Jakob Stoklund Olesen authored
The llvm.dbg.value intrinsic refers to SSA values, not virtual registers, so we should be able to extend the range of a value by tracking that value through register copies. This greatly improves the debug value tracking for function arguments that for some reason are copied to a second virtual register at the end of the entry block. We only extend the debug value range where its register is killed. All original llvm.dbg.value locations are still respected. Copies from physical registers are ignored. That should not be a problem since the entry block already adds DBG_VALUE instructions for the virtual registers holding the function arguments. llvm-svn: 127912
-
Eli Friedman authored
llvm-svn: 127909
-
Owen Anderson authored
llvm-svn: 127900
-
Owen Anderson authored
llvm-svn: 127899
-
Justin Holewinski authored
- Emit mad instead of mad.rn for shader model 1.0
- Emit explicit mov.u32 instructions for reading global variables (most PTX instructions cannot take global variable immediates)
llvm-svn: 127895
-
Jim Grosbach authored
llvm-svn: 127891
-
Owen Anderson authored
llvm-svn: 127888
-
Andrew Trick authored
For example, on a 32-bit architecture, don't promote all uses of the IV to 64 bits just because one use is a 64-bit cast. Alternate implementation of the patch by Arnaud de Grandmaison. llvm-svn: 127884
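
A small C illustration of the situation described (an assumed example, not taken from the commit): the loop's induction variable has many 32-bit uses and a single 64-bit use, so only that one use should pay for the widening:

  /* On a 32-bit target, keep 'i' as a 32-bit IV; the single 64-bit use is
     handled by a local cast rather than by promoting every use of 'i'. */
  long long sum_with_one_wide_use(int *a, int n) {
      long long acc = 0;
      for (int i = 0; i < n; ++i) {
          a[i] += 1;              /* ordinary 32-bit uses of the IV */
          acc += (long long)i;    /* the one 64-bit use: a cast     */
      }
      return acc;
  }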
-
Joerg Sonnenberger authored
For now, only the default segments are supported. llvm-svn: 127875
-
Che-Liang Chiou authored
llvm-svn: 127874
-
Che-Liang Chiou authored
llvm-svn: 127873
-
NAKAMURA Takumi authored
On MSVCRT and compatible runtimes, the output of %e is incompatible with POSIX by default: the number of exponent digits should be at least 2 ("%+03d"). FIXME: Implement our own formatter in the future! llvm-svn: 127872
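
For reference, a tiny C program showing the difference; the MSVCRT string below reflects the pre-UCRT default of three exponent digits, so treat the exact output as an assumption about those runtimes:

  #include <stdio.h>

  int main(void) {
      /* POSIX requires at least two exponent digits; older MSVCRT prints three
         by default, so the same call produces different text. */
      printf("%e\n", 1.5);  /* POSIX/glibc: 1.500000e+00   MSVCRT: 1.500000e+000 */
      return 0;
  }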
-
Bill Wendling authored
This makes valgrind stop complaining about uninitialized variables being read when it accesses a bitfield (category) that shares its bits with these variables. llvm-svn: 127871
-
Jakob Stoklund Olesen authored
Stack slot real estate is virtually free compared to registers, so it is advantageous to spill earlier even though the same value is now kept in both a register and a stack slot. Also eliminate redundant spills by extending the stack slot live range underneath reloaded registers. This can trigger a dead code elimination, removing copies and even reloads that were only feeding spills. llvm-svn: 127868
-
Jakob Stoklund Olesen authored
This is not supposed to happen, but I have seen the x86 rematter getting confused when rematerializing partial redefs. llvm-svn: 127857
-
Jakob Stoklund Olesen authored
and early clobbers. Assert when trying to find an undefined value. llvm-svn: 127856
-
Rafael Espindola authored
llvm-svn: 127853
-
Eli Friedman authored
Use SUB+SBB when expanding double-width comparisons on x86. Essentially, the way this works is that SUB+SBB sets the relevant flags the same way a double-width CMP would. This is a substantial improvement over the generic lowering in LLVM. The output is also shorter than the gcc-generated output; I haven't done any detailed benchmarking, though. llvm-svn: 127852
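
A minimal C illustration of where this applies (an assumed example; the instruction sequence in the comment is the general SUB/SBB idiom, not necessarily the exact output of this change):

  /* Comparing two 64-bit values on 32-bit x86: subtract the low halves with
     SUB and the high halves with SBB; the flags then reflect the full 64-bit
     comparison, roughly (AT&T syntax):
         subl  lo(b), lo(a)
         sbbl  hi(b), hi(a)
         jl    a_is_less        ; signed <; an unsigned compare would use jb
  */
  int less_than(long long a, long long b) {
      return a < b;
  }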
-
Ted Kremenek authored
Augment CrashRecoveryContext to have registered "cleanup" objects that can be used to release resources during a crash. llvm-svn: 127849
-
Johnny Chen authored
Remove the offending logic and update the test cases. llvm-svn: 127843
-
Andrew Trick authored
llvm-svn: 127842
-