Commits · 045e26166a0dfc8f3c8af04eb4eda63eaff68e76 · Roger Ferrer / llvm-epi-0.8

Jun 23, 2011
- Remove TargetOptions.h dependency from X86Subtarget. · 3a0c5e52
  Evan Cheng authored Jun 23, 2011
```
llvm-svn: 133726
```
  3a0c5e52
Jun 18, 2011
- Remove unused but set variables. · 25e17b0f
  Benjamin Kramer authored Jun 18, 2011
```
llvm-svn: 133347
```
  25e17b0f
Jun 15, 2011

Add a new function attribute, nonlazybind, which inhibits lazy-loading · 4b7a8d68

John McCall authored Jun 15, 2011

optimizations when emitting calls to the function;  instead those calls may
use faster relocations which require the function to be immediately resolved
upon loading the dynamic object featuring the call.  This is useful when it
is known that the function will be called frequently and pervasively and
therefore there is no merit in delaying binding of the function.

Currently only implemented for x86-64, where it turns into a call through
the global offset table.

Patch by Dan Gohman, who assures me that he's going to add LangRef documentation
for this once it's committed.

llvm-svn: 133080

4b7a8d68

Jun 09, 2011
- Add a parameter to CCState so that it can access the MachineFunction. · 0713a9d8
  Eric Christopher authored Jun 08, 2011
```
No functional change.

Part of PR6965

llvm-svn: 132763
```
  0713a9d8
Jun 07, 2011
- Followup to 132458, omit unnecessary stack copy when x87 input is a · e0d3426e
  Stuart Hastings authored Jun 06, 2011
```
load.  rdar://problem/6373334

llvm-svn: 132696
```
  e0d3426e
Jun 04, 2011
- Reapply 132424 with fixes. This fixes PR10068. · be605494
  Stuart Hastings authored Jun 03, 2011
```
rdar://problem/5993888

llvm-svn: 132606
```
  be605494
Jun 03, 2011
- Have LowerOperandForConstraint handle multiple character constraints. · de9399bf
  Eric Christopher authored Jun 02, 2011
```
Part of rdar://9119939

llvm-svn: 132510
```
  de9399bf
Jun 02, 2011
- Revert 132424 to fix PR10068. · aa318ae4
  Rafael Espindola authored Jun 02, 2011
```
llvm-svn: 132479
```
  aa318ae4
- Omit unnecessary stack copy when x87 input is a load. · 8d530ad2
  Stuart Hastings authored Jun 02, 2011
```
rdar://problem/6373334

llvm-svn: 132458
```
  8d530ad2
Jun 01, 2011
- Recommit 132404 with fixes. rdar://problem/5993888 · 7adc95f6
  Stuart Hastings authored Jun 01, 2011
```
llvm-svn: 132424
```
  7adc95f6
- Revert 132404 to appease a buildbot. rdar://problem/5993888 · aab130d9
  Stuart Hastings authored Jun 01, 2011
```
llvm-svn: 132419
```
  aab130d9
- Add support for x86 CMPEQSS and friends. These instructions do a · 7b7c102f
  Stuart Hastings authored Jun 01, 2011
```
floating-point comparison, generate a mask of 0s or 1s, and generally
DTRT with NaNs.  Only profitable when the user wants a materialized 0
or 1 at runtime.  rdar://problem/5993888

llvm-svn: 132404
```
  7b7c102f
- FGETSIGN support for x86, using movmskps/pd. Will be enabled with a · 9f208042
  Stuart Hastings authored Jun 01, 2011
```
patch to TargetLowering.cpp.  rdar://problem/5660695

llvm-svn: 132388
```
  9f208042
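What FGETSIGN computes can be sketched in portable C by reading the top bit of the float's bit pattern, which is the same bit that movmskps collects for each packed lane. The sketch assumes `float` is IEEE-754 binary32; the helper name `fgetsign32` is hypothetical and is not taken from the patch.

```c
#include <stdint.h>
#include <string.h>

/* Return the sign bit of a float (1 if set, 0 otherwise) by inspecting its
   bit pattern, so -0.0f reports 1 even though it compares equal to 0.0f.
   Assumes float is IEEE-754 binary32. */
static unsigned fgetsign32(float f) {
    uint32_t bits;
    memcpy(&bits, &f, sizeof bits);  /* well-defined way to reinterpret bits */
    return (unsigned)(bits >> 31);
}
```

A plain `f < 0.0f` test would not be equivalent, because it returns false for negative zero.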
May 26, 2011
- Reverting 132105: it broke some LLVM-GCC DejaGNU tests. · 493a12bf
  Stuart Hastings authored May 26, 2011
```
llvm-svn: 132108
```
  493a12bf
- Correctly handle a one-word struct passed byval on x86_64. · 276f231c
  Stuart Hastings authored May 26, 2011
```
rdar://problem/6920088

llvm-svn: 132105
```
  276f231c
May 24, 2011

- Teach SelectionDAG::isKnownNeverZero to return true (op x, c) when c is · 88f9137f

Evan Cheng authored May 24, 2011

  non-zero.
- Teach X86 cmov optimization to eliminate the cmov from ctlz, cttz extension
  when the source of X86ISD::BSR / X86ISD::BSF is proven to be non-zero.

rdar://9490949

llvm-svn: 131948

88f9137f
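The reasoning behind this change can be sketched in C. BSR gives no meaningful result for a zero input, so a general count-leading-zeros needs a select on zero (the role the cmov plays); once the input is proven non-zero, for example because it is `x | c` with a non-zero constant `c`, the select is redundant. All function names below are hypothetical, and `bsr32` is a loop-based stand-in rather than the real instruction.

```c
#include <stdint.h>

/* Index of the highest set bit. Like BSR, the result is only meaningful for
   x != 0 (this stand-in happens to return 0 for x == 0). */
static unsigned bsr32(uint32_t x) {
    unsigned idx = 0;
    while (x >>= 1) idx++;
    return idx;
}

/* General form: needs a select on zero, the job done by the cmov. */
static unsigned ctlz32(uint32_t x) {
    return x == 0 ? 32u : 31u - bsr32(x);
}

/* Input known to be non-zero: the select can be dropped. */
static unsigned ctlz32_nonzero(uint32_t x) {
    return 31u - bsr32(x);
}

/* (x | 1) can never be zero, so the cheaper form is safe for any x. */
static unsigned ctlz32_of_or1(uint32_t x) {
    return ctlz32_nonzero(x | 1u);
}
```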

May 20, 2011
- Don't attempt to tail call optimize for Win64. · 552f8c48
  Chad Rosier authored May 20, 2011
```
llvm-svn: 131709
```
  552f8c48
- Revert r131664 and fix it in instcombine instead. rdar://9467055 · e8d2e9eb
  Evan Cheng authored May 20, 2011
```
llvm-svn: 131708
```
  e8d2e9eb
May 19, 2011
- Oddly people want to use the 'r' constraint for fp constants on x86. · 4014e5e2
  Eric Christopher authored May 19, 2011
```
Fixes rdar://9218925
Fixes PR9601

llvm-svn: 131682
```
  4014e5e2
- crc32 with 64-bit output zeros upper 32-bits. rdar://9467055 · 2b9bd386
  Evan Cheng authored May 19, 2011
```
llvm-svn: 131664
```
  2b9bd386
May 18, 2011
- Enables vararg functions that pass all arguments via registers to be optimized... · f4e832b1
  Chad Rosier authored May 18, 2011
```
Enables vararg functions that pass all arguments via registers to be optimized into tail-calls when possible.

llvm-svn: 131560
```
  f4e832b1
May 17, 2011
- Clean up the mess created by r131467+r131469. · d000a2c2
  Eli Friedman authored May 17, 2011
```
llvm-svn: 131471
```
  d000a2c2
- Revert 131467 due to buildbot complaint. · c65d8eda
  Stuart Hastings authored May 17, 2011
```
llvm-svn: 131469
```
  c65d8eda
- Fix an obscure issue in X86_64 parameter passing: if a tiny byval is · 3cf53088
  Stuart Hastings authored May 17, 2011
```
passed as the fifth parameter, ensure it's passed correctly (in R9).
rdar://problem/6920088

llvm-svn: 131467
```
  3cf53088
- Fix a bug in PerformEXTRACT_VECTOR_ELTCombine. · d8edb1d5
  Nadav Rotem authored May 17, 2011
```
The code created an ADD SDNode with two different types, in cases where the
index and the ptr had different types.

llvm-svn: 131461
```
  d8edb1d5
May 16, 2011
- Remove dead code. Fix associated test to use FileCheck. · d4a3609d
  Eli Friedman authored May 16, 2011
```
llvm-svn: 131424
```
  d4a3609d
May 11, 2011

Add custom lowering of X86 vector SRA/SRL/SHL when the shift amount is a splat vector. · 8f971c27

Nadav Rotem authored May 11, 2011

llvm-svn: 131179

8f971c27
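The precondition for this lowering, a splat shift amount, can be sketched in C: when every lane of the shift-amount vector holds the same value, one scalar count can drive the shift for all lanes. The lane count, the helper names, and the explicit range guard are assumptions made for illustration; they are not taken from the patch.

```c
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

#define LANES 4  /* illustrative lane count */

/* True when every lane of the shift-amount vector holds the same value. */
static bool is_splat(const uint32_t amt[LANES]) {
    for (size_t i = 1; i < LANES; i++)
        if (amt[i] != amt[0]) return false;
    return true;
}

/* If the shift amount is a splat, shift all lanes by that one count and
   return true; otherwise return false and leave out[] untouched. The
   amt[0] < 32 guard only avoids undefined behaviour in C. */
static bool shl_if_splat(const uint32_t v[LANES], const uint32_t amt[LANES],
                         uint32_t out[LANES]) {
    if (!is_splat(amt) || amt[0] >= 32u) return false;
    for (size_t i = 0; i < LANES; i++)
        out[i] = v[i] << amt[0];
    return true;
}
```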

May 06, 2011
- Make the logic for determining function alignment more explicit. No functionality change. · 2518f837
  Eli Friedman authored May 06, 2011
```
llvm-svn: 131012
```
  2518f837
Apr 20, 2011
- ADT/Triple: Rename isOSX... methods to isMacOSX for consistency with the OS · cd01ed5b
  Daniel Dunbar authored Apr 20, 2011
```
triple component.

llvm-svn: 129838
```
  cd01ed5b
Apr 19, 2011
- Target/X86: Eliminate uses of getDarwinVers(). · 100455a3
  Daniel Dunbar authored Apr 19, 2011
```
llvm-svn: 129813
```
  100455a3
Apr 15, 2011
- Fix a ton of comment typos found by codespell. Patch by · 0ab5e2cd
  Chris Lattner authored Apr 15, 2011
```
Luis Felipe Strano Moraes!

llvm-svn: 129558
```
  0ab5e2cd
Mar 31, 2011
- Don't try to create zero-sized stack objects. · ee9d45dd
  Evan Cheng authored Mar 30, 2011
```
llvm-svn: 128586
```
  ee9d45dd
Mar 26, 2011
- Make helper static. · 8d222737
  Benjamin Kramer authored Mar 26, 2011
```
llvm-svn: 128338
```
  8d222737
Mar 24, 2011

- Target/X86: [PR8777][PR8778] Tweak alloca/chkstk for Windows targets. · 521eb7c1
  NAKAMURA Takumi authored Mar 24, 2011
```
FIXME: Some cleanups would be needed.
llvm-svn: 128206
```
  521eb7c1

Revert r128175. · 4ab9a165

Andrew Trick authored Mar 23, 2011

I'm backing this out for the second time. It was supposed to be fixed by r128164, but the mingw self-host must be defeating the fix.

llvm-svn: 128181

4ab9a165

Mar 23, 2011
- Reapply Eli's r127852 now that the pre-RA scheduler can spill EFLAGS. · 4046a0de
  Andrew Trick authored Mar 23, 2011
```
(target-specific branchless method for double-width relational comparisons on x86)

llvm-svn: 128175
```
  4046a0de
Mar 21, 2011

Re-apply r127953 with fixes: eliminate empty return block if it has no... · 0663f23b

Evan Cheng authored Mar 21, 2011

Re-apply r127953 with fixes: eliminate empty return block if it has no predecessors; update dominator tree if cfg is modified.

llvm-svn: 127981

0663f23b

Mar 19, 2011

- Revert r127953, "SimplifyCFG has stopped duplicating returns into predecessors · 327cd36f
  Daniel Dunbar authored Mar 19, 2011
```
to canonicalize IR", it broke a lot of things.

llvm-svn: 127954
```
  327cd36f

SimplifyCFG has stopped duplicating returns into predecessors to canonicalize IR · 824a7113

Evan Cheng authored Mar 19, 2011

to have a single return block (at least getting there) for optimizations. This
is generally a good thing, but it prevents some tail call optimizations.
One specific case is code like this:
int f1(void);
int f2(void);
int f3(void);
int f4(void);
int f5(void);
int f6(void);
int foo(int x) {
  switch(x) {
  case 1: return f1();
  case 2: return f2();
  case 3: return f3();
  case 4: return f4();
  case 5: return f5();
  case 6: return f6();
  }
}

=>
LBB0_2:                                 ## %sw.bb
  callq   _f1
  popq    %rbp
  ret
LBB0_3:                                 ## %sw.bb1
  callq   _f2
  popq    %rbp
  ret
LBB0_4:                                 ## %sw.bb3
  callq   _f3
  popq    %rbp
  ret

This patch teaches codegenprep to duplicate returns when the return value
is a phi and where the phi operands are produced by tail calls followed by
an unconditional branch:

sw.bb7:                                           ; preds = %entry
  %call8 = tail call i32 @f5() nounwind
  br label %return
sw.bb9:                                           ; preds = %entry
  %call10 = tail call i32 @f6() nounwind
  br label %return
return:
  %retval.0 = phi i32 [ %call10, %sw.bb9 ], [ %call8, %sw.bb7 ], ... [ 0, %entry ]
  ret i32 %retval.0

This allows codegen to generate better code like this:

LBB0_2:                                 ## %sw.bb
        jmp     _f1                     ## TAILCALL
LBB0_3:                                 ## %sw.bb1
        jmp     _f2                     ## TAILCALL
LBB0_4:                                 ## %sw.bb3
        jmp     _f3                     ## TAILCALL

rdar://9147433

llvm-svn: 127953

824a7113

Add support for legalizing UINT_TO_FP of vectors on platforms which do · e7a101cc

Nadav Rotem authored Mar 19, 2011

not have native support for this operation (such as X86).
The legalized code uses two vector INT_TO_FP operations and is faster
than scalarizing.

llvm-svn: 127951

e7a101cc
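One way to build an unsigned conversion from two signed ones is to split each element into halves that are both non-negative as signed integers. The scalar C sketch below shows that idea for a single 32-bit lane; the 16-bit split and the function name `uint_to_float_split` are assumptions for illustration, since the commit message states only that two vector INT_TO_FP operations are used.

```c
#include <stdint.h>

/* Convert an unsigned 32-bit value to float using only signed conversions.
   Both halves fit in 16 bits, so they are non-negative as int32_t and each
   converts exactly; the final addition performs the only rounding step. */
static float uint_to_float_split(uint32_t x) {
    int32_t hi = (int32_t)(x >> 16);      /* upper 16 bits */
    int32_t lo = (int32_t)(x & 0xFFFFu);  /* lower 16 bits */
    return (float)hi * 65536.0f + (float)lo;
}
```

A direct signed conversion of the full value would be wrong for inputs with the top bit set, which is why the split is needed.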