- Jan 08, 2011
Evan Cheng authored
Instead encode the LLVM IR-level property "HasSideEffects" in an operand (shared with IsAlignStack). Added MachineInstr::hasUnmodeledSideEffects() to check the operand when the instruction is an INLINEASM. This allows memory instructions to be moved around INLINEASM instructions. llvm-svn: 123044
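For illustration, a standalone sketch (not LLVM's actual API; the enum names and bit positions are hypothetical) of the idea of packing two boolean properties into one shared immediate operand, which a hasUnmodeledSideEffects()-style query then tests:

    #include <cstdint>

    // Hypothetical bit assignments for the shared flags operand.
    enum : uint64_t {
      kIsAlignStack   = 1u << 0,
      kHasSideEffects = 1u << 1,
    };

    // Pack both IR-level properties into the single operand value.
    uint64_t encodeExtraInfo(bool hasSideEffects, bool isAlignStack) {
      uint64_t flags = 0;
      if (isAlignStack)   flags |= kIsAlignStack;
      if (hasSideEffects) flags |= kHasSideEffects;
      return flags;
    }

    // The query side: a scheduler may move memory operations across an
    // inline asm whose operand does not have this bit set.
    bool hasUnmodeledSideEffects(uint64_t extraInfo) {
      return (extraInfo & kHasSideEffects) != 0;
    }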
-
- Jan 07, 2011
Evan Cheng authored
Revert r122955. It seems using movups to lower memcpy can cause massive regressions (even on Nehalem) in edge cases. I also didn't see any real performance benefit. llvm-svn: 123015
-
- Jan 06, 2011
Rafael Espindola authored
Patch by Richard Smith. llvm-svn: 122962
-
Benjamin Kramer authored
llvm-svn: 122957
-
Evan Cheng authored
The theory is that it's still faster than a pair of movq or a quad of movl instructions. This will probably hurt older chips like the P4 but should run faster on current and future Intel processors. rdar://8817010 llvm-svn: 122955
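For context, a minimal sketch using SSE intrinsics of the unaligned 16-byte copy in question (the helper name is hypothetical; _mm_loadu_ps and _mm_storeu_ps compile to movups):

    #include <xmmintrin.h> // SSE intrinsics

    // One unaligned 16-byte load + store (movups) in place of a pair of
    // 8-byte movq moves or a quad of 4-byte movl moves.
    void copy16(char *dst, const char *src) {
      __m128 v = _mm_loadu_ps(reinterpret_cast<const float *>(src));
      _mm_storeu_ps(reinterpret_cast<float *>(dst), v);
    }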
-
Evan Cheng authored
etc. takes an OptSize option. If OptSize is true, it returns the inline limit for functions with the OptSize attribute. llvm-svn: 122952
-
Bill Wendling authored
works only on MinGW32. On 64-bit, the function to call is "__chkstk". Patch by KS Sreeram! llvm-svn: 122934
-
Bill Wendling authored
beginning of the "main" function. The assembler complains about the invalid suffix for the 'call' instruction. The right instruction is "callq __main". Patch by KS Sreeram! llvm-svn: 122933
-
- Jan 05, 2011
Chris Lattner authored
llvm-svn: 122921
-
Chris Lattner authored
llvm-svn: 122920
-
- Jan 04, 2011
Jakob Stoklund Olesen authored
bundles in the pass. llvm-svn: 122833
-
Jakob Stoklund Olesen authored
The analysis will be needed by both the greedy register allocator and the X86FloatingPoint pass. It only needs to be computed once as long as the CFG doesn't change. This pass is very fast, usually showing up as 0.0% wall time. llvm-svn: 122832
-
Dale Johannesen authored
warning is overzealous but gcc is what it is.) llvm-svn: 122829
-
- Jan 03, 2011
Evan Cheng authored
prologue and epilogue if the adjustment is 8. Similarly, use pushl / popl if the adjustment is 4 in 32-bit mode. In the epilogue, it takes care to pop into a caller-saved register that's not live at the exit (either a return or a tail-call instruction). rdar://8771137 llvm-svn: 122783
-
- Jan 02, 2011
Benjamin Kramer authored
This allows us to compile:

    void test(char *s, int a) {
      __builtin_memset(s, a, 15);
    }

into 1 mul + 3 stores instead of 3 muls + 3 stores. llvm-svn: 122710
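For context, a minimal sketch (portable C++; the helper name is hypothetical) of the single-multiply trick: one multiply splats the fill byte across a 64-bit word, and three overlapping stores then cover all 15 bytes:

    #include <cstdint>
    #include <cstring>

    void memset15(char *s, int a) {
      // One multiply replicates the low byte of `a` into all 8 byte lanes.
      uint64_t pattern = static_cast<uint8_t>(a) * 0x0101010101010101ULL;
      uint32_t pat32 = static_cast<uint32_t>(pattern);
      std::memcpy(s, &pattern, 8);    // bytes 0..7
      std::memcpy(s + 8, &pat32, 4);  // bytes 8..11
      std::memcpy(s + 11, &pat32, 4); // bytes 11..14 (overlap is harmless)
    }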
-
Oscar Fuentes authored
llvm-svn: 122706
-
Chris Lattner authored
llvm-svn: 122700
-
- Jan 01, 2011
Rafael Espindola authored
llvm-svn: 122667
-
- Dec 31, 2010
Oscar Fuentes authored
is necessary for executing the custom command that runs the assembler. Fixes PR8877. llvm-svn: 122649
-
- Dec 30, 2010
Nick Lewycky authored
Fixes PR8861. llvm-svn: 122641
-
- Dec 29, 2010
NAKAMURA Takumi authored
CMake: Disable optimization on MSVC8 and MSVC10 as a workaround for some files in Target/ARM and Target/X86. llvm-svn: 122623
-
- Dec 27, 2010
Rafael Espindola authored
supports. llvm-svn: 122577
-
- Dec 26, 2010
Chris Lattner authored
llvm-svn: 122560
-
Chris Lattner authored
llvm-svn: 122559
-
- Dec 24, 2010
Evan Cheng authored
llvm-svn: 122528
-
- Dec 23, 2010
Chris Lattner authored
llvm-svn: 122513
-
Benjamin Kramer authored
llvm-svn: 122495
-
Benjamin Kramer authored
int test(unsigned long a, unsigned long b) { return -(a < b); }

compiles to

    _test:                                  ## @test
        cmpq    %rsi, %rdi                  ## encoding: [0x48,0x39,0xf7]
        sbbl    %eax, %eax                  ## encoding: [0x19,0xc0]
        ret                                 ## encoding: [0xc3]

instead of

    _test:                                  ## @test
        xorl    %ecx, %ecx                  ## encoding: [0x31,0xc9]
        cmpq    %rsi, %rdi                  ## encoding: [0x48,0x39,0xf7]
        movl    $-1, %eax                   ## encoding: [0xb8,0xff,0xff,0xff,0xff]
        cmovael %ecx, %eax                  ## encoding: [0x0f,0x43,0xc1]
        ret                                 ## encoding: [0xc3]

(After the cmpq, the carry flag is set exactly when a < b as unsigned values, and sbbl %eax, %eax computes eax - eax - CF = -CF.) llvm-svn: 122451
-
- Dec 21, 2010
Benjamin Kramer authored
(add Y, (sete  X, 0)) -> cmp X, 1; adc  0, Y
(add Y, (setne X, 0)) -> cmp X, 1; sbb -1, Y
(sub (sete  X, 0), Y) -> cmp X, 1; sbb  0, Y
(sub (setne X, 0), Y) -> cmp X, 1; adc -1, Y

(cmp X, 1 sets the carry flag exactly when X == 0 as an unsigned value, so adc/sbb fold the comparison result directly into the add/sub.)

for

    unsigned foo(unsigned a, unsigned b) {
      if (a == 0) b++;
      return b;
    }

we now get:

    foo:
        cmpl    $1, %edi
        movl    %esi, %eax
        adcl    $0, %eax
        ret

instead of:

    foo:
        testl   %edi, %edi
        sete    %al
        movzbl  %al, %eax
        addl    %esi, %eax
        ret

llvm-svn: 122364
-
Chris Lattner authored
something that just glues two nodes together, even if it is sometimes used for flags. llvm-svn: 122310
-
- Dec 20, 2010
Nate Begeman authored
Implement feedback from Bruno on making pblendvb an x86-specific ISD node in addition to being an intrinsic, and convert lowering to use it. Hopefully the pattern fragment is doing the right thing with XMM0; it looks correct in testing. llvm-svn: 122277
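For context, pblendvb selects per byte on the high bit of a mask register that the non-VEX encoding hardwires to XMM0 (hence the XMM0 concern above). A minimal sketch of its semantics via the SSE4.1 intrinsic (the wrapper name is hypothetical):

    #include <smmintrin.h> // SSE4.1

    // For each byte lane: take the byte from `b` when the corresponding
    // mask byte has its high bit set, otherwise keep the byte from `a`.
    __m128i blend_bytes(__m128i a, __m128i b, __m128i mask) {
      return _mm_blendv_epi8(a, b, mask);
    }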
-
Daniel Dunbar authored
llvm-svn: 122247
-
Daniel Dunbar authored
llvm-svn: 122246
-
Chris Lattner authored
the same as setcc. Optimize ADDC(0,0,FLAGS) -> SET_CARRY(FLAGS) (adding 0 + 0 + carry simply materializes the carry bit). This is a step towards finishing off PR5443. In the testcase in that bug we now get:

    movq    %rdi, %rax
    addq    %rsi, %rax
    sbbq    %rcx, %rcx
    testb   $1, %cl
    setne   %dl
    ret

instead of:

    movq    %rdi, %rax
    addq    %rsi, %rax
    movl    $0, %ecx
    adcq    $0, %rcx
    testq   %rcx, %rcx
    setne   %dl
    ret

llvm-svn: 122219
-
Chris Lattner authored
doesn't, match it back to setb. On a 64-bit version of the testcase before we'd get:

    movq    %rdi, %rax
    addq    %rsi, %rax
    sbbb    %dl, %dl
    andb    $1, %dl
    ret

now we get:

    movq    %rdi, %rax
    addq    %rsi, %rax
    setb    %dl
    ret

llvm-svn: 122217
-
Chris Lattner authored
llvm-svn: 122214
-
Chris Lattner authored
their carry dependencies with MVT::Flag operands) and use clean and beautiful EFLAGS dependences instead. We do this by changing the modelling of SBB/ADC to have EFLAGS inputs and outputs (which is what requires the previous scheduler change) and change X86 ISelLowering to custom lower ADDC and friends down to X86ISD::ADD/ADC/SUB/SBB nodes. With the previous series of changes, this causes no changes in the testsuite, woo. llvm-svn: 122213
-
Mon P Wang authored
has run, e.g., prevent creating an i64 node from a v2i64 when i64 is not a legal type. llvm-svn: 122206
-
- Dec 19, 2010
Chris Lattner authored
consistently by moving it out of lowering into dag combine. Add some missing patterns for matching away extended versions of setcc_c. llvm-svn: 122201
-
Chris Lattner authored
going through the CSE maps to get it. llvm-svn: 122196
-