- Jan 21, 2009
-
Evan Cheng authored
unsigned test(unsigned a) { return ~a; }

llvm used to generate:

    movl    $4294967295, %eax
    xorl    4(%esp), %eax

Now it generates:

    movl    4(%esp), %eax
    notl    %eax

It's 3 bytes shorter. llvm-svn: 62661
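A minimal C++ check (ours, not LLVM's code) of the identity the new isel pattern matches: xor against all-ones is exactly bitwise NOT, so the xorl with a materialized mask can become the shorter notl.

    #include <cassert>
    #include <cstdint>

    int main() {
      // ~a and a ^ 0xFFFFFFFF are the same operation; the pattern
      // selects notl instead of loading the all-ones immediate.
      for (uint32_t a : {0u, 1u, 0xDEADBEEFu, 0xFFFFFFFFu})
        assert(~a == (a ^ 0xFFFFFFFFu));
      return 0;
    }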
-
- Jan 20, 2009
-
Evan Cheng authored
llvm-svn: 62600
-
- Jan 19, 2009
-
Evan Cheng authored
DIVREM isel deficiency: if the sign bit is known zero, zero out DX/EDX/RDX instead of sign-extending the low part (in AX/EAX/RAX) into it. llvm-svn: 62519
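A hedged illustration (example ours) of the case this targets: when the dividend's sign bit is provably zero, e.g. after a logical shift right, the high half of the dividend pair can be set with xorl %edx, %edx instead of sign-extended with cltd before the idiv.

    #include <cstdint>

    // The logical shift clears the sign bit, so isel knows EDX can be
    // zeroed rather than computed by sign-extending EAX.
    int32_t div_known_nonneg(uint32_t a, int32_t b) {
      int32_t n = (int32_t)(a >> 1);  // sign bit known zero
      return n / b;                   // signed divide of EDX:EAX by b
    }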
-
Evan Cheng authored
Minor tweak to LowerUINT_TO_FP_i32. Bias (after scalar_to_vector) has two uses, so we should make it the second source operand of ISD::OR so the 2-address pass won't have to be smart about commuting.

    %reg1024<def> = MOVSDrm %reg0, 1, %reg0, <cp#0>, Mem:LD(8,8) [ConstantPool + 0]
    %reg1025<def> = MOVSD2PDrr %reg1024
    %reg1026<def> = MOVDI2PDIrm <fi#-1>, 1, %reg0, 0, Mem:LD(4,16) [FixedStack-1 + 0]
    %reg1027<def> = ORPSrr %reg1025<kill>, %reg1026<kill>
    %reg1028<def> = MOVPD2SDrr %reg1027<kill>
    %reg1029<def> = SUBSDrr %reg1028<kill>, %reg1024<kill>
    %reg1030<def> = CVTSD2SSrr %reg1029<kill>
    MOVSSmr <fi#0>, 1, %reg0, 0, %reg1030<kill>, Mem:ST(4,4) [FixedStack0 + 0]
    %reg1031<def> = LD_Fp32m80 <fi#0>, 1, %reg0, 0, Mem:LD(4,16) [FixedStack0 + 0]
    RET %reg1031<kill>, %ST0<imp-use,kill>

The reason the 2-addr pass isn't smart enough to commute the ORPSrr is that it can't look past the MOVSD2PDrr instruction. llvm-svn: 62505
-
Evan Cheng authored
optimize it to a SINT_TO_FP when the sign bit is known zero. X86 isel should perform the optimization itself. llvm-svn: 62504
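For context, a hedged example (ours) of an expression where this fires: once the sign bit is known zero, the unsigned conversion is equivalent to the signed one and can use the cheap cvtsi2sd path.

    #include <cstdint>

    // UINT_TO_FP whose source has a known-zero sign bit; it is safe to
    // treat it as SINT_TO_FP because the value fits in a signed int32.
    double to_fp(uint32_t x) {
      return (double)(x >> 1);
    }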
-
- Jan 17, 2009
-
Bill Wendling authored
llvm-svn: 62415
-
Evan Cheng authored
llvm-svn: 62413
-
Bill Wendling authored
llvm-svn: 62405
-
Bill Wendling authored
X86. This code:

    void f() {
      uint32_t x;
      float y = (float)x;
    }

used to be:

    movl    %eax, -8(%ebp)
    movl    [2^52 double], -4(%ebp)
    movsd   -8(%ebp), %xmm0
    subsd   [2^52 double], %xmm0
    cvtsd2ss %xmm0, %xmm0

Is now:

    movsd   [2^52 double], %xmm0
    movsd   %xmm0, %xmm1
    movd    %ecx, %xmm2
    orps    %xmm2, %xmm1
    subsd   %xmm0, %xmm1
    cvtsd2ss %xmm1, %xmm0

This is faster on X86. Note that there's an extra load of %xmm0 into %xmm1. That will be fixed in a later coalescer fix. llvm-svn: 62404
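A minimal C++ sketch (helper name ours) of the bit trick behind the new sequence: OR the 32-bit integer into the low mantissa bits of the double constant 2^52, then subtract 2^52; what remains is exactly the integer's value as a double.

    #include <cstdint>
    #include <cstring>

    double uint32_to_double(uint32_t x) {
      // 0x4330000000000000 is 2^52 as an IEEE-754 double; at that
      // magnitude one mantissa ULP equals 1, so OR-ing x into the low
      // bits yields the double 2^52 + x, exactly.
      uint64_t bits = 0x4330000000000000ULL | x;
      double d;
      std::memcpy(&d, &bits, sizeof d);
      return d - 4503599627370496.0;  // subtract 2^52, leaving (double)x
    }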
-
- Jan 16, 2009
-
Bill Wendling authored
llvm-svn: 62338
-
- Jan 15, 2009
-
Mon P Wang authored
llvm-svn: 62281
-
Rafael Espindola authored
llvm-svn: 62279
-
Dan Gohman authored
and into the ScheduleDAGInstrs class, so that they don't get destructed and re-constructed for each block. This fixes a compile-time hot spot in the post-pass scheduler. To help facilitate this, tidy and do some minor reorganization in the scheduler constructor functions. llvm-svn: 62275
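A hypothetical C++ sketch of the pattern behind this compile-time fix (names ours, not the LLVM code): keep per-block scratch containers in the long-lived scheduler object and clear them per block, so their storage is constructed once rather than once per basic block.

    #include <vector>

    struct SchedNode { /* per-instruction scheduling state */ };

    struct PostRAScheduler {            // hypothetical stand-in
      std::vector<SchedNode> Nodes;     // lives across all blocks
      void scheduleBlock(unsigned NumInstrs) {
        Nodes.clear();                  // keeps allocated capacity
        Nodes.resize(NumInstrs);        // no per-block construct/destruct
        // ... build the scheduling DAG for this block and schedule it ...
      }
    };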
-
Dan Gohman authored
llvm-svn: 62267
-
Dan Gohman authored
llvm-svn: 62265
-
- Jan 14, 2009
-
Dan Gohman authored
llvm-svn: 62196
-
Dan Gohman authored
llvm-svn: 62195
-
Dan Gohman authored
to Eli for pointing out that these forms don't ignore the high bits of their index operands, and as such are not immediately suitable for use by isel. llvm-svn: 62194
-
- Jan 13, 2009
-
Dan Gohman authored
llvm-svn: 62180
-
Dan Gohman authored
llvm-svn: 62179
-
Devang Patel authored
Use DebugInfo interface to lower dbg_* intrinsics. llvm-svn: 62127
-
- Jan 12, 2009
-
Duncan Sands authored
suggested by Chris. llvm-svn: 62099
-
- Jan 10, 2009
-
Evan Cheng authored
llvm-svn: 62024
-
- Jan 09, 2009
-
Misha Brukman authored
llvm-svn: 61991
-
Dan Gohman authored
llvm-svn: 61972
-
Dan Gohman authored
the same formatting as their corresponding SSE2 instructions, for consistency. llvm-svn: 61971
-
Devang Patel authored
Convert DwarfWriter into a pass. Users now request DwarfWriter through getAnalysisUsage() instead of creating a DwarfWriter instance directly. llvm-svn: 61955
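A fragment-style C++ sketch of the new usage pattern (MyEmitter is hypothetical, the era's pass boilerplate such as ID registration is elided, and exact signatures are assumptions):

    void MyEmitter::getAnalysisUsage(AnalysisUsage &AU) const {
      AU.addRequired<DwarfWriter>();       // request the pass...
      MachineFunctionPass::getAnalysisUsage(AU);
    }

    bool MyEmitter::runOnMachineFunction(MachineFunction &MF) {
      DwarfWriter *DW = &getAnalysis<DwarfWriter>();  // ...instead of new DwarfWriter
      // ... emit debug info through DW ...
      return false;
    }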
-
- Jan 07, 2009
-
Dan Gohman authored
into their left operand, rather than their right. Do this by commuting the operands and inverting the condition. llvm-svn: 61842
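The rewrite relies on the select identity (c ? x : y) == (!c ? y : x); a tiny C++ check (ours):

    #include <cassert>

    int sel(bool c, int x, int y) { return c ? x : y; }

    int main() {
      // Swapping the arms and inverting the condition is a no-op, which
      // is what lets the operands be commuted so the fold lands on the
      // left operand.
      for (bool c : {false, true})
        assert(sel(c, 1, 2) == sel(!c, 2, 1));
      return 0;
    }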
-
Dan Gohman authored
llvm-svn: 61841
-
Dan Gohman authored
llvm-svn: 61836
-
Dan Gohman authored
X86_COND_B and X86_COND_AE, respectively. llvm-svn: 61835
-
Dan Gohman authored
converted to LEA64_32r in x86's convertToThreeAddress. This replaces code like this:

    movl    %esi, %edi
    inc     %edi

with this:

    lea     1(%rsi), %edi

which appears to be beneficial. llvm-svn: 61830
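A small C++ check (ours) of why reading the 64-bit source register is safe here: the low 32 bits of a 64-bit add depend only on the low 32 bits of its inputs, and LEA64_32r keeps only the low 32 bits of the result.

    #include <cassert>
    #include <cstdint>

    int main() {
      uint32_t x = 0x89ABCDEFu;
      uint64_t rsi = 0xDEADBEEF00000000ULL | x;  // garbage in the high half
      // lea 1(%rsi), %edi computes the low 32 bits of rsi + 1, which
      // equal x + 1 regardless of the upper 32 bits.
      assert((uint32_t)(rsi + 1) == x + 1);
      return 0;
    }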
-
- Jan 05, 2009
-
Bill Wendling authored
llvm-svn: 61765
-
Dan Gohman authored
llvm-svn: 61715
-
Devang Patel authored
squash warnings. llvm-svn: 61707
-
Evan Cheng authored
llvm-svn: 61686
-
- Jan 03, 2009
-
Evan Cheng authored
llvm-svn: 61603
-
Evan Cheng authored
llvm-svn: 61602
-
- Jan 02, 2009
-
Evan Cheng authored
Do not fold loads into bt instructions during isel on Pentium M, Core, Core 2, and AMD processors. These are significantly slower than a load followed by a bt of a register. llvm-svn: 61557
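The kind of code affected, as a hedged C++ example (ours): a bit test whose bitmap lives in memory. Per the commit, a memory-operand bt is significantly slower on these processors than loading the word and testing a register, so isel should emit the load separately.

    #include <cstdint>

    bool test_bit(const uint32_t *bits, uint32_t idx) {
      // With load folding this could become `bt mem, reg`; the change
      // keeps it as a `movl` load followed by `bt reg, reg`.
      return (bits[idx >> 5] >> (idx & 31)) & 1;
    }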
-
Evan Cheng authored
llvm-svn: 61556
-