Commits · 351c71a85f74eea1f8e68d7efbe506f8cb06fcf6 · Roger Ferrer / llvm-epi-0.8

Mar 27, 2009
- Avoid hardcoding that X86 addresses have 4 operands. · 705f2a6c
  Rafael Espindola authored Mar 27, 2009
```
llvm-svn: 67848
```
  705f2a6c
Mar 13, 2009

Fix some significant problems with constant pools that resulted in unnecessary... · 1fb8aedd

Evan Cheng authored Mar 13, 2009

Fix some significant problems with constant pools that resulted in unnecessary paddings between constant pool entries, larger than necessary alignments (e.g. 8 byte alignment for .literal4 sections), and potentially other issues.

1. ConstantPoolSDNode alignment field is log2 value of the alignment requirement. This is not consistent with other SDNode variants.
2. MachineConstantPool alignment field is also a log2 value.
3. However, some places are creating ConstantPoolSDNode with alignment value rather than log2 values. This creates entries with artificially large alignments, e.g. 256 for SSE vector values.
4. Constant pool entry offsets are computed when they are created. However, asm printer group them by sections. That means the offsets are no longer valid. However, asm printer uses them to determine size of padding between entries.
5. Asm printer uses expensive data structure multimap to track constant pool entries by sections.
6. Asm printer iterate over SmallPtrSet when it's emitting constant pool entries. This is non-deterministic.

Solutions:
1. ConstantPoolSDNode alignment field is changed to keep non-log2 value.
2. MachineConstantPool alignment field is also changed to keep non-log2 value.
3. Functions that create ConstantPool nodes are passing in non-log2 alignments.
4. MachineConstantPoolEntry no longer keeps an offset field. It's replaced with an alignment field. Offsets are not computed when constant pool entries are created. They are computed on the fly in asm printer and JIT.
5. Asm printer uses cheaper data structure to group constant pool entries.
6. Asm printer compute entry offsets after grouping is done.
7. Change JIT code to compute entry offsets on the fly.

llvm-svn: 66875

1fb8aedd

Mar 04, 2009

Correct this comment. · f8920d0c
Dan Gohman authored Mar 04, 2009
```
llvm-svn: 66057
```
f8920d0c

When using MachineInstr operand indices on SDNodes, the number · cc329b56

Dan Gohman authored Mar 04, 2009

of MachineInstr def operands must be subtracted out. This bug
was uncovered by the recent x86 EFLAGS optimization. Before
that, the only instructions that ever needed unfolding were
things like CMP32rm, where NumDefs is zero.

llvm-svn: 66056

cc329b56

Feb 22, 2009

Do not consider MMX_MOVD64rr a move instructions. The source register is in... · 91193c00

Evan Cheng authored Feb 22, 2009

Do not consider MMX_MOVD64rr a move instructions. The source register is in GR32, the destination is VR64. They are not compatible.

llvm-svn: 65273

91193c00

Feb 18, 2009
- Factor out the code to add a MachineOperand to a MachineInstrBuilder. · 2af1f85f
  Dan Gohman authored Feb 18, 2009
```
llvm-svn: 64891
```
  2af1f85f
Feb 13, 2009
- Remove non-DebugLoc versions of BuildMI from X86. · 9bba902c
  Dale Johannesen authored Feb 13, 2009
```
There were some that might even matter in X86FastISel.

llvm-svn: 64437
```
  9bba902c
- Eliminate a couple of non-DebugLoc BuildMI variants. · 6b8c76a9
  Dale Johannesen authored Feb 12, 2009
```
Modify callers.

llvm-svn: 64409
```
  6b8c76a9
Feb 11, 2009
- Propagate DebugLoc info for spiller call-backs. · 27b508db
  Bill Wendling authored Feb 11, 2009
```
llvm-svn: 64329
```
  27b508db
Feb 10, 2009
- Implement FpSET_ST1_*. · e5ade4a9
  Evan Cheng authored Feb 09, 2009
```
llvm-svn: 64186
```
  e5ade4a9
Feb 09, 2009

Turns out AnalyzeBranch can modify the mbb being analyzed. This is a nasty · 64dfcacd

Evan Cheng authored Feb 09, 2009

suprise to some callers, e.g. register coalescer. For now, add an parameter
that tells AnalyzeBranch whether it's safe to modify the mbb. A better
solution is out there, but I don't have time to deal with it right now.

llvm-svn: 64124

64dfcacd

Feb 06, 2009
- Move getPointerRegClass from TargetInstrInfo to TargetRegisterInfo. · 066757ee
  Evan Cheng authored Feb 06, 2009
```
llvm-svn: 63938
```
  066757ee
- Add TargetInstrInfo::isSafeToMoveRegisterClassDefs. It returns true if it's... · b5f0ec3e
  Evan Cheng authored Feb 06, 2009
```
Add TargetInstrInfo::isSafeToMoveRegisterClassDefs. It returns true if it's safe to move an instruction which defines a value in the register class. Replace pre-splitting specific IgnoreRegisterClassBarriers with this new hook.

llvm-svn: 63936
```
  b5f0ec3e
- Get rid of one more non-DebugLoc getNode and · 9f3f72f1
  Dale Johannesen authored Feb 06, 2009
```
its corresponding getTargetNode.  Lots of
caller changes.

llvm-svn: 63904
```
  9f3f72f1
Feb 03, 2009
- Create DebugLoc information in FastISel. Several temporary methods were · e3c78361
  Bill Wendling authored Feb 03, 2009
```
created. Specifically, those BuildMIs which use
"DebugLoc::getUnknownLoc()". I'll remove them soon.

llvm-svn: 63584
```
  e3c78361
Jan 20, 2009
- Change TargetInstrInfo::isMoveInstr to return source and destination sub-register indices as well. · c544cb0e
  Evan Cheng authored Jan 20, 2009
```
llvm-svn: 62600
```
  c544cb0e
Jan 15, 2009
- Add load-folding table entries for BT*ri8 instructions. · dbb22a44
  Dan Gohman authored Jan 15, 2009
```
llvm-svn: 62267
```
  dbb22a44
Jan 09, 2009
- Add load-folding table entries for MOVDQA. · bdc0f8b6
  Dan Gohman authored Jan 09, 2009
```
llvm-svn: 61972
```
  bdc0f8b6
Jan 07, 2009
- Add load-folding table entries for cmovno too. · 1e6e9a8b
  Dan Gohman authored Jan 07, 2009
```
llvm-svn: 61841
```
  1e6e9a8b
- Define instructions for cmovo and cmovno. · 7e47cc7c
  Dan Gohman authored Jan 07, 2009
```
llvm-svn: 61836
```
  7e47cc7c
- X86_COND_C and X86_COND_NC are alternate mnemonics for · 33e6fcd5
  Dan Gohman authored Jan 07, 2009
```
X86_COND_B and X86_COND_AE, respectively.

llvm-svn: 61835
```
  33e6fcd5
- Revert r42653 and forward-port the code that lets INC64_32r be · beac19e2
  Dan Gohman authored Jan 06, 2009
```
converted to LEA64_32r in x86's convertToThreeAddress. This
replaces code like this:
   movl  %esi, %edi
   inc   %edi
with this:
   lea   1(%rsi), %edi
which appears to be beneficial.

llvm-svn: 61830
```
  beac19e2
Jan 05, 2009
- Tidy up #includes, deleting a bunch of unnecessary #includes. · 906152a2
  Dan Gohman authored Jan 05, 2009
```
llvm-svn: 61715
```
  906152a2
Dec 23, 2008
- Make the fuse-failed debug output human-readable. · d72358cb
  Dan Gohman authored Dec 23, 2008
```
llvm-svn: 61356
```
  d72358cb
Dec 18, 2008
- Fixed x86 code generation of multiple for v2i64. It was incorrect for SSE4.1. · 998fd29c
  Mon P Wang authored Dec 18, 2008
```
llvm-svn: 61211
```
  998fd29c
Dec 05, 2008

Reason #3 from 60595 doesn't hold true. If we can fold a PIC load from... · 43c08918

Evan Cheng authored Dec 05, 2008

Reason #3 from 60595 doesn't hold true. If we can fold a PIC load from constpool into a use, the rewrite happens at time of spill (not in VirtRegMap). Later on, if the GlobalBaseReg is spilled, the spiller can see the use uses GlobalBaseReg and do the right thing.

llvm-svn: 60596

43c08918

Effectively undo 60461 in PIC mode which simply transform V_SET0 /... · fd8c4d59

Evan Cheng authored Dec 05, 2008

Effectively undo 60461 in PIC mode which simply transform V_SET0 / V_SETALLONES into a load from constpool in order to fold into restores. This is not safe to do when PIC base is being used for a number of reasons:
1. GlobalBaseReg may have been spilled.
2. It may not be live at the use.
3. Spiller doesn't know this is happening so it won't prevent GlobalBaseReg from being spilled later (That by itself is a nasty hack. It's needed because we don't insert the reload until later).

llvm-svn: 60595

fd8c4d59

Dec 03, 2008

Split foldMemoryOperand into public non-virtual and protected virtual · 3f86b513
Dan Gohman authored Dec 03, 2008
```
parts, and add target-independent code to add/preserve
MachineMemOperands.

llvm-svn: 60488
```
3f86b513

Mark x86's V_SET0 and V_SETALLONES with isSimpleLoad, and teach X86's · cc78cdf2

Dan Gohman authored Dec 03, 2008

foldMemoryOperand how to "fold" them, by converting them into constant-pool
loads. When they aren't folded, they use xorps/cmpeqd, but for example when
register pressure is high, they may now be folded as memory operands, which
reduces register pressure.

Also, mark V_SET0 isAsCheapAsAMove so that two-address-elimination will
remat it instead of copying zeros around (V_SETALLONES was already marked).

llvm-svn: 60461

cc78cdf2

Dec 02, 2008
- Reapply r60382. This time, don't mark "ADC" nodes with "implicit EFLAGS". · 122c5158
  Bill Wendling authored Dec 02, 2008
```
llvm-svn: 60385
```
  122c5158
- Temporarily revert r60382. It caused CodeGen/X86/i2k.ll and others to fail. · 351b6659
  Bill Wendling authored Dec 01, 2008
```
llvm-svn: 60383
```
  351b6659
- - Have "ADD" instructions return an implicit EFLAGS. · a435b1ae
  Bill Wendling authored Dec 01, 2008
```
- Add support for seto, setno, setc, and setnc instructions.

llvm-svn: 60382
```
  a435b1ae
Nov 26, 2008

Generate something sensible for an [SU]ADDO op when the overflow/carry flag is · 751a694a

Bill Wendling authored Nov 26, 2008

the conditional for the BRCOND statement. For instance, it will generate:

    addl %eax, %ecx
    jo LOF

instead of

    addl %eax, %ecx
    ; About 10 instructions to compare the signs of LHS, RHS, and sum.
    jl LOF

llvm-svn: 60123

751a694a

Fish kill flag annotations in PUSH instructions. · 002a2cb2
Dan Gohman authored Nov 26, 2008
```
llvm-svn: 60095
```
002a2cb2

Nov 18, 2008
- Add more const qualifiers. This fixes build breakage from r59540. · 0b273259
  Dan Gohman authored Nov 18, 2008
```
llvm-svn: 59542
```
  0b273259
Oct 27, 2008

For now, don't split live intervals around x87 stack register barriers.... · f7137229

Evan Cheng authored Oct 27, 2008

For now, don't split live intervals around x87 stack register barriers. FpGET_ST0_80 must be right after a call instruction (and ADJCALLSTACKUP) so we need to find a way to prevent reload of x87 registers between them.

llvm-svn: 58230

f7137229

Oct 25, 2008
- Generate code for TLS instructions. · db30612f
  Nicolas Geoffray authored Oct 25, 2008
```
llvm-svn: 58141
```
  db30612f
Oct 21, 2008

Optimized FCMP_OEQ and FCMP_UNE for x86. · 97d95d6d

Dan Gohman authored Oct 21, 2008

Where previously LLVM might emit code like this:

        ucomisd %xmm1, %xmm0
        setne   %al
        setp    %cl
        orb     %al, %cl
        jne     .LBB4_2

it now emits this:

        ucomisd %xmm1, %xmm0
        jne     .LBB4_2
        jp      .LBB4_2

It has fewer instructions and uses fewer registers, but it does
have more branches. And in the case that this code is followed by
a non-fallthrough edge, it may be followed by a jmp instruction,
resulting in three branch instructions in sequence. Some effort
is made to avoid this situation.

To achieve this, X86ISelLowering.cpp now recognizes FCMP_OEQ and
FCMP_UNE in lowered form, and replace them with code that emits
two branches, except in the case where it would require converting
a fall-through edge to an explicit branch.

Also, X86InstrInfo.cpp's branch analysis and transform code now
knows now to handle blocks with multiple conditional branches. It
uses loops instead of having fixed checks for up to two
instructions. It can now analyze and transform code generated
from FCMP_OEQ and FCMP_UNE.

llvm-svn: 57873

97d95d6d

When the coalescer is doing rematerializing, have it remove · c835458d

Dan Gohman authored Oct 21, 2008

the copy instruction from the instruction list before asking the
target to create the new instruction. This gets the old instruction
out of the way so that it doesn't interfere with the target's
rematerialization code. In the case of x86, this helps it find
more cases where EFLAGS is not live.

Also, in the X86InstrInfo.cpp, teach isSafeToClobberEFLAGS to check
to see if it reached the end of the block after scanning each
instruction, instead of just before. This lets it notice when the
end of the block is only two instructions away, without doing any
additional scanning.

These changes allow rematerialization to clobber EFLAGS in more
cases, for example using xor instead of mov to set the return value
to zero in the included testcase.

llvm-svn: 57872

c835458d

Oct 17, 2008

Define patterns for shld and shrd that match immediate · a39b0a1f

Dan Gohman authored Oct 17, 2008

shift counts, and patterns that match dynamic shift counts
when the subtract is obscured by a truncate node.

Add DAGCombiner support for recognizing rotate patterns
when the shift counts are defined by truncate nodes.

Fix and simplify the code for commuting shld and shrd
instructions to work even when the given instruction doesn't
have a parent, and when the caller needs a new instruction.

These changes allow LLVM to use the shld, shrd, rol, and ror
instructions on x86 to replace equivalent code using two
shifts and an or in many more cases.

llvm-svn: 57662

a39b0a1f