Commits · 7e07d6fb69bf52b45e1a0f790135bd4746d70194 · Roger Ferrer / llvm-epi-0.8

Apr 14, 2011
- Have the X86 back-end emit the alias instead of what's being aliased. In most · 7e07d6fb
  Bill Wendling authored Apr 14, 2011
```
cases, it's much nicer and more informative reading the alias.

llvm-svn: 129497
```
  7e07d6fb
- Add an option to not print the alias of an instruction. It defaults to "print · 6dd69d92
  Bill Wendling authored Apr 13, 2011
```
the alias".

llvm-svn: 129485
```
  6dd69d92
Apr 13, 2011
- Reapply r129401 with patch for clang. · b902f1dd
  Bill Wendling authored Apr 13, 2011
```
llvm-svn: 129419
```
  b902f1dd
- Revert r129401 for now. Clang is using the old way of doing things. · dbfde424
  Bill Wendling authored Apr 12, 2011
```
llvm-svn: 129403
```
  dbfde424
- Remove the unaligned load intrinsics in favor of using native unaligned loads. · 47c24875
  Bill Wendling authored Apr 12, 2011
```
Now that we have a first-class way to represent unaligned loads, the unaligned
load intrinsics are superfluous.

First part of <rdar://problem/8460511>.

llvm-svn: 129401
```
  47c24875
Apr 11, 2011
- Don't include Operator.h from InstrTypes.h. · 7c14a558
  Jay Foad authored Apr 11, 2011
```
llvm-svn: 129271
```
  7c14a558
Apr 09, 2011
- fix rdar://8735979 - "int 3" doesn't match to "int3". Unfortunately, · fc4fe00a
  Chris Lattner authored Apr 09, 2011
```
InstAlias doesn't allow matching immediate operands, so we have to write
C++ code to do this.

llvm-svn: 129223
```
  fc4fe00a
Apr 07, 2011

Replace the old algorithm that emitted the "print the alias for an instruction" · bc3f7904

Bill Wendling authored Apr 07, 2011

with the newer, cleaner model. It uses the IAPrinter class to hold the
information that is needed to match an instruction with its alias. This also
takes into account the available features of the platform.

There is one bit of ugliness. The way the logic determines if a pattern is
unique is O(N**2), which is gross. But in reality, the number of items it's
checking against isn't large. So while it's N**2, it shouldn't be a massive time
sink.

llvm-svn: 129110

bc3f7904

Apr 06, 2011
- Add another case we are not optimizing. · b4dd95b4
  Rafael Espindola authored Apr 06, 2011
```
llvm-svn: 129012
```
  b4dd95b4
- The original issue has been fixed by not doing unnecessary sign extensions. · 7a3b244d
  Rafael Espindola authored Apr 06, 2011
```
Change the test to force a sign extension and expose the problem again.

llvm-svn: 129011
```
  7a3b244d
Apr 04, 2011
- Make OpcodeMask an unsigned long long literal to deal with overflow. · 418f186a
  Joerg Sonnenberger authored Apr 04, 2011
```
llvm-svn: 128847
```
  418f186a
- Add support for the VIA PadLock instructions. · fc4789da
  Joerg Sonnenberger authored Apr 04, 2011
```
llvm-svn: 128826
```
  fc4789da
- Expand Op0Mask by one bit in preparation for the PadLock prefixes. · cc53d991
  Joerg Sonnenberger authored Apr 04, 2011
```
Define most shift masks incrementally to reduce the redundant
hard-coding. Introduce new shift for the VEX flags to replace the
magic constant 32 in various places.

llvm-svn: 128822
```
  cc53d991
Mar 31, 2011
- Don't try to create zero-sized stack objects. · ee9d45dd
  Evan Cheng authored Mar 30, 2011
```
llvm-svn: 128586
```
  ee9d45dd
Mar 26, 2011
- Make helper static. · 8d222737
  Benjamin Kramer authored Mar 26, 2011
```
llvm-svn: 128338
```
  8d222737
Mar 24, 2011

Target/X86: [PR8777][PR8778] Tweak alloca/chkstk for Windows targets. · 521eb7c1
NAKAMURA Takumi authored Mar 24, 2011
```
FIXME: Some cleanups would be needed.
llvm-svn: 128206
```
521eb7c1

Revert r128175. · 4ab9a165

Andrew Trick authored Mar 23, 2011

I'm backing this out for the second time. It was supposed to be fixed by r128164, but the mingw self-host must be defeating the fix.

llvm-svn: 128181

4ab9a165

Mar 23, 2011
- Reapply Eli's r127852 now that the pre-RA scheduler can spill EFLAGS. · 4046a0de
  Andrew Trick authored Mar 23, 2011
```
(target-specific branchless method for double-width relational comparisons on x86)

llvm-svn: 128175
```
  4046a0de
Mar 22, 2011
- Fix fast-isel address mode folding to avoid folding instructions · c1783b31
  Dan Gohman authored Mar 22, 2011
```
outside of the current basic block. This fixes PR9500, rdar://9156159.

llvm-svn: 128041
```
  c1783b31
Mar 21, 2011

We need to pass the TargetMachine object to the InstPrinter if we are printing · 00f0cddf

Bill Wendling authored Mar 21, 2011

the alias of an InstAlias instead of the thing being aliased. Because we need to
know the features that are valid for an InstAlias.

This is part of a work-in-progress.

llvm-svn: 127986

00f0cddf

Re-apply r127953 with fixes: eliminate empty return block if it has no... · 0663f23b

Evan Cheng authored Mar 21, 2011

Re-apply r127953 with fixes: eliminate empty return block if it has no predecessors; update dominator tree if cfg is modified.

llvm-svn: 127981

0663f23b

Mar 19, 2011

Revert r127953, "SimplifyCFG has stopped duplicating returns into predecessors · 327cd36f
Daniel Dunbar authored Mar 19, 2011
```
to canonicalize IR", it broke a lot of things.

llvm-svn: 127954
```
327cd36f

SimplifyCFG has stopped duplicating returns into predecessors to canonicalize IR · 824a7113

Evan Cheng authored Mar 19, 2011

to have single return block (at least getting there) for optimizations. This
is general goodness but it would prevent some tailcall optimizations.
One specific case is code like this:
int f1(void);
int f2(void);
int f3(void);
int f4(void);
int f5(void);
int f6(void);
int foo(int x) {
  switch(x) {
  case 1: return f1();
  case 2: return f2();
  case 3: return f3();
  case 4: return f4();
  case 5: return f5();
  case 6: return f6();
  }
}

=>
LBB0_2:                                 ## %sw.bb
  callq   _f1
  popq    %rbp
  ret
LBB0_3:                                 ## %sw.bb1
  callq   _f2
  popq    %rbp
  ret
LBB0_4:                                 ## %sw.bb3
  callq   _f3
  popq    %rbp
  ret

This patch teaches codegenprep to duplicate returns when the return value
is a phi and where the phi operands are produced by tail calls followed by
an unconditional branch:

sw.bb7:                                           ; preds = %entry
  %call8 = tail call i32 @f5() nounwind
  br label %return
sw.bb9:                                           ; preds = %entry
  %call10 = tail call i32 @f6() nounwind
  br label %return
return:
  %retval.0 = phi i32 [ %call10, %sw.bb9 ], [ %call8, %sw.bb7 ], ... [ 0, %entry ]
  ret i32 %retval.0

This allows codegen to generate better code like this:

LBB0_2:                                 ## %sw.bb
        jmp     _f1                     ## TAILCALL
LBB0_3:                                 ## %sw.bb1
        jmp     _f2                     ## TAILCALL
LBB0_4:                                 ## %sw.bb3
        jmp     _f3                     ## TAILCALL

rdar://9147433

llvm-svn: 127953

824a7113

Add support for legalizing UINT_TO_FP of vectors on platforms which do · e7a101cc

Nadav Rotem authored Mar 19, 2011

not have native support for this operation (such as X86).
The legalized code uses two vector INT_TO_FP operations and is faster
than scalarizing.

llvm-svn: 127951

e7a101cc

Mar 18, 2011

Revert r127852; it's apparently causing an ICE on mingw. · 59721e32
Eli Friedman authored Mar 18, 2011
```
llvm-svn: 127909
```
59721e32
Support explicit argument forms for the X86 string instructions. · 3fbfcc0e
Joerg Sonnenberger authored Mar 18, 2011
```
For now, only the default segments are supported.

llvm-svn: 127875
```
3fbfcc0e

Add a target-specific branchless method for double-width relational · 1a916a3c

Eli Friedman authored Mar 18, 2011

comparisons on x86.  Essentially, the way this works is that SUB+SBB sets
the relevant flags the same way a double-width CMP would.

This is a substantial improvement over the generic lowering in LLVM. The output
is also shorter than the gcc-generated output; I haven't done any detailed
benchmarking, though.

llvm-svn: 127852

1a916a3c

Mar 17, 2011
- Move more logic into getTypeForExtArgOrReturn. · 2ef0c69d
  Cameron Zwarich authored Mar 17, 2011
```
llvm-svn: 127809
```
  2ef0c69d
- Rename getTypeForExtendedInteger() to getTypeForExtArgOrReturn(). · 34e7b3f7
  Cameron Zwarich authored Mar 17, 2011
```
llvm-svn: 127807
```
  34e7b3f7
- A couple new README entries. · e8f2be0c
  Eli Friedman authored Mar 17, 2011
```
llvm-svn: 127786
```
  e8f2be0c
Mar 16, 2011

The x86-64 ABI says that a bool is only guaranteed to be sign-extended to a byte · ac106273

Cameron Zwarich authored Mar 16, 2011

rather than an int. Thankfully, this only causes LLVM to miss optimizations, not
generate incorrect code.

This just fixes the zext at the return. We still insert an i32 ZextAssert when
reading a function's arguments, but it is followed by a truncate and another i8
ZextAssert so it is not optimized.

llvm-svn: 127766

ac106273

Mar 15, 2011

Enabled disassembler support for AVX instructions · b60b0bc4

Sean Callanan authored Mar 15, 2011

in the instruction tables and fixed a few bugs that
were causing decode conflicts.  Rudimentary tests
are coming up in the next patch.

llvm-svn: 127646

b60b0bc4

X86 table-generator and disassembler support for the AVX · c3fd5237

Sean Callanan authored Mar 15, 2011

instruction set.  This code adds support for the VEX prefix
and for the YMM registers accessible on AVX-enabled
architectures.  Instruction table support that enables AVX
instructions for the disassembler is in an upcoming patch.

llvm-svn: 127644

c3fd5237

Mar 11, 2011

Change the x86 32-bit scheduler to register pressure and fix up the · cf56a503

Eric Christopher authored Mar 11, 2011

corresponding testcases back to the previous versions.

Fixes some performance regressions only seen on 32-bit.

llvm-svn: 127441

cf56a503

Mar 10, 2011
- Revert 127359; it broke lencod. · d17ae4e9
  Stuart Hastings authored Mar 10, 2011
```
llvm-svn: 127382
```
  d17ae4e9
- Re-commit 127368 and 127371. They are exonerated. · b4c6a344
  Evan Cheng authored Mar 10, 2011
```
llvm-svn: 127380
```
  b4c6a344
- Revert 127368 and 127371 for now. · d4b3f8e0
  Evan Cheng authored Mar 09, 2011
```
llvm-svn: 127376
```
  d4b3f8e0
Mar 09, 2011

Change the definition of TargetRegisterInfo::getCrossCopyRegClass to be more · ca9a9363

Evan Cheng authored Mar 09, 2011

flexible.

If it returns a register class that's different from the input, then that's the
register class used for cross-register class copies.
If it returns a register class that's the same as the input, then no cross-
register class copies are needed (normal copies would do).
If it returns null, then it's not at all possible to copy registers of the
specified register class.

llvm-svn: 127368

ca9a9363

Fix a pasto that broke all x86_64-elf targets. · 801c9afd
Benjamin Kramer authored Mar 09, 2011
```
llvm-svn: 127365
```
801c9afd
X86 byval copies no longer always_inline. <rdar://problem/8706628> · 9955e2f9
Stuart Hastings authored Mar 09, 2011
```
llvm-svn: 127359
```
9955e2f9