- Oct 17, 2012
-
Jakob Stoklund Olesen authored
The previous MRI.isPhysRegUsed(YMM0) would also return true when the function contains a call to a function that may clobber YMM0. That's most of them. Checking the use-def chains allows us to skip functions that don't explicitly mention YMM registers. llvm-svn: 166110
-
Anton Korobeynikov authored
Patch by Job Noorman! llvm-svn: 166108
-
Andrew Trick authored
llvm-svn: 166107
-
Daniel Dunbar authored
- Similar to Path::eraseFromDisk(), we don't want LLVM to remove things like /dev/null, even if it has permission to do so. llvm-svn: 166105
-
Kostya Serebryany authored
llvm-svn: 166102
-
Chandler Carruth authored
a pointer. A very bad idea. Let's not do that. Fixes PR14105. Note that this wasn't *that* glaring of an oversight. Originally, these routines were only called on offsets within an alloca, which are intrinsically positive. But over the evolution of the pass, they ended up being called for arbitrary offsets, and things went downhill... llvm-svn: 166095
-
Chandler Carruth authored
revision makes no sense. We cannot use the address space of the *post indexed* type to conclude anything about a *pre indexed* pointer type's size. More importantly, this index can never be over a pointer. We are indexing over arrays and vectors here. Of course, I have no test case here. Neither did the original patch. =/ llvm-svn: 166091
-
Michael Liao authored
- All the required shuffle instructions, especially PSHUFB, were added in SSSE3. llvm-svn: 166086
-
Michael Liao authored
- An MBB address is only valid as an immediate value in the Small code model with static relocation. In other models, an LEA is needed to load the IP address of the restore MBB. - A minor fix to MBB handling in MC lowering is added as well so that the target relocation flag is propagated into MC. llvm-svn: 166084
-
Jakob Stoklund Olesen authored
This is just as fast, and it makes it possible to avoid leaking the UsedPhysRegs BitVector implementation through MachineRegisterInfo::addPhysRegsUsed(). llvm-svn: 166083
-
Eric Christopher authored
llvm-svn: 166077
-
Eric Christopher authored
llvm-svn: 166076
-
Jakob Stoklund Olesen authored
PR14098 contains an example where we would rematerialize a MOV8ri immediately after the original instruction:

    %vreg7:sub_8bit<def> = MOV8ri 9; GR32_ABCD:%vreg7
    %vreg22:sub_8bit<def> = MOV8ri 9; GR32_ABCD:%vreg7

Besides being pointless, it is also wrong: the original instruction only redefines part of the register, so the value read by the new instruction is wrong. The problem was that LiveRangeEdit::allUsesAvailableAt() didn't special-case OrigIdx == UseIdx and found the wrong SSA value. llvm-svn: 166068
-
Jakob Stoklund Olesen authored
A fix for PR14098, including the test case, is in the next commit. llvm-svn: 166067
-
- Oct 16, 2012
-
Michael Gottesman authored
An obfuscated splat is where the frontend generates poor code for a splat, using several different shuffles to build it, i.e.:

    %A = load <4 x float>* %in_ptr, align 16
    %B = shufflevector <4 x float> %A, <4 x float> undef, <4 x i32> <i32 0, i32 0, i32 undef, i32 undef>
    %C = shufflevector <4 x float> %B, <4 x float> %A, <4 x i32> <i32 0, i32 1, i32 4, i32 undef>
    %D = shufflevector <4 x float> %C, <4 x float> %A, <4 x i32> <i32 0, i32 1, i32 2, i32 4>

llvm-svn: 166061
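For context, a sketch of my own (not from the commit): every lane of %D above traces back to element 0 of %A, so the whole chain is equivalent to a single splat shuffle:

    %D = shufflevector <4 x float> %A, <4 x float> undef, <4 x i32> zeroinitializer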
-
Chad Rosier authored
llvm-svn: 166054
-
Jakub Staszak authored
llvm-svn: 166053
-
Michael Liao authored
llvm-svn: 166051
-
Jakub Staszak authored
llvm-svn: 166050
-
Michael Liao authored
llvm-svn: 166049
-
Rafael Espindola authored
llvm+clang+compiler-rt bootstrap. llvm-svn: 166046
-
Jakub Staszak authored
llvm-svn: 166045
-
Michael Liao authored
- Add custom FP_TO_SINT lowering for v8i16 (and for v8i8, which is legalized as v8i16 due to vector element-wise widening) to reduce the DAG combiner work and the overhead it adds in the X86 backend. llvm-svn: 166036
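A hedged illustration (mine, not from the commit) of IR that produces an FP_TO_SINT node with a v8i16 result and so would exercise this lowering:

    define <8 x i16> @cvt(<8 x float> %x) {
      %r = fptosi <8 x float> %x to <8 x i16>
      ret <8 x i16> %r
    }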
-
Bill Schmidt authored
For the PowerPC 64-bit ELF Linux ABI, aggregates of size less than 8 bytes are to be passed in the low-order bits ("right-adjusted") of the doubleword register or memory slot assigned to them. A previous patch addressed this for aggregates passed in registers. However, small aggregates passed in the overflow portion of the parameter save area are still being passed left-adjusted.

The fix is made in PPCTargetLowering::LowerCall_Darwin_Or_64SVR4 on the caller side, and in PPCTargetLowering::LowerFormalArguments_64SVR4 on the callee side. The main fix on the callee side simply extends existing logic for 1- and 2-byte objects to 1- through 7-byte objects, and corrects a constant left over from 32-bit code. There is also a fix to a bogus calculation of the offset to the following argument in the parameter save area. On the caller side, again a constant left over from 32-bit code is fixed. Additionally, some code for 1-, 2-, and 4-byte objects is duplicated to handle the 3-, 5-, 6-, and 7-byte objects for SVR4 only. The LowerCall_Darwin_Or_64SVR4 logic is getting fairly convoluted trying to handle both ABIs, and I propose to separate this into two functions in a future patch, at which time the duplication can be removed.

The patch adds a new test (structsinmem.ll) to demonstrate correct passing of structures of all seven sizes. Eight dummy parameters are used to force these structures into the overflow portion of the parameter save area. As a side effect, this corrects the case where aggregates passed in registers are saved into the first eight doublewords of the parameter save area: previously they were stored left-justified, and now they are properly stored right-justified. This requires changing the expected output of the existing test case structsinregs.ll. llvm-svn: 166022
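A hedged sketch (names and values are illustrative, not the commit's test) of the shape of such a test: eight integer arguments consume the parameter registers, so the byval aggregate lands in the overflow portion of the parameter save area, where it must be right-adjusted within its doubleword slot:

    %struct.S3 = type { i8, i8, i8 }

    declare void @callee(i64, i64, i64, i64, i64, i64, i64, i64, %struct.S3* byval)

    define void @caller(%struct.S3* %p) {
      call void @callee(i64 1, i64 2, i64 3, i64 4, i64 5, i64 6, i64 7, i64 8, %struct.S3* byval %p)
      ret void
    }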
-
Stepan Dyatkovskiy authored
The stack is formed improperly for long structures passed as byval arguments in EABI mode. The AAPCS reference contains the following statements:

A: "If the argument requires double-word alignment (8-byte), the NCRN (Next Core Register Number) is rounded up to the next even register number." (5.5 Parameter Passing, Stage C, C.3).

B: "The alignment of an aggregate shall be the alignment of its most-aligned component." (4.3 Composite Types, 4.3.1 Aggregates).

So if we have a structure with doubles (9 double fields) and 3 unused core registers (r1, r2, r3), the caller should use registers r2 and r3 only. Currently the set r1, r2, r3 is used, which is invalid. The callee's VA routine should likewise use only r2 and r3; that side is already correct, since its behaviour is achieved by rounding up the SP address with ADD+BFC operations.

Fix: The main fix is in ARMTargetLowering::HandleByVal. If we detect AAPCS mode and 8-byte alignment, we skip the odd registers.

P.S.: I also improved the LDRB_POST_IMM regression test, since the ldrb instruction will no longer be generated by the current regression test after this patch. llvm-svn: 166018
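A hedged illustration (mine, not the commit's test case) of the scenario described: with r0 consumed by the i32 argument, the 8-byte-aligned byval aggregate must start at an even register, so r1 is skipped, r2 and r3 carry the first two words, and the remainder is passed on the stack:

    %struct.D9 = type { double, double, double, double, double, double, double, double, double }

    declare void @f(i32, %struct.D9* byval align 8)

    define void @g(%struct.D9* %s) {
      call void @f(i32 0, %struct.D9* byval align 8 %s)
      ret void
    }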
-
NAKAMURA Takumi authored
Original message: The attached is the fix for radar://11663049. The optimization can be outlined by the following rules:

(select (x != c), e, c) -> (select (x != c), e, x)
(select (x == c), c, e) -> (select (x == c), x, e)

where <c> is an integer constant. The reason for this change is that on x86, a conditional move from a constant needs two instructions, whereas a conditional move from a register needs only one. While LowerSELECT() sounds like the most convenient place for this optimization, it turns out to be a bad place: by replacing the constant <c> with a symbolic value, it obscures some instruction-combining opportunities which would otherwise be very easy to spot. For that reason, I had to postpone the change to the last instruction-combining phase.

The change passes the test of "make check-all -C <build-root>/test" and "make -C project/test-suite/SingleSource".

Original message since r165661: My previous change had a bug: I negated the condition code of a CMOV, but went ahead creating the new CMOV using the *ORIGINAL* condition code. llvm-svn: 166017
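A hedged before/after sketch (constants and names are mine, not from the commit) of the first rule:

    ; before: the false operand is the constant 42, so the backend must
    ; materialize 42 in a register before the CMOV
    define i32 @before(i32 %x, i32 %e) {
      %cmp = icmp ne i32 %x, 42
      %sel = select i1 %cmp, i32 %e, i32 42
      ret i32 %sel
    }

    ; after: when %cmp is false we know %x == 42, so %x can stand in for
    ; the constant and the CMOV takes two register operands
    define i32 @after(i32 %x, i32 %e) {
      %cmp = icmp ne i32 %x, 42
      %sel = select i1 %cmp, i32 %e, i32 %x
      ret i32 %sel
    }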
-
Bill Wendling authored
llvm-svn: 166016
-
Craig Topper authored
llvm-svn: 166014
-
Bill Wendling authored
llvm-svn: 166013
-
Bill Wendling authored
llvm-svn: 166012
-
Bill Wendling authored
llvm-svn: 166011
-
Bill Wendling authored
Use the Attributes::get method which takes an AttrVal value directly to simplify the code a bit. No functionality change. llvm-svn: 166009
-
Bill Wendling authored
llvm-svn: 166008
-
Bill Wendling authored
llvm-svn: 166007
-
Craig Topper authored
llvm-svn: 166004
-
Andrew Trick authored
This is a medium term workaround until we have a more robust solution in the form of a register liveness utility for postRA passes. llvm-svn: 166001
-
Jakob Stoklund Olesen authored
llvm-svn: 165999
-
Jakob Stoklund Olesen authored
Clients can use the equivalent functions in MRI. llvm-svn: 165990
-
Michael Liao authored
- Besides being used in SjLj exception handling, __builtin_setjmp/__longjmp is also used as a light-weight replacement for setjmp/longjmp, e.g. to implement continuations, user-level threading, and so on. The support added in this patch ONLY addresses this usage and is NOT intended to support SjLj exception handling, as zero-cost DWARF exception handling is used by default on X86. llvm-svn: 165989
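For reference, a minimal sketch (mine, not from the commit) of the IR a frontend emits for __builtin_setjmp on top of these intrinsics, assuming the documented buffer layout (frame address in slot 0, stack pointer in slot 2):

    @buf = global [5 x i8*] zeroinitializer

    declare i8* @llvm.frameaddress(i32)
    declare i8* @llvm.stacksave()
    declare i32 @llvm.eh.sjlj.setjmp(i8*)

    define i32 @checkpoint() {
      ; slot 0: frame pointer of the current function
      %fp = call i8* @llvm.frameaddress(i32 0)
      store i8* %fp, i8** getelementptr ([5 x i8*]* @buf, i32 0, i32 0)
      ; slot 2: current stack pointer
      %sp = call i8* @llvm.stacksave()
      store i8* %sp, i8** getelementptr ([5 x i8*]* @buf, i32 0, i32 2)
      ; returns 0 on the direct path, nonzero when resumed via longjmp
      %r = call i32 @llvm.eh.sjlj.setjmp(i8* bitcast ([5 x i8*]* @buf to i8*))
      ret i32 %r
    }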
-
Jakob Stoklund Olesen authored
All callers can simply use the corresponding MRI functions. llvm-svn: 165985
-