Commits · bd847cc5622cabc9abb1a59e950981cdf9418f96 · Roger Ferrer / llvm-epi-0.8

Oct 15, 2012

ARM: v1i64 and v2i64 VBSL intrinsic support. · 54c7432e
Jim Grosbach authored Oct 15, 2012
```
rdar://12502028

llvm-svn: 165981
```
54c7432e

Move the Attributes::Builder outside of the Attributes class and into its own... · 50d27849

Bill Wendling authored Oct 15, 2012

Move the Attributes::Builder outside of the Attributes class and into its own class named AttrBuilder. No functionality change.

llvm-svn: 165960

50d27849

[ms-inline asm] If we parsed a statement and the opcode is valid, then it's an instruction. · f3bc5996
Chad Rosier authored Oct 15, 2012
```
llvm-svn: 165955
```
f3bc5996
[ms-inline asm] Update the end loc for ParseIntelMemOperand. · 499d4a14
Chad Rosier authored Oct 15, 2012
```
llvm-svn: 165947
```
499d4a14
[ms-inline asm] Use incoming argument rather than hard coding to false. · ca0ada1a
Chad Rosier authored Oct 15, 2012
```
llvm-svn: 165945
```
ca0ada1a

Resubmit the changes to llvm core to update the functions to support different... · 4bb926d9

Micah Villmow authored Oct 15, 2012

Resubmit the changes to llvm core to update the functions to support different pointer sizes on a per address space basis.

llvm-svn: 165941

4bb926d9

PowerPC: add EmitTCEntry class for TOC creation · ef206f19

Adhemerval Zanella authored Oct 15, 2012

This patch replaces the EmitRawText by a EmitTCEntry class (specialized for
each Streamer) in PowerPC64 TOC entry creation.

llvm-svn: 165940

ef206f19

Fixed PR13938: the ARM backend was crashing because it couldn't select a... · b1409700

Silviu Baranga authored Oct 15, 2012

Fixed PR13938: the ARM backend was crashing because it couldn't select a VDUPLANE node with the vector input size different from the output size. This was bacause the BUILD_VECTOR lowering code didn't check that the size of the input vector was correct for using VDUPLANE.

llvm-svn: 165929

b1409700

Attributes Rewrite · d079a446

Bill Wendling authored Oct 15, 2012

Convert the internal representation of the Attributes class into a pointer to an
opaque object that's uniqued by and stored in the LLVMContext object. The
Attributes class then becomes a thin wrapper around this opaque
object. Eventually, the internal representation will be expanded to include
attributes that represent code generation options, etc.

llvm-svn: 165917

d079a446

Oct 14, 2012
- Don't pass in an Attributes object to something that expects an integral value. · 9e1eb4d1
  Bill Wendling authored Oct 14, 2012
```
llvm-svn: 165887
```
  9e1eb4d1
Oct 13, 2012

X86: Disable long nops for all cpus prior to pentiumpro/i686. · 35480284
Benjamin Kramer authored Oct 13, 2012
```
llvm-svn: 165878
```
35480284
X86: Fix accidentally swapped operands. · ecd15d7f
Benjamin Kramer authored Oct 13, 2012
```
llvm-svn: 165871
```
ecd15d7f

X86: Promote i8 cmov when both operands are coming from truncates of the same width. · d6b9362f

Benjamin Kramer authored Oct 13, 2012

X86 doesn't have i8 cmovs so isel would emit a branch. Emitting branches at this
level is often not a good idea because it's too late for many optimizations to
kick in. This solution doesn't add any extensions (truncs are free) and tries
to avoid introducing partial register stalls by filtering direct copyfromregs.

I'm seeing a ~10% speedup on reading a random .png file with libpng15 via
graphicsmagick on x86_64/westmere, but YMMV depending on the microarchitecture.

llvm-svn: 165868

d6b9362f

[ms-inline asm] Remove the MatchInstruction() function. Previously, this was · 49963555

Chad Rosier authored Oct 13, 2012

the interface between the front-end and the MC layer when parsing inline
assembly.  Unfortunately, this is too deep into the parsing stack. Specifically,
we're unable to handle target-independent assembly (i.e., assembly directives,
labels, etc.).  Note the MatchAndEmitInstruction() isn't the correct
abstraction either.  I'll be exposing target-independent hooks shortly, so this
is really just a cleanup.

llvm-svn: 165858

49963555

ARM: tail-call inside a function where part of a byval argument is on caller's · 7e48b252

Manman Ren authored Oct 12, 2012

local frame causes problem.

For example:
void f(StructToPass s) {
  g(&s, sizeof(s));
}
will cause problem with tail-call since part of s is passed via registers and
saved in f's local frame. When g tries to access s, part of s may be corrupted
since f's local frame is popped out before the tail-call.

The current fix is to disable tail-call if getVarArgsRegSaveSize is not 0 for
the caller. This is a conservative approach, if we can prove the address of
s or part of s is not taken and passed to g, it should be okay to perform
tail-call.

rdar://12442472

llvm-svn: 165853

7e48b252

[ms-inline asm] Capitalize per coding standard. · 4453e845
Chad Rosier authored Oct 12, 2012
```
llvm-svn: 165847
```
4453e845

ARM: Mark VSELECT as 'expand'. · 30af442a

Jim Grosbach authored Oct 12, 2012

The backend already pattern matches to form VBSL when it can. We may want to
teach it to use the vbsl intrinsics at some point to prevent machine licm from
mucking with this, but using the Expand is completely correct.

http://llvm.org/bugs/show_bug.cgi?id=13831
http://llvm.org/bugs/show_bug.cgi?id=13961

Patch by Peter Couperus <peter.couperus@st.com>.

llvm-svn: 165845

30af442a

[ms-inline asm] Use the new API introduced in r165830 in lieu of the · 2f480a8a
Chad Rosier authored Oct 12, 2012
```
MapAndConstraints vector.  Also remove the unused Kind argument.

llvm-svn: 165833
```
2f480a8a

Oct 12, 2012
- Div, Rem int/unsigned int · cf11c59e
  Reed Kotler authored Oct 12, 2012
```
llvm-svn: 165783
```
  cf11c59e
- Remove unnecessary classof()'s · 506a1c5a
  Sean Silva authored Oct 11, 2012
```
isa<> et al. automatically infer when the cast is an upcast (including a
self-cast), so these are no longer necessary.

llvm-svn: 165767
```
  506a1c5a
Oct 11, 2012

Revert 165732 for further review. · 0c61134d
Micah Villmow authored Oct 11, 2012
```
llvm-svn: 165747
```
0c61134d

Add in the first iteration of support for llvm/clang/lldb to allow variable... · 08318973

Micah Villmow authored Oct 11, 2012

Add in the first iteration of support for llvm/clang/lldb to allow variable per address space pointer sizes to be optimized correctly.

llvm-svn: 165726

08318973

This patch addresses PR13947. · 22162470

Bill Schmidt authored Oct 11, 2012

For function calls on the 64-bit PowerPC SVR4 target, each parameter
is mapped to as many doublewords in the parameter save area as
necessary to hold the parameter.  The first 13 non-varargs
floating-point values are passed in registers; any additional
floating-point parameters are passed in the parameter save area.  A
single-precision floating-point parameter (32 bits) must be mapped to
the second (rightmost, low-order) word of its assigned doubleword
slot.

Currently LLVM violates this ABI requirement by mapping such a
parameter to the first (leftmost, high-order) word of its assigned
doubleword slot.  This is internally self-consistent but will not
interoperate correctly with libraries compiled with an ABI-compliant
compiler.

This patch corrects the problem by adjusting the parameter addressing
on both sides of the calling convention.

llvm-svn: 165714

22162470

Expose move to/from coprocessor instructions in MIPS64 mode. · 6a00ab4b

David Chisnall authored Oct 11, 2012

Note: [D]M{T,F}CP2 is just a recommended encoding. Vendors often provide a
custom CP2 that interprets instructions differently and may wish to add their
own instructions that use this opcode. We should ensure that this is easy to
do. I will probably add a 'has custom CP{0-3}' subtarget flag to make this
easy: We want to avoid the GCC situation where every MIPS vendor makes a custom
fork that breaks every other MIPS CPU and so can't be merged upstream.

llvm-svn: 165711

6a00ab4b

Revert r165661, "Patch by Shuxin Yang <shuxin.llvm@gmail.com>." · da0730c2
NAKAMURA Takumi authored Oct 11, 2012
```
It broke stage2 clang and test-suite/MultiSource/Benchmarks/mediabench/g721/g721encode.

llvm-svn: 165692
```
da0730c2
Change MachineInstrBuilder::addDisp to copy over target flags by default. · 60a25a57
Evan Cheng authored Oct 11, 2012
```
llvm-svn: 165677
```
60a25a57
Add isel patterns for v2f32 / v4f32 neon.vbsl intrinsics. rdar://12471808 · 0f2565fd
Evan Cheng authored Oct 10, 2012
```
llvm-svn: 165673
```
0f2565fd
Add getters for the MIPS TargetTransform classes · ffd3bf41
Nadav Rotem authored Oct 10, 2012
```
llvm-svn: 165670
```
ffd3bf41
Remove unused member variable introduced in r165665. · e249f18e
David Blaikie authored Oct 10, 2012
```
llvm-svn: 165669
```
e249f18e

· e1032873

Nadav Rotem authored Oct 10, 2012

Add a new interface to allow IR-level passes to access codegen-specific information.

llvm-svn: 165665

e1032873

Oct 10, 2012

Patch by Shuxin Yang <shuxin.llvm@gmail.com>. · 17418964

Nadav Rotem authored Oct 10, 2012

Original message:

The attached is the fix to radar://11663049. The optimization can be outlined by following rules:

   (select (x != c), e, c) -> select (x != c), e, x),
   (select (x == c), c, e) -> select (x == c), x, e)
where the <c> is an integer constant.

 The reason for this change is that : on x86, conditional-move-from-constant needs two instructions;
however, conditional-move-from-register need only one instruction.

  While the LowerSELECT() sounds to be the most convenient place for this optimization, it turns out to be a bad place. The reason is that by replacing the constant <c> with a symbolic value, it obscure some instruction-combining opportunities which would otherwise be very easy to spot. For that reason, I have to postpone the change to last instruction-combining phase.

  The change passes the test of "make check-all -C <build-root/test" and "make -C project/test-suite/SingleSource".

llvm-svn: 165661

17418964

When generating spill and reload code for vector registers on PowerPC, · b9bc4740

Bill Schmidt authored Oct 10, 2012

the compiler makes use of GPR0.  However, there are two flavors of
GPR0 defined by the target:  the 32-bit GPR0 (R0) and the 64-bit GPR0
(X0).  The spill/reload code makes use of R0 regardless of whether we
are generating 32- or 64-bit code.

This patch corrects the problem in the obvious manner, using X0 and
ADDI8 for 64-bit and R0 and ADDI for 32-bit.

llvm-svn: 165658

b9bc4740

The PowerPC VRSAVE register has been somewhat of an odd beast since · 38d94587

Bill Schmidt authored Oct 10, 2012

the Altivec extensions were introduced.  Its use is optional, and
allows the compiler to communicate to the operating system which
vector registers should be saved and restored during a context switch.
In practice, this information is ignored by the various operating
systems using the SVR4 ABI; the kernel saves and restores the entire
register state.  Setting the VRSAVE register is no longer performed by
the AIX XL compilers, the IBM i compilers, or by GCC on Power Linux
systems.  It seems best to avoid this logic within LLVM as well.

This patch avoids generating code to update and restore VRSAVE for the
PowerPC SVR4 ABIs (32- and 64-bit).  The code remains in place for the
Darwin ABI.

llvm-svn: 165656

38d94587

Add support for FP_ROUND from v2f64 to v2f32 · e999b865

Michael Liao authored Oct 10, 2012

- Due to the current matching vector elements constraints in
  ISD::FP_ROUND, rounding from v2f64 to v4f32 (after legalization from
  v2f32) is scalarized. Add a customized v2f32 widening to convert it
  into a target-specific X86ISD::VFPROUND to work around this
  constraints.

llvm-svn: 165631

e999b865

Add alternative support for FP_ROUND from v2f32 to v2f64 · effae0c8

Michael Liao authored Oct 10, 2012

- Due to the current matching vector elements constraints in ISD::FP_EXTEND,
  rounding from v2f32 to v2f64 is scalarized. Add a customized v2f32 widening
  to convert it into a target-specific X86ISD::VFPEXT to work around this
  constraints. This patch also reverts a previous attempt to fix this issue by
  recovering the scalarized ISD::FP_EXTEND pattern and thus significantly
  reduces the overhead of supporting non-power-2 vector FP extend.

llvm-svn: 165625

effae0c8

Fix for LDRB instruction: · 283baa00

Stepan Dyatkovskiy authored Oct 10, 2012

SDNode for LDRB_POST_IMM is invalid: number of registers added to SDNode fewer
that described in .td.

7 ops is needed, but SDNode with only 6 is created.

In more details:
In ARMInstrInfo.td, in multiclass AI2_ldridx, in definition _POST_IMM, offset
operand is defined as am2offset_imm. am2offset_imm is complex parameter type,
and actually it consists from dummy register and imm itself. As I understood
trick with dummy reg was made for AsmParser. In ARMISelLowering.cpp, this dummy
register was not added to SDNode, and it cause crash in Peephole Optimizer pass.

The problem fixed by setting up additional dummy reg when emitting
LDRB_POST_IMM instruction.

llvm-svn: 165617

283baa00

Issue description: · f13dbb8e

Stepan Dyatkovskiy authored Oct 10, 2012

SchedulerDAGInstrs::buildSchedGraph ignores dependencies between FixedStack
objects and byval parameters. So loading byval parameters from stack may be
inserted *before* it will be stored, since these operations are treated as
independent.

Fix:
Currently ARMTargetLowering::LowerFormalArguments saves byval registers with
FixedStack MachinePointerInfo. To fix the problem we need to store byval
registers with MachinePointerInfo referenced to first the "byval" parameter.

Also commit adds two new fields to the InputArg structure: Function's argument
index and InputArg's part offset in bytes relative to the start position of
Function's argument. E.g.: If function's argument is 128 bit width and it was
splitted onto 32 bit regs, then we got 4 InputArg structs with same arg index,
but different offset values. 

llvm-svn: 165616

f13dbb8e

Remove the final bits of Attributes being declared in the Attribute · bbcdf4e2
Bill Wendling authored Oct 10, 2012
```
namespace. Use the attribute's enum value instead. No functionality change
intended.

llvm-svn: 165610
```
bbcdf4e2

misched: Use the TargetSchedModel interface wherever possible. · dd79f0fc

Andrew Trick authored Oct 10, 2012

Allows the new machine model to be used for NumMicroOps and OutputLatency.

Allows the HazardRecognizer to be disabled along with itineraries.

llvm-svn: 165603

dd79f0fc

whitespace · d9296ec2
Andrew Trick authored Oct 10, 2012
```
llvm-svn: 165601
```
d9296ec2