Commits · 3a1fd4c0ac7f5a17d271c895a1597273ac8252c8 · Roger Ferrer / llvm-epi-0.8

Jun 01, 2013

X86: change MOV64ri64i32 into MOV32ri64 · 3a1fd4c0

Tim Northover authored Jun 01, 2013

The MOV64ri64i32 instruction required hacky MCInst lowering because it
was allocated as setting a GR64, but the eventual instruction ("movl")
only set a GR32. This converts it into a so-called "MOV32ri64" which
still accepts an (appropriate) 64-bit immediate but defines a GR32.
This is then converted to the full GR64 by a SUBREG_TO_REG operation,
thus keeping everyone happy.

This fixes a typo in the opcode field of the original patch, which
should make the legacy JIT work again (& adds a test for that problem).

llvm-svn: 183068

3a1fd4c0

[Sparc] Generate correct code for leaf functions with stack objects · 3521dcdc
Venkatraman Govindaraju authored Jun 01, 2013
```
llvm-svn: 183067
```
3521dcdc

Make SubRegIndex size mandatory, following r183020. · b1a4d9da

Ahmed Bougacha authored May 31, 2013

This also makes TableGen able to compute sizes/offsets of synthesized
indices representing tuples.

llvm-svn: 183061

b1a4d9da

Temporarily Revert "X86: change MOV64ri64i32 into MOV32ri64" as it · e1e57e5e
Eric Christopher authored May 31, 2013
```
seems to have caused PR16192 and other JIT related failures.

llvm-svn: 183059
```
e1e57e5e

May 31, 2013

NVPTX: Don't even create a regalloc if we're not going to use it. · fae7ff12
Benjamin Kramer authored May 31, 2013
```
Fixes a leak found by valgrind.

llvm-svn: 183031
```
fae7ff12

Add a way to define the bit range covered by a SubRegIndex. · f1ed334d

Ahmed Bougacha authored May 31, 2013

NOTE: If this broke your out-of-tree backend, in *RegisterInfo.td, change
the instances of SubRegIndex that have a comps template arg to use the
ComposedSubRegIndex class instead.

In TableGen land, this adds Size and Offset attributes to SubRegIndex,
and the ComposedSubRegIndex class, for which the Size and Offset are
computed by TableGen. This also adds an accessor in MCRegisterInfo, and
Size/Offsets for the X86 and ARM subreg indices.

llvm-svn: 183020

f1ed334d
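To illustrate the Size and Offset attributes mentioned in f1ed334d above, here is a minimal C++ sketch of how the bit range of a composed index could be derived from its two components. The struct and function names are hypothetical and this is not TableGen's implementation. It assumes the composed offset is the sum of the two offsets and the composed size is the size of the inner index.

```cpp
#include <cassert>

// Bit range covered by a sub-register index, measured from bit 0 of the
// containing register. Field names mirror the Size/Offset attributes;
// the struct itself is hypothetical.
struct SubRegRange {
  unsigned Offset;
  unsigned Size;
};

// Range of "take Outer, then take Inner inside it".
// Assumption: offsets add up and the result has the inner index's size.
inline SubRegRange compose(SubRegRange Outer, SubRegRange Inner) {
  assert(Inner.Offset + Inner.Size <= Outer.Size && "inner range must fit");
  return SubRegRange{Outer.Offset + Inner.Offset, Inner.Size};
}
```

For example, composing the upper 64-bit half of a 128-bit register, `{64, 64}`, with the upper 32-bit half of that, `{32, 32}`, gives `{96, 32}`.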

ARM: permit upper-case BE/LE on setend instruction · 4d141440
Tim Northover authored May 31, 2013
```
Patch by Amaury de la Vieuville.

llvm-svn: 183012
```
4d141440

ARM: add fstmx and fldmx instructions for assembly · 4173e29a

Tim Northover authored May 31, 2013

These instructions are deprecated oddities, but we still need to be able to
disassemble (and reassemble) them if and when they're encountered.

Patch by Amaury de la Vieuville.

llvm-svn: 183011

4173e29a

ARM: fix VEXT encoding corner case · 1bb672da

Tim Northover authored May 31, 2013

The disassembly of VEXT instructions was too lax in the bits checked. This
fixes the case where the instruction affects Q-registers but a misaligned lane
was specified (should be UNDEFINED).

Patch by Amaury de la Vieuville.

llvm-svn: 183003

1bb672da

[SystemZ] Don't use LOAD and STORE REVERSED for volatile accesses · 30efd87f

Richard Sandiford authored May 31, 2013

Unlike most -- hopefully "all other", but I'm still checking -- memory
instructions we support, LOAD REVERSED and STORE REVERSED may access
the memory location several times.  This means that they are not suitable
for volatile loads and stores.

This patch is a prerequisite for better atomic load and store support.
The same principle applies there: almost all memory instructions we
support are inherently atomic ("block concurrent"), but LOAD REVERSED
and STORE REVERSED are exceptions.

Other instructions continue to allow volatile operands.  I will add
positive "allows volatile" tests at the same time as the "allows atomic
load or store" tests.

llvm-svn: 183002

30efd87f

[NVPTX] Re-enable support for virtual registers in the final output · dbb3b2f4

Justin Holewinski authored May 31, 2013

Now that 3.3 is branched, we are re-enabling virtual registers to help
iron out bugs before the next release. Some of the post-RA passes do
not play well with virtual registers, so we disable them for now. The
needed functionality of the PrologEpilogInserter pass is copied to a
new backend-specific NVPTXPrologEpilog pass.

The test for this commit is not breaking the existing tests.

llvm-svn: 182998

dbb3b2f4

X86: change MOV64ri64i32 into MOV32ri64 · d4736d67

Tim Northover authored May 31, 2013

The MOV64ri64i32 instruction required hacky MCInst lowering because it was
allocated as setting a GR64, but the eventual instruction ("movl") only set a
GR32. This converts it into a so-called "MOV32ri64" which still accepts an
(appropriate) 64-bit immediate but defines a GR32. This is then converted to
the full GR64 by a SUBREG_TO_REG operation, thus keeping everyone happy.

llvm-svn: 182991

d4736d67

[mips] Big-endian code generation for atomic instructions. · 2bf97336
Akira Hatanaka authored May 31, 2013
```
Patch by Jyun-Yan You.

llvm-svn: 182984
```
2bf97336

May 30, 2013

Revert r182937 and r182877. · 99bd2ae4

Rafael Espindola authored May 30, 2013

r182877 broke MCJIT tests on ARM and r182937 was working around another failure
caused by r182877.

This should make the ARM bots green.

llvm-svn: 182960

99bd2ae4

X86: use sub-register sequences for MOV*r0 operations · 64ec0ff4

Tim Northover authored May 30, 2013

Instead of having a bunch of separate MOV8r0, MOV16r0, ... pseudo-instructions,
it's better to use a single MOV32r0 (which will expand to "xorl %reg, %reg")
and obtain other sizes with EXTRACT_SUBREG and SUBREG_TO_REG. The encoding is
smaller and partial register updates can sometimes be avoided.

Until recently, this sequence was a barrier to rematerialization though. That
should now be fixed so it's an appropriate time to make the change.

llvm-svn: 182928

64ec0ff4

[NVPTX] Fix case where a sext load of an i1 type may produce an · 994d66a3
Justin Holewinski authored May 30, 2013
```
ld.u1 instead of an ld.u8.

llvm-svn: 182924
```
994d66a3

X86: change zext moves to use sub-register infrastructure. · 04eb4234

Tim Northover authored May 30, 2013

32-bit writes on amd64 zero out the high bits of the corresponding 64-bit
register. LLVM makes use of this for zero-extension, but until now relied on
custom MCLowering and other code to fix up instructions. Now that we have proper
handling of sub-registers, this can be done by creating SUBREG_TO_REG
instructions at selection-time.

Should be no change in functionality.

llvm-svn: 182921

04eb4234
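The hardware behaviour that 04eb4234 above relies on can be modelled with a small C++ sketch. The `Reg64` type and `zext32` helper are hypothetical names used only for illustration and are not LLVM code. The model shows that a 32-bit write clears the upper half of the 64-bit register, whereas a 16-bit write leaves the upper bits untouched.

```cpp
#include <cstdint>

// Toy model of one 64-bit general-purpose register (e.g. RAX).
struct Reg64 {
  std::uint64_t bits = 0;

  // 32-bit write (e.g. "movl"): the upper 32 bits are cleared.
  void write32(std::uint32_t v) { bits = v; }

  // 16-bit write (e.g. "movw"): the upper 48 bits are preserved.
  void write16(std::uint16_t v) {
    bits = (bits & ~std::uint64_t{0xFFFF}) | v;
  }
};

// Zero-extension from 32 to 64 bits needs nothing beyond the 32-bit write,
// whatever the register held before.
inline std::uint64_t zext32(std::uint64_t oldValue, std::uint32_t v) {
  Reg64 r;
  r.bits = oldValue;
  r.write32(v);
  return r.bits;
}
```

This is why a 32-bit definition followed by SUBREG_TO_REG can stand in for an explicit zero-extending move.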

[SystemZ] Enable unaligned accesses · 46af5a2c

Richard Sandiford authored May 30, 2013

The code to distinguish between unaligned and aligned addresses was
already there, so this is mostly just a switch-on-and-test process.

llvm-svn: 182920

46af5a2c

Order CALLSEQ_START and CALLSEQ_END nodes. · ad6d08ac

Andrew Trick authored May 29, 2013

Fixes PR16146: gdb.base__call-ar-st.exp fails after
pre-RA-sched=source fixes.

Patch by Xiaoyi Guo!

This also fixes an unsupported dbg.value test case. Codegen was
previously incorrect but the test was passing by luck.

llvm-svn: 182885

ad6d08ac

May 29, 2013

X86: Fix Defs/Uses for insts that imp-def/imp-use both an A-register and EFLAGS. · 00e08db3

Ahmed Bougacha authored May 29, 2013

This corrects a problem where x86 instructions that implicitly define/use both
an A-register (RAX, EAX, ..) and EFLAGS were declared as only defining/using
EFLAGS, because the outer "let Defs/Uses = [EFLAGS]" in the various multiclasses
overrides the "let Defs/Uses = [areg]" in BinOpAI.

The instructions deriving from BinOpAI were moved out of the "let Defs", and a
BinOpAI_FF class was created, for instructions that implicitly define and use
EFLAGS and the A-register (SBB, ADC).

llvm-svn: 182883

00e08db3

Don't assume the registers will be enumerated sequentially. · 33b73662
Chad Rosier authored May 29, 2013
```
llvm-svn: 182879
```
33b73662

Enable FastISel on ARM for Linux and NaCl · f60e0e44

JF Bastien authored May 29, 2013

FastISel was only enabled for iOS ARM and Thumb2, this patch enables it
for ARM (not Thumb2) on Linux and NaCl.

Thumb2 support needs a bit more work, mainly around register class
restrictions.

The patch punts to SelectionDAG when doing TLS relocation on non-Darwin
targets. I will fix this and other FastISel-to-SelectionDAG failures in
a separate patch.

The patch also forces FastISel to retain frame pointers: iOS always
keeps them for backtracking (so emitted code won't change because of
this), but Linux was getting much worse code that was incorrect when
using big frames (such as test-suite's lencod). I'll also fix this in a
later patch; it will probably require a peephole so that FastISel
doesn't rematerialize frame pointers back-to-back.

The test changes are straightforward, similar to:
  http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20130513/174279.html
They also add a vararg test that got dropped in that change.

I ran all of test-suite on A15 hardware with --optimize-option=-O0 and
all the tests pass.

llvm-svn: 182877

f60e0e44

Don't reach into the middle of TargetMachine and cache one of its ivars. · 70b1400e
Bill Wendling authored May 29, 2013
```
Not only does this break encapsulation, it's gross.

llvm-svn: 182876
```
70b1400e

Tidy some register classes for ARM and Thumb · 13969d0a

JF Bastien authored May 29, 2013

Tidy up three places where the register class for ARM and Thumb wasn't
restrictive enough:
 - No PC dest for reg-reg add/orr/sub.
 - No PC dest for shifts.
 - No PC or SP for Thumb2 reg-imm add.

I encountered this while combining FastISel with
-verify-machineinstrs. These instructions defined registers whose
classes weren't restrictive enough, and the uses failed
verification. They're also undefined in the ISA, or would produce code
that FastISel wouldn't want. This doesn't fix the register class
narrowing issue (where uses should restrict definitions), and isn't
thorough, but it's a small step in the right direction.

llvm-svn: 182863

13969d0a

SparcFrameLowering.cpp: Mark verifyLeafProcRegUse() as UNUSED. [-Wunused-function] · dbd3bbe1
NAKAMURA Takumi authored May 29, 2013
```
llvm-svn: 182850
```
dbd3bbe1
[SystemZ] Immediate compare-and-branch support · e1d9f00f
Richard Sandiford authored May 29, 2013
```
This patch adds support for the CIJ and CGIJ instructions.

llvm-svn: 182846
```
e1d9f00f
Temporary fix to get rid of gcc warning. · ae8faf2e
Patrik Hagglund authored May 29, 2013
```
llvm-svn: 182832
```
ae8faf2e
[Sparc] Add support for leaf functions in sparc backend. · ca0fe2f5
Venkatraman Govindaraju authored May 29, 2013
```
llvm-svn: 182822
```
ca0fe2f5

Mips assembler: Improve set register alias handling · 02593003

Jack Carter authored May 28, 2013

This patch solves the problem of numeric register values not being accepted:

../set_alias.s:1:11: error: expected valid expression after comma
        .set    r4,$4
                    ^
The parsing of the .set directive and the handling of symbols in code
are changed to enable this feature.

A test example is added.

Patch by Vladimir Medic

llvm-svn: 182807

02593003

May 28, 2013

AArch64: clarify -help message · 8a1aa518
Tim Northover authored May 28, 2013
```
llvm-svn: 182804
```
8a1aa518
Hexagon: Typo fix. · cceafb2d
Jyotsna Verma authored May 28, 2013
```
llvm-svn: 182790
```
cceafb2d

[SystemZ] Register compare-and-branch support · 0fb90ab0

Richard Sandiford authored May 28, 2013

This patch adds support for the CRJ and CGRJ instructions.  Support for
the immediate forms will be a separate patch.

The architecture has a large number of comparison instructions.  I think
it's generally better to concentrate on using the "best" comparison
instruction first and foremost, then only use something like CRJ if
CR really was the natural choice of comparison instruction.  The patch
therefore opportunistically converts separate CR and BRC instructions
into a single CRJ while emitting instructions in ISelLowering.

llvm-svn: 182764

0fb90ab0

[SystemZ] Tweak SystemZInstrInfo::isBranch() interface · 53c9efd9
Richard Sandiford authored May 28, 2013
```
This is needed for the upcoming compare-and-branch patch.  No functional
change intended.

llvm-svn: 182762
```
53c9efd9
Make helper functions static. · f30f2cce
Rafael Espindola authored May 27, 2013
```
And remove header and cpp file that are empty after that.

llvm-svn: 182746
```
f30f2cce

May 27, 2013

Convert sqrt functions into sqrt instructions when -ffast-math is in effect. · 048f99de

Preston Gurd authored May 27, 2013

When -ffast-math is in effect (on Linux, at least), clang defines
__FINITE_MATH_ONLY__ > 0 when including <math.h>. This causes the
preprocessor to include <bits/math-finite.h>, which renames the sqrt functions.
For instance, "sqrt" is renamed as "__sqrt_finite". 

This patch adds the 3 new names in such a way that they will be treated
as equivalent to their respective original names.

llvm-svn: 182739

048f99de
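The renaming described in 048f99de above amounts to a small name mapping, sketched here in C++. The function name is hypothetical and this is not the code from the patch. Only `__sqrt_finite` is named in the commit message. The float and long double spellings are assumptions that follow the same glibc naming pattern.

```cpp
#include <string>

// Map a glibc "finite" sqrt alias back to the canonical libm name.
// Returns the input unchanged if it is not one of the aliases.
// Only "__sqrt_finite" appears in the commit message; the other two
// spellings are assumed from the same naming pattern.
inline std::string canonicalSqrtName(const std::string &name) {
  if (name == "__sqrt_finite")  return "sqrt";
  if (name == "__sqrtf_finite") return "sqrtf";
  if (name == "__sqrtl_finite") return "sqrtl";
  return name;
}
```

Once a call is recognised under its canonical name, it can be treated like any other sqrt call and lowered to a sqrt instruction.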

PPC: Add a isConsecutiveLS utility function · 8ebfe6c2

Hal Finkel authored May 27, 2013

isConsecutiveLS is a slightly more general form of
SelectionDAG::isConsecutiveLoad. Aside from also handling stores, it also does
not assume equality of the chain operands is necessary. In the case of the PPC
backend, this chain condition is checked in a more general way by the
surrounding code.

Mostly, this is part of the refactoring in preparation for supporting optimized
unaligned stores.

llvm-svn: 182723

8ebfe6c2

May 26, 2013

Prefer to duplicate PPC Altivec loads when expanding unaligned loads · 7d8a691b

Hal Finkel authored May 26, 2013

When expanding unaligned Altivec loads, we use the decremented offset trick to
prevent page faults. Unfortunately, if we have a sequence of consecutive
unaligned loads, this leads to suboptimal code generation because the 'extra'
load from the first unaligned load can be combined with the base load from the
second (but only if the decremented offset trick is not used for the first).
Search up and down the chain, through loads and token factors, looking for
consecutive loads, and if one is found, don't use the offset reduction trick.
These duplicate loads are later combined to yield the desired sequence (in the
future, we might want a more-powerful chain search, but that will require some
changes to allow the combiner routines to access the AA object).

This should complete the initial implementation of the optimized unaligned
Altivec load expansion. There is some refactoring that should be done, but
that will happen when the unaligned store expansion is added.

llvm-svn: 182719

7d8a691b
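The decremented offset trick mentioned in 7d8a691b above can be sketched with a scalar C++ model. This is not the backend's code. Memory is modelled as a byte vector, and `loadAligned` and `loadUnaligned` are hypothetical helpers. The aligned load ignores the low four address bits, as an Altivec vector load does.

```cpp
#include <array>
#include <cstddef>
#include <cstdint>
#include <vector>

using Vec16 = std::array<std::uint8_t, 16>;

// Aligned 16-byte load: the low four address bits are ignored.
inline Vec16 loadAligned(const std::vector<std::uint8_t> &mem,
                         std::size_t addr) {
  Vec16 v{};
  std::size_t base = addr & ~std::size_t{15};
  for (std::size_t i = 0; i < 16; ++i) v[i] = mem[base + i];
  return v;
}

// Unaligned load built from two aligned loads plus a byte select.
// The second load uses addr + 15 (not addr + 16): when addr is already
// aligned it re-reads the same block instead of touching the next one,
// which might lie on an unmapped page.
inline Vec16 loadUnaligned(const std::vector<std::uint8_t> &mem,
                           std::size_t addr) {
  std::size_t shift = addr & 15;
  Vec16 lo = loadAligned(mem, addr);
  Vec16 hi = loadAligned(mem, addr + 15);
  Vec16 out{};
  for (std::size_t i = 0; i < 16; ++i) {
    std::size_t k = shift + i;  // index into the 32-byte lo:hi pair
    out[i] = k < 16 ? lo[k] : hi[k - 16];
  }
  return out;
}
```

With addr + 16 as the second address, the `hi` block of one load would be exactly the `lo` block of the next consecutive load, which is the sharing opportunity the commit describes.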

May 25, 2013

PPC: Combine duplicate (offset) lvsl Altivec intrinsics · bc2ee4c4

Hal Finkel authored May 25, 2013

The lvsl permutation control instruction is a function only of the alignment of
the pointer operand (relative to the 16-byte natural alignment of Altivec
vectors). As a result, multiple lvsl intrinsics where the operands differ by a
multiple of 16 can be combined.

llvm-svn: 182708

bc2ee4c4
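The property used in bc2ee4c4 above can be checked with a small C++ model of the lvsl result. The function names are hypothetical and this is a sketch rather than the combine itself. It assumes the usual definition in which byte i of the control vector equals the low four address bits plus i.

```cpp
#include <array>
#include <cstdint>

using Vec16 = std::array<std::uint8_t, 16>;

// Model of the lvsl permute control vector: it depends only on the
// low four bits of the address.
inline Vec16 lvslControl(std::uint64_t addr) {
  std::uint8_t sh = static_cast<std::uint8_t>(addr & 0xF);
  Vec16 v{};
  for (std::uint8_t i = 0; i < 16; ++i)
    v[i] = static_cast<std::uint8_t>(sh + i);
  return v;
}

// Two lvsl results are interchangeable when the addresses differ by a
// multiple of 16 (unsigned wrap-around keeps the low bits correct).
inline bool canShareLvsl(std::uint64_t a, std::uint64_t b) {
  return ((a - b) & 0xF) == 0;
}
```

For instance, addresses 0x1003 and 0x1013 produce the same control vector, so one lvsl can serve both.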

Track IR ordering of SelectionDAG nodes 3/4. · e2431c64

Andrew Trick authored May 25, 2013

Remove the old IR ordering mechanism and switch to new one.  Fix unit
test failures.

llvm-svn: 182704

e2431c64

Track IR ordering of SelectionDAG nodes 2/4. · ef9de2a7

Andrew Trick authored May 25, 2013

Change SelectionDAG::getXXXNode() interfaces as well as call sites of
these functions to pass in SDLoc instead of DebugLoc.

llvm-svn: 182703

ef9de2a7