Commits · 339bf154ccded0ecc22f12bbee88cf435d5b3f27 · Roger Ferrer / llvm-epi-0.8

Jun 01, 2013

Revert r183069: "TMP: LEA64_32r fixing" · 339bf154
Tim Northover authored Jun 01, 2013
```
Very sorry, it was committed from the wrong branch by mistake.

llvm-svn: 183070
```
339bf154
TMP: LEA64_32r fixing · 57954f04
Tim Northover authored Jun 01, 2013
```
llvm-svn: 183069
```
57954f04

X86: change MOV64ri64i32 into MOV32ri64 · 3a1fd4c0

Tim Northover authored Jun 01, 2013

The MOV64ri64i32 instruction required hacky MCInst lowering because it
was allocated as setting a GR64, but the eventual instruction ("movl")
only set a GR32. This converts it into a so-called "MOV32ri64" which
still accepts a (appropriate) 64-bit immediate but defines a GR32.
This is then converted to the full GR64 by a SUBREG_TO_REG operation,
thus keeping everyone happy.

This fixes a typo in the opcode field of the original patch, which
should make the legact JIT work again (& adds test for that problem).

llvm-svn: 183068

3a1fd4c0

[Sparc] Generate correct code for leaf functions with stack objects · 3521dcdc
Venkatraman Govindaraju authored Jun 01, 2013
```
llvm-svn: 183067
```
3521dcdc

Make SubRegIndex size mandatory, following r183020. · b1a4d9da

Ahmed Bougacha authored May 31, 2013

This also makes TableGen able to compute sizes/offsets of synthesized
indices representing tuples.

llvm-svn: 183061

b1a4d9da

Prevent loop-unroll from making assumptions about undefined behavior. · ee9143ac

Andrew Trick authored May 31, 2013

Fixes rdar:14036816, PR16130.

There is an opportunity to compute precise trip counts for 'or'
expressions and multi-exit loops.
rdar:14038809: Optimize trip count computation for multi-exit loops.

To do this we need to record the fact that ExitLimit assumes NSW. When
it does not we can safely assume that the loop trip count is the
minimum ExitLimt across all subexpressions and loop exits.

llvm-svn: 183060

ee9143ac

Temporarily Revert "X86: change MOV64ri64i32 into MOV32ri64" as it · e1e57e5e
Eric Christopher authored May 31, 2013
```
seems to have caused PR16192 and other JIT related failures.

llvm-svn: 183059
```
e1e57e5e
Const-ify some printing and dumping code for DIEValues. · 65ac02ad
Eric Christopher authored May 31, 2013
```
llvm-svn: 183057
```
65ac02ad
Add support for adding the contents of a StringRef to the MD5 hash. · 1ec87e8b
Eric Christopher authored May 31, 2013
```
llvm-svn: 183054
```
1ec87e8b
Convert more unsigned char -> uint8_t. · 85bd745e
Eric Christopher authored May 31, 2013
```
llvm-svn: 183053
```
85bd745e
Fix comment. · d0910436
Eric Christopher authored May 31, 2013
```
llvm-svn: 183052
```
d0910436
Move "unsigned char" -> "uint8_t". · 606ecda4
Eric Christopher authored May 31, 2013
```
llvm-svn: 183051
```
606ecda4

May 31, 2013

LoopVectorize: Change API call to get the backedge taken count · 7b1b4db3

Arnold Schwaighofer authored May 31, 2013

Use ScalarEvolution's getBackedgeTakenCount API instead of getExitCount since
that is really what we want to know. Using the more specific getExitCount was
safe because we made sure that there is only one exiting block.

No functionality change.

llvm-svn: 183047

7b1b4db3

Loop Strength Reduce: Scaling factor cost. · bf490d4a

Quentin Colombet authored May 31, 2013

Account for the cost of scaling factor in Loop Strength Reduce when rating the
formulae. This uses a target hook.

The default implementation of the hook is: if the addressing mode is legal, the
scaling factor is free.

<rdar://problem/13806271>

llvm-svn: 183045

bf490d4a

Rename COFFYaml.h to COFFYAML.h for consistency. · 3f1c99a6
Rafael Espindola authored May 31, 2013
```
llvm-svn: 183042
```
3f1c99a6
Don't allocate temporary string for section data. · a3310e0b
Rafael Espindola authored May 31, 2013
```
llvm-svn: 183040
```
a3310e0b

LoopVectorize: PHIs with only outside users should prevent vectorization · 70a9be52

Arnold Schwaighofer authored May 31, 2013

We check that instructions in the loop don't have outside users (except if
they are reduction values). Unfortunately, we skipped this check for
if-convertable PHIs.

Fixes PR16184.

llvm-svn: 183035

70a9be52

NVPTX: Don't even create a regalloc if we're not going to use it. · fae7ff12
Benjamin Kramer authored May 31, 2013
```
Fixes a leak found by valgrind.

llvm-svn: 183031
```
fae7ff12

Modify how the formulae are rated in Loop Strength Reduce. · 8aa7abe2

Quentin Colombet authored May 31, 2013

Namely, check if the target allows to fold more that one register in the
addressing mode and if yes, adjust the cost accordingly.

Prior to this commit, reg1 + scale * reg2 accesses were artificially preferred
to reg1 + reg2 accesses. Indeed, the cost model wrongly assumed that reg1 + reg2
needs a temporary register for the computation, whereas it was correctly
estimated for reg1 + scale * reg2.

<rdar://problem/13973908>

llvm-svn: 183021

8aa7abe2

Add a way to define the bit range covered by a SubRegIndex. · f1ed334d

Ahmed Bougacha authored May 31, 2013

NOTE: If this broke your out-of-tree backend, in *RegisterInfo.td, change
the instances of SubRegIndex that have a comps template arg to use the
ComposedSubRegIndex class instead.

In TableGen land, this adds Size and Offset attributes to SubRegIndex,
and the ComposedSubRegIndex class, for which the Size and Offset are
computed by TableGen. This also adds an accessor in MCRegisterInfo, and
Size/Offsets for the X86 and ARM subreg indices.

llvm-svn: 183020

f1ed334d

Remove useless code from transitioning to new EH scheme · e1823b6b

Kai Nacke authored May 31, 2013

Removes all uses of the variable UsesNewEH. Simply return false in case that no
resume instructions were found.

llvm-svn: 183016

e1823b6b

ARM: permit upper-case BE/LE on setend instruction · 4d141440
Tim Northover authored May 31, 2013
```
Patch by Amaury de la Vieuville.

llvm-svn: 183012
```
4d141440

ARM: add fstmx and fldmx instructions for assembly · 4173e29a

Tim Northover authored May 31, 2013

These instructions are deprecated oddities, but we still need to be able to
disassemble (and reassemble) them if and when they're encountered.

Patch by Amaury de la Vieuville.

llvm-svn: 183011

4173e29a

Simplify multiplications by vectors whose elements are powers of 2. · 65281bf3
Rafael Espindola authored May 31, 2013
```
Patch by Andrea Di Biagio.

llvm-svn: 183005
```
65281bf3

ARM: fix VEXT encoding corner case · 1bb672da

Tim Northover authored May 31, 2013

The disassembly of VEXT instructions was too lax in the bits checked. This
fixes the case where the instruction affects Q-registers but a misaligned lane
was specified (should be UNDEFINED).

Patch by Amaury de la Vieuville

llvm-svn: 183003

1bb672da

[SystemZ] Don't use LOAD and STORE REVERSED for volatile accesses · 30efd87f

Richard Sandiford authored May 31, 2013

Unlike most -- hopefully "all other", but I'm still checking -- memory
instructions we support, LOAD REVERSED and STORE REVERSED may access
the memory location several times.  This means that they are not suitable
for volatile loads and stores.

This patch is a prerequisite for better atomic load and store support.
The same principle applies there: almost all memory instructions we
support are inherently atomic ("block concurrent"), but LOAD REVERSED
and STORE REVERSED are exceptions.

Other instructions continue to allow volatile operands.  I will add
positive "allows volatile" tests at the same time as the "allows atomic
load or store" tests.

llvm-svn: 183002

30efd87f

[NVPTX] Re-enable support for virtual registers in the final output · dbb3b2f4

Justin Holewinski authored May 31, 2013

Now that 3.3 is branched, we are re-enabling virtual registers to help
iron out bugs before the next release. Some of the post-RA passes do
not play well with virtual registers, so we disable them for now. The
needed functionality of the PrologEpilogInserter pass is copied to a
new backend-specific NVPTXPrologEpilog pass.

The test for this commit is not breaking the existing tests.

llvm-svn: 182998

dbb3b2f4

[msan] Handle mixed track-origins and keep-going settings (llvm part). · 888385e4

Evgeniy Stepanov authored May 31, 2013

Before this change, each module defined a weak_odr global __msan_track_origins
with a value of 1 if origin tracking is enabled, 0 if disabled. If there are
modules with different values, any of them may win. If 0 wins, and there is at
least one module with 1, the program will most likely crash.

With this change, __msan_track_origins is only emitted if origin tracking is
on. Then runtime library detects if there is at least one module with origin
tracking, and enables runtime support for it.

llvm-svn: 182997

888385e4

X86: change MOV64ri64i32 into MOV32ri64 · d4736d67

Tim Northover authored May 31, 2013

The MOV64ri64i32 instruction required hacky MCInst lowering because it was
allocated as setting a GR64, but the eventual instruction ("movl") only set a
GR32. This converts it into a so-called "MOV32ri64" which still accepts a
(appropriate) 64-bit immediate but defines a GR32. This is then converted to
the full GR64 by a SUBREG_TO_REG operation, thus keeping everyone happy.

llvm-svn: 182991

d4736d67

Fix ScalarEvolution::ComputeExitLimitFromCond for 'or' conditions. · 5b245a16

Andrew Trick authored May 31, 2013

Fixes PR16130 - clang produces incorrect code with loop/expression at -O2.

This is a 2+ year old bug that's now holding up the release. It's a
case where we knowingly made aggressive assumptions about undefined
behavior. These assumptions are wrong when SCEV is computing a
subexpression that does not directly control the branch. With this
fix, we avoid making assumptions in those cases but still optimize the
common case. SCEV's trip count computation for exits controlled by
'or' expressions is now analagous to the trip count computation for
loops with multiple exits. I had already fixed the multiple exit case
to be conservative.

llvm-svn: 182989

5b245a16

[mips] Big-endian code generation for atomic instructions. · 2bf97336
Akira Hatanaka authored May 31, 2013
```
Patch by Jyun-Yan You.

llvm-svn: 182984
```
2bf97336
Reapply with r182909 with a fix to the calculation of the new indices for · a2b77206
Nick Lewycky authored May 31, 2013
```
insertelement instructions.

llvm-svn: 182976
```
a2b77206
Remove debug print added in r182949. · fb5138bd
Ahmed Bougacha authored May 30, 2013
```
llvm-svn: 182973
```
fb5138bd

May 30, 2013

Revert r182937 and r182877. · 99bd2ae4

Rafael Espindola authored May 30, 2013

r182877 broke MCJIT tests on ARM and r182937 was working around another failure
by r182877.

This should make the ARM bots green.

llvm-svn: 182960

99bd2ae4

Use the const_cast only where necessary. · a800f955
Bill Wendling authored May 30, 2013
```
llvm-svn: 182950
```
a800f955

MCObjectSymbolizer: Switch from IntervalMap to sorted vector, following r182625. · 75633ba1

Ahmed Bougacha authored May 30, 2013

This removes the need for the missing SectionRef operator< workaround, and fixes
an IntervalMap assert about alignment on MSVC.

llvm-svn: 182949

75633ba1

Implement IEEE-754R 2008 nextUp/nextDown functions in the guise of the... · 0c622ea8

Michael Gottesman authored May 30, 2013

Implement IEEE-754R 2008 nextUp/nextDown functions in the guise of the function APFloat::next(bool nextDown).

rdar://13852078

llvm-svn: 182945

0c622ea8

X86: use sub-register sequences for MOV*r0 operations · 64ec0ff4

Tim Northover authored May 30, 2013

Instead of having a bunch of separate MOV8r0, MOV16r0, ... pseudo-instructions,
it's better to use a single MOV32r0 (which will expand to "xorl %reg, %reg")
and obtain other sizes with EXTRACT_SUBREG and SUBREG_TO_REG. The encoding is
smaller and partial register updates can sometimes be avoided.

Until recently, this sequence was a barrier to rematerialization though. That
should now be fixed so it's an appropriate time to make the change.

llvm-svn: 182928

64ec0ff4

Fix rematerialization into physical registers. · 69cd121d

Tim Northover authored May 30, 2013

r182872 introduced a bug in how the register-coalescer's rematerialization
handled defining a physical register. It relied on the output of the
coalescer's setRegisters method to determine whether the replacement
instruction needed an implicit-def. However, this value isn't necessarily the
same as the CopyMI's actual destination register which is what the rest of the
basic-block expects us to be defining.

The commit changes the rematerializer to use the actual register attached to
CopyMI in its decision.

This will be tested soon by an X86 patch which moves everything to using
MOV32r0 instead of other sizes.

llvm-svn: 182925

69cd121d

[NVPTX] Fix case where a sext load of an i1 type may produce an · 994d66a3
Justin Holewinski authored May 30, 2013
```
ld.u1 instead of an ld.u8.

llvm-svn: 182924
```
994d66a3