Commits · 774fe2e29a21815d7d107c00dc52a37eb91ecbc5 · Roger Ferrer / llvm-epi-0.8

Jun 03, 2013
- Sparc: When storing 0, use %g0 directly in the store instruction instead of · 774fe2e2
  Venkatraman Govindaraju authored Jun 03, 2013
```
       using two instructions (sethi and store).

llvm-svn: 183090
```
  774fe2e2
Jun 02, 2013
- Sparc: Combine add/or/sethi instruction with restore if possible. · 0bbe1b21
  Venkatraman Govindaraju authored Jun 02, 2013
```
llvm-svn: 183088
```
  0bbe1b21
- Sparc: Perform leaf procedure optimization by default · 3e8c7d98
  Venkatraman Govindaraju authored Jun 02, 2013
```
llvm-svn: 183083
```
  3e8c7d98
Jun 01, 2013
- When determining the new index for an insertelement, we may not assume that an · 3f715e26
  Nick Lewycky authored Jun 01, 2013
```
index greater than the size of the vector is invalid. The shuffle may be
shrinking the size of the vector. Fixes a crash!

Also drop the maximum recursion depth of the safety check for this
optimization to five.

llvm-svn: 183080
```
  3f715e26
- Sparc: Mark functions calling llvm.vastart and llvm.returnaddress intrinsics as non-leaf functions. · 28e2cd0e
  Venkatraman Govindaraju authored Jun 01, 2013
```
llvm-svn: 183079
```
  28e2cd0e
- SimplifyCFG: Fix typo in comment for ComputeSpeculationCost · 91142c48
  David Majnemer authored Jun 01, 2013
```
llvm-svn: 183078
```
  91142c48
- Move getRealLinkageName to a common place and remove all the duplicates of it. · 7c275640
  Benjamin Kramer authored Jun 01, 2013
```
Also simplify code a bit while there. No functionality change.

llvm-svn: 183076
```
  7c275640
- Move object construction into [] so the temporary can be moved. · 320682fe
  Benjamin Kramer authored Jun 01, 2013
```
No functionality change.

llvm-svn: 183075
```
  320682fe
- APInt: Simplify code. No functionality change. · b565f899
  Benjamin Kramer authored Jun 01, 2013
```
llvm-svn: 183073
```
  b565f899
- APFloat: Use isDenormal instead of hand-rolled code to check for denormals. · 6bef24f3
  Benjamin Kramer authored Jun 01, 2013
```
llvm-svn: 183072
```
  6bef24f3
- Revert r183069: "TMP: LEA64_32r fixing" · 339bf154
  Tim Northover authored Jun 01, 2013
```
Very sorry, it was committed from the wrong branch by mistake.

llvm-svn: 183070
```
  339bf154
- TMP: LEA64_32r fixing · 57954f04
  Tim Northover authored Jun 01, 2013
```
llvm-svn: 183069
```
  57954f04
- X86: change MOV64ri64i32 into MOV32ri64 · 3a1fd4c0
  Tim Northover authored Jun 01, 2013
```
The MOV64ri64i32 instruction required hacky MCInst lowering because it
was allocated as setting a GR64, but the eventual instruction ("movl")
only set a GR32. This converts it into a so-called "MOV32ri64" which
still accepts a (appropriate) 64-bit immediate but defines a GR32.
This is then converted to the full GR64 by a SUBREG_TO_REG operation,
thus keeping everyone happy.

This fixes a typo in the opcode field of the original patch, which
should make the legact JIT work again (& adds test for that problem).

llvm-svn: 183068
```
  3a1fd4c0
- [Sparc] Generate correct code for leaf functions with stack objects · 3521dcdc
  Venkatraman Govindaraju authored Jun 01, 2013
```
llvm-svn: 183067
```
  3521dcdc
- Make SubRegIndex size mandatory, following r183020. · b1a4d9da
  Ahmed Bougacha authored May 31, 2013
```
This also makes TableGen able to compute sizes/offsets of synthesized
indices representing tuples.

llvm-svn: 183061
```
  b1a4d9da
- Prevent loop-unroll from making assumptions about undefined behavior. · ee9143ac
  Andrew Trick authored May 31, 2013
```
Fixes rdar:14036816, PR16130.

There is an opportunity to compute precise trip counts for 'or'
expressions and multi-exit loops.
rdar:14038809: Optimize trip count computation for multi-exit loops.

To do this we need to record the fact that ExitLimit assumes NSW. When
it does not we can safely assume that the loop trip count is the
minimum ExitLimt across all subexpressions and loop exits.

llvm-svn: 183060
```
  ee9143ac
- Temporarily Revert "X86: change MOV64ri64i32 into MOV32ri64" as it · e1e57e5e
  Eric Christopher authored May 31, 2013
```
seems to have caused PR16192 and other JIT related failures.

llvm-svn: 183059
```
  e1e57e5e
- Const-ify some printing and dumping code for DIEValues. · 65ac02ad
  Eric Christopher authored May 31, 2013
```
llvm-svn: 183057
```
  65ac02ad
- Add support for adding the contents of a StringRef to the MD5 hash. · 1ec87e8b
  Eric Christopher authored May 31, 2013
```
llvm-svn: 183054
```
  1ec87e8b
- Convert more unsigned char -> uint8_t. · 85bd745e
  Eric Christopher authored May 31, 2013
```
llvm-svn: 183053
```
  85bd745e
- Fix comment. · d0910436
  Eric Christopher authored May 31, 2013
```
llvm-svn: 183052
```
  d0910436
- Move "unsigned char" -> "uint8_t". · 606ecda4
  Eric Christopher authored May 31, 2013
```
llvm-svn: 183051
```
  606ecda4
May 31, 2013

LoopVectorize: Change API call to get the backedge taken count · 7b1b4db3

Arnold Schwaighofer authored May 31, 2013

Use ScalarEvolution's getBackedgeTakenCount API instead of getExitCount since
that is really what we want to know. Using the more specific getExitCount was
safe because we made sure that there is only one exiting block.

No functionality change.

llvm-svn: 183047

7b1b4db3

Loop Strength Reduce: Scaling factor cost. · bf490d4a

Quentin Colombet authored May 31, 2013

Account for the cost of scaling factor in Loop Strength Reduce when rating the
formulae. This uses a target hook.

The default implementation of the hook is: if the addressing mode is legal, the
scaling factor is free.

<rdar://problem/13806271>

llvm-svn: 183045

bf490d4a

Rename COFFYaml.h to COFFYAML.h for consistency. · 3f1c99a6
Rafael Espindola authored May 31, 2013
```
llvm-svn: 183042
```
3f1c99a6
Don't allocate temporary string for section data. · a3310e0b
Rafael Espindola authored May 31, 2013
```
llvm-svn: 183040
```
a3310e0b

LoopVectorize: PHIs with only outside users should prevent vectorization · 70a9be52

Arnold Schwaighofer authored May 31, 2013

We check that instructions in the loop don't have outside users (except if
they are reduction values). Unfortunately, we skipped this check for
if-convertable PHIs.

Fixes PR16184.

llvm-svn: 183035

70a9be52

NVPTX: Don't even create a regalloc if we're not going to use it. · fae7ff12
Benjamin Kramer authored May 31, 2013
```
Fixes a leak found by valgrind.

llvm-svn: 183031
```
fae7ff12

Modify how the formulae are rated in Loop Strength Reduce. · 8aa7abe2

Quentin Colombet authored May 31, 2013

Namely, check if the target allows to fold more that one register in the
addressing mode and if yes, adjust the cost accordingly.

Prior to this commit, reg1 + scale * reg2 accesses were artificially preferred
to reg1 + reg2 accesses. Indeed, the cost model wrongly assumed that reg1 + reg2
needs a temporary register for the computation, whereas it was correctly
estimated for reg1 + scale * reg2.

<rdar://problem/13973908>

llvm-svn: 183021

8aa7abe2

Add a way to define the bit range covered by a SubRegIndex. · f1ed334d

Ahmed Bougacha authored May 31, 2013

NOTE: If this broke your out-of-tree backend, in *RegisterInfo.td, change
the instances of SubRegIndex that have a comps template arg to use the
ComposedSubRegIndex class instead.

In TableGen land, this adds Size and Offset attributes to SubRegIndex,
and the ComposedSubRegIndex class, for which the Size and Offset are
computed by TableGen. This also adds an accessor in MCRegisterInfo, and
Size/Offsets for the X86 and ARM subreg indices.

llvm-svn: 183020

f1ed334d

Remove useless code from transitioning to new EH scheme · e1823b6b

Kai Nacke authored May 31, 2013

Removes all uses of the variable UsesNewEH. Simply return false in case that no
resume instructions were found.

llvm-svn: 183016

e1823b6b

ARM: permit upper-case BE/LE on setend instruction · 4d141440
Tim Northover authored May 31, 2013
```
Patch by Amaury de la Vieuville.

llvm-svn: 183012
```
4d141440

ARM: add fstmx and fldmx instructions for assembly · 4173e29a

Tim Northover authored May 31, 2013

These instructions are deprecated oddities, but we still need to be able to
disassemble (and reassemble) them if and when they're encountered.

Patch by Amaury de la Vieuville.

llvm-svn: 183011

4173e29a

Simplify multiplications by vectors whose elements are powers of 2. · 65281bf3
Rafael Espindola authored May 31, 2013
```
Patch by Andrea Di Biagio.

llvm-svn: 183005
```
65281bf3

ARM: fix VEXT encoding corner case · 1bb672da

Tim Northover authored May 31, 2013

The disassembly of VEXT instructions was too lax in the bits checked. This
fixes the case where the instruction affects Q-registers but a misaligned lane
was specified (should be UNDEFINED).

Patch by Amaury de la Vieuville

llvm-svn: 183003

1bb672da

[SystemZ] Don't use LOAD and STORE REVERSED for volatile accesses · 30efd87f

Richard Sandiford authored May 31, 2013

Unlike most -- hopefully "all other", but I'm still checking -- memory
instructions we support, LOAD REVERSED and STORE REVERSED may access
the memory location several times.  This means that they are not suitable
for volatile loads and stores.

This patch is a prerequisite for better atomic load and store support.
The same principle applies there: almost all memory instructions we
support are inherently atomic ("block concurrent"), but LOAD REVERSED
and STORE REVERSED are exceptions.

Other instructions continue to allow volatile operands.  I will add
positive "allows volatile" tests at the same time as the "allows atomic
load or store" tests.

llvm-svn: 183002

30efd87f

[NVPTX] Re-enable support for virtual registers in the final output · dbb3b2f4

Justin Holewinski authored May 31, 2013

Now that 3.3 is branched, we are re-enabling virtual registers to help
iron out bugs before the next release. Some of the post-RA passes do
not play well with virtual registers, so we disable them for now. The
needed functionality of the PrologEpilogInserter pass is copied to a
new backend-specific NVPTXPrologEpilog pass.

The test for this commit is not breaking the existing tests.

llvm-svn: 182998

dbb3b2f4

[msan] Handle mixed track-origins and keep-going settings (llvm part). · 888385e4

Evgeniy Stepanov authored May 31, 2013

Before this change, each module defined a weak_odr global __msan_track_origins
with a value of 1 if origin tracking is enabled, 0 if disabled. If there are
modules with different values, any of them may win. If 0 wins, and there is at
least one module with 1, the program will most likely crash.

With this change, __msan_track_origins is only emitted if origin tracking is
on. Then runtime library detects if there is at least one module with origin
tracking, and enables runtime support for it.

llvm-svn: 182997

888385e4

X86: change MOV64ri64i32 into MOV32ri64 · d4736d67

Tim Northover authored May 31, 2013

The MOV64ri64i32 instruction required hacky MCInst lowering because it was
allocated as setting a GR64, but the eventual instruction ("movl") only set a
GR32. This converts it into a so-called "MOV32ri64" which still accepts a
(appropriate) 64-bit immediate but defines a GR32. This is then converted to
the full GR64 by a SUBREG_TO_REG operation, thus keeping everyone happy.

llvm-svn: 182991

d4736d67

Fix ScalarEvolution::ComputeExitLimitFromCond for 'or' conditions. · 5b245a16

Andrew Trick authored May 31, 2013

Fixes PR16130 - clang produces incorrect code with loop/expression at -O2.

This is a 2+ year old bug that's now holding up the release. It's a
case where we knowingly made aggressive assumptions about undefined
behavior. These assumptions are wrong when SCEV is computing a
subexpression that does not directly control the branch. With this
fix, we avoid making assumptions in those cases but still optimize the
common case. SCEV's trip count computation for exits controlled by
'or' expressions is now analagous to the trip count computation for
loops with multiple exits. I had already fixed the multiple exit case
to be conservative.

llvm-svn: 182989

5b245a16