Commits · 36d3ee7c32cfeda4c76cb556781ef113c8463dff · Roger Ferrer / llvm-epi

Aug 14, 2014

Rafael Espindola authored Aug 14, 2014

auroraux.org is not resolving.

I will add this to the release notes as soon as I figure out where to put the
3.6 release notes :-)

llvm-svn: 215645

36d3ee7c

Silencing some -Wcast-qual warnings and removing some C-style casts at the same time. NFC. · 80930af3

Aaron Ballman authored Aug 14, 2014

llvm-svn: 215643

80930af3

Silencing an MSVC C4334 warning ('<<' : result of 32-bit shift implicitly... · 61acc221

Aaron Ballman authored Aug 14, 2014

Silencing an MSVC C4334 warning ('<<' : result of 32-bit shift implicitly converted to 64 bits (was 64-bit shift intended?)). NFC.

llvm-svn: 215642

61acc221

[mips] Improve robustness of some tests. · 726f1ea2

Toma Tabacu authored Aug 14, 2014

Summary:
This is done by removing some hardcoded registers like $at and by no longer expecting a single-digit register to be selected.

Contains work done by Matheus Almeida.

Reviewers: matheusalmeida, dsanders

Reviewed By: dsanders

Subscribers: tomatabacu

Differential Revision: http://reviews.llvm.org/D4227

llvm-svn: 215640

726f1ea2

[x86] Begin stubbing out the AVX support in the new vector shuffle · a8311b36

Chandler Carruth authored Aug 14, 2014

lowering scheme.

Currently, this just directly bails to the fallback path of splitting
the 256-bit vector into two 128-bit vectors, operating there, and then
joining the results back together. While the results are far from
perfect, they are *shockingly* good for what we're doing here. I'll be
layering the rest of the functionality on top of this piece by piece and
updating tests as I go.

Note that 256-bit vectors in this mode are still somewhat WIP. While
I think the code paths that I'm adding here are clean and good-to-go,
there are still a lot of 128-bit assumptions that I'll need to stomp out
as I march through the functional spread here.

llvm-svn: 215637

a8311b36

[mips][microMIPS] MicroMIPS Compact Branch Instructions BEQZC and BNEZC · 73ff9487

Zoran Jovanovic authored Aug 14, 2014

Differential Revision: http://reviews.llvm.org/D3545

llvm-svn: 215636

73ff9487

Make message about building sphinx documentation with CMake more · 992ca1d8

Dan Liew authored Aug 14, 2014

informative by stating where the output is going.

llvm-svn: 215635

992ca1d8

Add SPHINX_WARNINGS_AS_ERRORS CMake option to allow warnings to not be · c2867bab

Dan Liew authored Aug 14, 2014

treated as errors (which is still the default). This is useful when
working on documentation that has existing errors.

llvm-svn: 215634

c2867bab

[mips] Add assembler support for the "la $reg,symbol" pseudo-instruction. · 0d64b20c

Toma Tabacu authored Aug 14, 2014

Summary:
This pseudo-instruction allows the programmer to load an address from a symbolic expression into a register.

Patch by David Chisnall.
His work was sponsored by: DARPA, AFRL

I've made some minor changes to the original, such as improving the formatting and adding some comments, and I've also added a test case.

Reviewers: dsanders

Reviewed By: dsanders

Differential Revision: http://reviews.llvm.org/D4808

llvm-svn: 215630

0d64b20c
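
The commit message above does not say how `la` is expanded. As a rough illustration only, assuming the conventional non-PIC expansion into `lui` plus `addiu` with `%hi`/`%lo` halves, the address split could be sketched as follows. The helper names `split_hi_lo` and `rebuild` are hypothetical and not taken from the patch.

```python
def split_hi_lo(address):
    """Split a 32-bit address into the %hi and %lo halves (assumed lui/addiu form)."""
    address &= 0xFFFFFFFF
    lo = address & 0xFFFF
    # addiu sign-extends its 16-bit immediate, so %hi is rounded up by
    # 0x8000 to compensate when the low half has its top bit set.
    hi = ((address + 0x8000) >> 16) & 0xFFFF
    return hi, lo


def rebuild(hi, lo):
    """Model 'lui reg, hi' followed by 'addiu reg, reg, lo' on a 32-bit register."""
    signed_lo = lo - 0x10000 if lo & 0x8000 else lo
    return ((hi << 16) + signed_lo) & 0xFFFFFFFF
```

For any 32-bit address, `rebuild(*split_hi_lo(address))` should give the address back, including the cases where the low half is `0x8000` or above.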

[mips] Rename [gs]etCanHaveModuleDir to more natural names · cdb45fa3

Daniel Sanders authored Aug 14, 2014

Summary:
getCanHaveModuleDir() is renamed to isModuleDirectiveAllowed(), and
setCanHaveModuleDir() is renamed to forbidModuleDirective() since it is only
ever given a false argument.

Reviewers: vmedic

Reviewed By: vmedic

Subscribers: llvm-commits

Differential Revision: http://reviews.llvm.org/D4885

llvm-svn: 215628

cdb45fa3

[SDAG] Fix a bug in the DAG combiner where we would fail to return the · 7cd15be7

Chandler Carruth authored Aug 14, 2014

input node after manually adding it to the worklist and using CombineTo.

Once we use CombineTo the input node may have been deleted. Despite this
being *completely confusing* and somewhat broken, the only way to
"correctly" return from a DAG combine after potentially deleting the
input node is to return *that exact node*....

But really, this code should just never have used CombineTo. It won't do
what it wants (returning the node as mentioned above just causes the
combine to infloop). The correct way to combine away a casted load to
a load of the correct type is to RAUW the chain directly and then return
the loaded value to replace the actual value node.

I managed to find this with the vector shuffle fuzzer even though it
clearly has nothing at all to do with vector shuffles and rather those
happen to trigger a load of a constant pool that hits this combine *just
right*. I've included the test as it is small and a nice stress test
that the infrastructure isn't asserting.

llvm-svn: 215622

7cd15be7

InstCombine: ((A | ~B) ^ (~A | B)) to A ^ B · 698dca0b

David Majnemer authored Aug 14, 2014

Proof using CVC3 follows:
$ cat t.cvc
A, B : BITVECTOR(32);
QUERY BVXOR((A | ~B),(~A |B)) = BVXOR(A,B);
$ cvc3 t.cvc
Valid.

Patch by Mayur Pandey!

Differential Revision: http://reviews.llvm.org/D4883

llvm-svn: 215621

698dca0b
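
As a cross-check of the CVC3 proof in the commit above, the same identity can be brute-forced at a small bit width. This is only a sketch: the function name `xor_identity_holds` and the default 4-bit width are illustrative choices, not part of the patch.

```python
from itertools import product


def xor_identity_holds(bits=4):
    """Exhaustively check ((A | ~B) ^ (~A | B)) == A ^ B for all values of a given width."""
    mask = (1 << bits) - 1
    for a, b in product(range(mask + 1), repeat=2):
        # Python integers are unbounded, so ~x is masked to the chosen width.
        lhs = (a | (~b & mask)) ^ ((~a & mask) | b)
        if lhs != a ^ b:
            return False
    return True
```

The identity is bitwise, so a check at one width says the same thing as the 32-bit CVC3 query, just less formally.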

AArch64: Silence warning in AArch64FastISel · c307c667

David Majnemer authored Aug 14, 2014

GCC was emitting a signed vs unsigned comparison warning.

llvm-svn: 215620

c307c667

Added InstCombine Transform for ((B | C) & A) | B -> B | (A & C) · f1eda235

David Majnemer authored Aug 14, 2014

Transform ((B | C) & A) | B --> B | (A & C)

Z3 Link: http://rise4fun.com/Z3/hP6p

Patch by Sonam Kumari!

Differential Revision: http://reviews.llvm.org/D4865

llvm-svn: 215619

f1eda235
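
The transform in the commit above can be sanity-checked by exhaustive enumeration at a small bit width. This is a sketch; the function name `or_and_identity_holds` and the 3-bit default are illustrative and do not come from the patch or the Z3 script.

```python
from itertools import product


def or_and_identity_holds(bits=3):
    """Check ((B | C) & A) | B == B | (A & C) for every A, B, C of the given width."""
    values = range(1 << bits)
    return all(
        (((b | c) & a) | b) == (b | (a & c))
        for a, b, c in product(values, repeat=3)
    )
```

Per bit, the reasoning is short: when B is 1 both sides are 1, and when B is 0 both sides reduce to A & C.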

MC: AsmLexer: handle multi-character CommentStrings correctly · bb67af44

Saleem Abdulrasool authored Aug 14, 2014

X86MCAsmInfoDarwin uses '##' as CommentString even though a single '#' starts a
comment, so a workaround for this special case is added.

Fixes divisions in constant expressions for the AArch64 assembler and other
targets which use '//' as CommentString.

Patch by Janne Grunau!

llvm-svn: 215615

bb67af44
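
The idea behind the fix in the commit above can be sketched as follows: a comment starts only where the whole comment string matches, so a lone `/` is left alone and can be lexed as division. This is not the AsmLexer code. The names `starts_comment` and `strip_comment` are hypothetical, and the Darwin '##' workaround is not modeled.

```python
def starts_comment(source, pos, comment_string):
    """Return True only if the full comment string begins at source[pos]."""
    return source.startswith(comment_string, pos)


def strip_comment(line, comment_string="//"):
    """Drop a trailing comment, matching every character of comment_string."""
    for pos in range(len(line)):
        if starts_comment(line, pos, comment_string):
            return line[:pos].rstrip()
    return line
```

With `//` as the comment string, `strip_comment("mov x0, #(8 / 2) // half")` keeps the division and drops only the trailing comment.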

[MCJIT] Support DisableSymbolSearching and InstallLazyFunctionCreator in MCJIT. · ea800ca5

Lang Hames authored Aug 14, 2014

Patch by Anthony Pesch. Thanks Anthony!

llvm-svn: 215613

ea800ca5

[SDAG] Fix a case where we would iteratively legalize a node during · 8039b16d

Chandler Carruth authored Aug 14, 2014

combining by replacing it with something else but not re-process the
node afterward to remove it.

In a truly remarkable stroke of bad luck, this would (in the test case
attached) end up getting some other node combined into it without ever
getting re-processed. By adding it back on to the worklist, in addition
to deleting the dead nodes more quickly we also ensure that if it
*stops* being dead for any reason it makes it back through the
legalizer. Without this, the test case will end up failing during
instruction selection due to an and node with a type we don't have an
instruction pattern for.

It took many million runs of the shuffle fuzz tester to find this.

llvm-svn: 215611

8039b16d

Remove llvm_headers_do_not_build for the benefit of XCode and Visual Studio users. · 5601dc40

Michael J. Spencer authored Aug 14, 2014

llvm-svn: 215610

5601dc40

[X86] Fix the value of the low mask for the lowering of MUL_LOHI for v4i32. · 57fb040b

Quentin Colombet authored Aug 13, 2014

Found by code inspection.

llvm-svn: 215604

57fb040b

[AArch64, fast-isel] Fall back to SelectionDAG to select tail calls. · b74db09c

Akira Hatanaka authored Aug 13, 2014

Certain functions such as objc_autoreleaseReturnValue have to be called as
tail-calls even at -O0. Since normal fast-isel doesn't emit calls as tail calls,
we have to fall back to SelectionDAG to select calls that are marked as tail.

<rdar://problem/17991614>

llvm-svn: 215600

b74db09c

[FastISel][AArch64] Add support for more addressing modes. · 98347d90

Juergen Ributzka authored Aug 13, 2014

FastISel didn't take much advantage of the different addressing modes available
to it on AArch64. This commit allows the ComputeAddress method to recognize more
addressing modes, which allows shifts and sign-/zero-extensions to be folded into
the memory operation itself.

For example:
  lsl x1, x1, #3     --> ldr x0, [x0, x1, lsl #3]
  ldr x0, [x0, x1]

  sxtw x1, w1
  lsl x1, x1, #3     --> ldr x0, [x0, x1, sxtw #3]
  ldr x0, [x0, x1]

llvm-svn: 215597

98347d90
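
To see why the folded form in the commit above is equivalent, the address arithmetic of the second example can be modeled directly. This is a sketch of the arithmetic only, not of FastISel. The helper names are hypothetical and registers are modeled as plain 64-bit integers.

```python
MASK64 = (1 << 64) - 1


def sign_extend_32(value):
    """Interpret the low 32 bits of value as a signed integer (models sxtw)."""
    value &= 0xFFFFFFFF
    return value - (1 << 32) if value & 0x80000000 else value


def addr_separate(x0, x1):
    """Address used by: sxtw x1, w1 ; lsl x1, x1, #3 ; ldr x0, [x0, x1]."""
    x1 = sign_extend_32(x1) & MASK64   # sxtw x1, w1
    x1 = (x1 << 3) & MASK64            # lsl x1, x1, #3
    return (x0 + x1) & MASK64          # [x0, x1]


def addr_folded(x0, x1):
    """Address used by: ldr x0, [x0, x1, sxtw #3]."""
    return (x0 + (sign_extend_32(x1) << 3)) & MASK64
```

Both functions compute the same 64-bit address for any inputs, which is what lets the extend and shift be folded into the load.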

[FastISel][X86] Add large code model support for materializing floating-point constants. · 0f8bc043

Juergen Ributzka authored Aug 13, 2014

In the large code model for X86, floating-point constants are placed in the
constant pool and materialized by loading from it. Since the constant pool
could be far away, a PC-relative load might not work. Therefore we first
materialize the address of the constant pool with a movabsq and then load
the floating-point value from there.

Fixes <rdar://problem/17674628>.

llvm-svn: 215595

0f8bc043

[FastISel][X86] Use XOR to materialize the "0" value. · ba8b79e9

Juergen Ributzka authored Aug 13, 2014

llvm-svn: 215594

ba8b79e9

[FastISel][X86] Emit more efficient instructions for integer constant materialization. · 230494b3

Juergen Ributzka authored Aug 13, 2014

This mostly affects the i64 value type, which always resulted in a 15-byte
movabsq instruction to materialize any constant. The custom code checks the
value of the immediate and tries to use a different and smaller mov
instruction when possible.

This fixes <rdar://problem/17420988>.

llvm-svn: 215593

230494b3
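
The kind of check described in the commit above might look roughly like the sketch below. The exact rules used by the patch are not given in the message, so the selection here is an assumption. It relies only on two general x86-64 facts: a 32-bit mov zeroes the upper half of the register, and the 64-bit mov with a 32-bit immediate sign-extends it. The function name `pick_mov` is hypothetical, and the zero case follows the separate XOR commit (ba8b79e9).

```python
def pick_mov(imm):
    """Pick an instruction for a 64-bit constant (illustrative selection only)."""
    imm &= (1 << 64) - 1
    if imm == 0:
        return "xorl"      # zero is materialized with XOR (see ba8b79e9)
    if imm <= 0xFFFFFFFF:
        return "movl"      # 32-bit mov; the upper 32 bits are zeroed implicitly
    if imm >= (1 << 64) - (1 << 31):
        return "movq"      # 64-bit mov with a sign-extended 32-bit immediate
    return "movabsq"       # full 64-bit immediate needed
```

Small positive values and small negative values both avoid `movabsq` under this scheme; only constants that need more than 32 significant bits fall through to it.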

[FastISel][AArch64] Make use of the zero register when possible. · 24080d60

Juergen Ributzka authored Aug 13, 2014

This change now materializes the value "0" from the zero register.
The zero register can be folded by several instructions, so no
materialization is needed at all.

Fixes <rdar://problem/17924413>.

llvm-svn: 215591

24080d60

[FastISel] Let the target decide first if it wants to materialize a constant. · 7cee768e

Juergen Ributzka authored Aug 13, 2014

This changes the order in which FastISel tries to materialize a constant.
Originally it would try to use a simple target-independent approach, which
can lead to the generation of inefficient code.

On X86 this would result in the use of movabsq to materialize any 64-bit
integer constant - even for simple and small values such as 0 and 1. Some
very funny floating-point materialization could also be observed.

On AArch64 it would materialize the constant 0 in a register even though the
architecture has an actual "zero" register.

On ARM it would generate unnecessary mov instructions or not use mvn.

This change simply changes the order and always asks the target first whether it
wants to materialize the constant. This doesn't fix all the issues
mentioned above, but it enables the targets to implement such
optimizations.

Related to <rdar://problem/17420988>.

llvm-svn: 215588

7cee768e
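
The reordering described in the commit above amounts to "ask the target hook first, then fall back to the generic path". A minimal sketch of that control flow is shown below, with hypothetical callables standing in for the target-specific and target-independent materialization code. None of these names are FastISel APIs.

```python
def materialize_constant(value, target_hook, generic_hook):
    """Ask the target first; use the target-independent path only if it declines."""
    reg = target_hook(value)
    if reg is not None:
        return reg
    return generic_hook(value)


def aarch64_like_target(value):
    """Hypothetical target hook that only knows how to handle zero."""
    return "xzr" if value == 0 else None


def generic(value):
    """Hypothetical target-independent fallback."""
    return f"vreg(mov #{value})"
```

With these stand-ins, `materialize_constant(0, aarch64_like_target, generic)` yields the zero register, while any other value still goes through the generic path.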

[MachineCombiner] Removal of dangling DBG_VALUES after combining [20598] · fe2c11ff

Gerolf Hoflehner authored Aug 13, 2014

This is a cleaner solution to the problem described in r215431.
When instructions are combined a dangling DBG_VALUE is removed.
This resolves bug 20598.

llvm-svn: 215587

fe2c11ff

[FastISel][X86] Refactor constant materialization. NFCI. · 2b98e393

Juergen Ributzka authored Aug 13, 2014

Split the constant materialization code into three separate helper functions for
Integer-, Floating-Point-, and GlobalValue-Constants.

llvm-svn: 215586

2b98e393

Aug 13, 2014

[FastISel][ARM] Use MOVT/MOVW if the subtarget requests it. · a5b08385

Juergen Ributzka authored Aug 13, 2014

This change is also in preparation for a future change to make sure that
the constant materialization uses MOVT/MOVW when available and not a load
from the constant pool.

llvm-svn: 215584

a5b08385

[FastISel][ARM] Fix a bug in the integer materialization code. · 2cbcf7aa

Juergen Ributzka authored Aug 13, 2014

getRegClassFor returns the incorrect register class when in Thumb2 mode.
This fix simply manually selects the register class as in the code just a few
lines above.

There is no test case for this code, because the code is currently
unreachable. This will be changed in a future commit and existing test
cases will exercise this code.

llvm-svn: 215583

2cbcf7aa

[FastISel][AArch64] Cleanup constant materialization code. NFCI. · 5ae43a13

Juergen Ributzka authored Aug 13, 2014

Cleanup and prepare constant materialization code for future commits.

llvm-svn: 215582

5ae43a13

[Cleanup] Utility function to erase instruction and mark DBG_Values · caa8bfd1

Gerolf Hoflehner authored Aug 13, 2014

New function to erase a machine instruction and mark DBG_VALUE
for removal. A DBG_VALUE is marked for removal when it references
an operand defined in the instruction.
Use the new function to clean up code in the dead machine instruction
removal pass.

llvm-svn: 215580

caa8bfd1

[MachineDominatorTree] Provide a method to inform a MachineDominatorTree that a · abea99f6

Quentin Colombet authored Aug 13, 2014

critical edge has been split. The MachineDominatorTree will then lazily update the
underlying dominance properties when required.

** Context **

This is a follow-up of r215410.
Each time a critical edge is split this invalidates the dominator tree
information. Thus, subsequent queries of that interface will be slow until the
underlying information is actually recomputed (costly).

** Problem **

Prior to this patch, splitting a critical edge needed to query the dominator
tree to update the dominator information.
Therefore, splitting a bunch of critical edges will likely produce poor
performance as each query to the dominator tree will use the slow query path.
This happens a lot in passes like MachineSink and PHIElimination.

** Proposed Solution **

Splitting a critical edge is a local modification of the CFG. Moreover, as soon
as a critical edge is split, it is not critical anymore and thus cannot be a
candidate for critical edge splitting anymore. In other words, the predecessor
and successor of a basic block inserted on a critical edge cannot be inserted by
critical edge splitting.

Using these observations, we can pile up the splitting of critical edges and
apply them at once before updating the DT information.

The core of this patch moves the update of the MachineDominatorTree information
from MachineBasicBlock::SplitCriticalEdge to a lazy MachineDominatorTree.

** Performance **

Thanks to this patch, the motivating example compiles in 4- minutes instead of
6+ minutes. No test case added, as the motivating example has nothing special but
being huge!

The binaries are strictly identical for all the llvm test-suite + SPECs with and
without this patch for both Os and O3.

Regarding compile time, I observed only noise, although on average I saw a
small improvement.

<rdar://problem/17894619>

llvm-svn: 215576

abea99f6
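
The batching scheme described in the commit above can be sketched as a toy dominator tree that records splits and applies them only when a query arrives. This is an illustration under simplifying assumptions, not the MachineDominatorTree code. The class and method names are hypothetical, the tree is a plain block-to-idom dictionary, and the CFG is a predecessor map.

```python
class LazyDomTree:
    """Toy dominator tree that defers updates for split critical edges."""

    def __init__(self, idom, preds):
        self.idom = dict(idom)                      # block -> immediate dominator (root -> None)
        self.preds = {b: list(p) for b, p in preds.items()}
        self.pending = []                           # (from, to, new) splits not yet applied

    def record_split(self, frm, to, new):
        """Update the CFG now, but only remember the dominance update for later."""
        self.preds[to] = [new if p == frm else p for p in self.preds[to]]
        self.preds[new] = [frm]
        self.pending.append((frm, to, new))

    def _dominates(self, a, b):
        while b is not None:
            if a == b:
                return True
            b = self.idom[b]
        return False

    def _apply_splits(self):
        new_blocks = {new for _, _, new in self.pending}
        becomes_idom = []
        # Phase 1: decide, against the old tree, whether each new block
        # becomes the immediate dominator of its successor.
        for frm, to, new in self.pending:
            ok = True
            for pred in self.preds[to]:
                if pred == new:
                    continue
                if pred in new_blocks:
                    # A freshly inserted block is not in the tree yet; its single
                    # predecessor is an original block, so check that instead.
                    pred = self.preds[pred][0]
                if not self._dominates(to, pred):
                    ok = False
                    break
            becomes_idom.append(ok)
        # Phase 2: apply all collected updates at once.
        for (frm, to, new), ok in zip(self.pending, becomes_idom):
            self.idom[new] = frm
            if ok:
                self.idom[to] = new
        self.pending.clear()

    def dominates(self, a, b):
        """Query entry point: flush pending splits first, then answer."""
        if self.pending:
            self._apply_splits()
        return self._dominates(a, b)
```

Any number of `record_split` calls can be made back to back without touching the tree; the cost of updating is paid once, at the first `dominates` query.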

Fix (re-)creation of unittest lit.site.cfg for clang-tools-extra. · 46d1544d

Benjamin Kramer authored Aug 13, 2014

This has been hiding really well. Hopefully brings the builders suffering from
outdated lit.site.cfg files back to life.

llvm-svn: 215575

46d1544d

utils: Fix segfault in flattencfg · 0cd3ec6c

Jan Vesely authored Aug 13, 2014



v2: continue iterating through the rest of the bb
    use for loop

v3: initialize FlattenCFG pass in ScalarOps
    add test

v4: split off initializing flattencfg to a separate patch
    add comment

Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu>
llvm-svn: 215574

0cd3ec6c

Initialize FlattenCFG pass · 5a956d49

Jan Vesely authored Aug 13, 2014

Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu>
llvm-svn: 215573

5a956d49

Simplify memory ownership with std::unique_ptr. · bb415eac

Rafael Espindola authored Aug 13, 2014

llvm-svn: 215567

bb415eac

Simplify ownership with std::unique_ptr. NFC. · 5f2bb7d9

Rafael Espindola authored Aug 13, 2014

llvm-svn: 215566

5f2bb7d9

R600: Correctly set the src value offset for scalarized kernel args · 74ef2777

Matt Arsenault authored Aug 13, 2014

This for some reason fixes v1i64 kernel arguments on pre-SI. This
currently breaks some other cases in the kernel-args.ll test for R600,
but I'm not particularly confident in the new output. VTX_READ_* are not
used for some of the scalarized cases, and the code reading from the
constant buffer doesn't make much sense to me.

llvm-svn: 215564

74ef2777

Canonicalize header guards into a common format. · a7c40ef0

Benjamin Kramer authored Aug 13, 2014

Add header guards to files that were missing guards. Remove #endif comments
as they don't seem common in LLVM (we can easily add them back if we decide
they're useful).

Changes made by clang-tidy with minor tweaks.

llvm-svn: 215558

a7c40ef0