Commits · df7e7fb64296519a8a340b31eb0da9b3b67b4cb1 · Roger Ferrer / llvm-epi-0.8

Jul 30, 2013

Fix underscore to be the proper length. · c02da467
Bill Wendling authored Jul 30, 2013
```
llvm-svn: 187406
```
c02da467

[ARM] check bitwidth in PerformORCombine · 0c2ee5a2

Saleem Abdulrasool authored Jul 30, 2013



When simplifying a (or (and B A) (and C ~A)) to a (VBSL A B C) ensure that the
bitwidth of the second operands to both ands match before comparing the negation
of the values.

Split the check of the value of the second operands to the ands.  Move the cast
and variable declaration slightly higher to make it slightly easier to follow.

Bug-Id: 16700
Signed-off-by: Saleem Abdulrasool <compnerd@compnerd.org>
llvm-svn: 187404

0c2ee5a2

Remove more dead documentation. · d20830df
Rafael Espindola authored Jul 30, 2013
```
llvm-svn: 187403
```
d20830df
[Sparc] Use call's debugloc for the unimp instruction. · fdcc498a
Venkatraman Govindaraju authored Jul 30, 2013
```
llvm-svn: 187402
```
fdcc498a

[PowerPC] Skeletal FastISel support for 64-bit PowerPC ELF. · 0cf702fa

Bill Schmidt authored Jul 30, 2013

This is the first of many upcoming patches for PowerPC fast
instruction selection support.  This patch implements the minimum
necessary for a functional (but extremely limited) FastISel pass.  It
allows the table-generated portions of the selector to be created and
used, but in most cases selection will fall back to the DAG selector.
None of the block terminator instructions are implemented yet, and
most interesting instructions require some special handling.
Therefore there aren't any new test cases with this patch.  There will
be quite a few tests coming with future patches.

This patch adds the make/CMake support for the new code (including
tablegen -gen-fast-isel) and creates the FastISel object for PPC64 ELF
only.  It instantiates the necessary virtual functions
(TargetSelectInstruction, TargetMaterializeConstant,
TargetMaterializeAlloca, tryToFoldLoadIntoMI, and FastLowerArguments),
but of these, only TargetMaterializeConstant contains any useful
implementation.  This is present since the table-generated code
requires the ability to materialize integer constants for some
instructions.

This patch has been tested by building and running the
projects/test-suite code with -O0.  All tests passed with the
exception of a couple of long-running tests that time out using -O0
code generation.

llvm-svn: 187399

0cf702fa

[R600] Replicate old DAGCombiner behavior in target specific DAG combine. · e2e0548d

Quentin Colombet authored Jul 30, 2013

build_vector is lowered to REG_SEQUENCE, which is something the register
allocator does a good job at optimizing.

llvm-svn: 187397

e2e0548d

[DAGCombiner] insert_vector_elt: Avoid building a vector twice. · 6bf4baa4

Quentin Colombet authored Jul 30, 2013

This patch prevents the following combine when the input vector is used more
than once.
insert_vector_elt (build_vector elt0, ..., eltN), NewEltIdx, idx
=>
build_vector elt0, ..., NewEltIdx, ..., eltN 

The reasons are:
- Building a vector may be expensive, so try to reuse the existing part of a
  vector instead of creating a new one (think big vectors).
- elt0 to eltN now have two users instead of one. This may prevent some other
  optimizations.

llvm-svn: 187396

6bf4baa4

Move file to X86 and add a triple to fix darwin bots for now. · 4ed04e2e

Eric Christopher authored Jul 30, 2013

The problem is due to the section name being explicitly mentioned in
the IR and differing between the two platforms.

llvm-svn: 187394

4ed04e2e

Fix a truly egregious thinko in anonymous namespace check, · e414ece7

Eric Christopher authored Jul 29, 2013

update testcase to make sure we generate debug info for walrus
by adding a non-trivial constructor and verify that we don't
emit an ODR signature for the type.

llvm-svn: 187393

e414ece7

Make sure we don't emit an ODR hash for types with no name and make · d853ea31
Eric Christopher authored Jul 29, 2013
```
sure the comments for each testcase are a bit easier to distinguish.

llvm-svn: 187392
```
d853ea31
Clarify comments for types contained in anonymous namespaces and · 32d1531a
Eric Christopher authored Jul 29, 2013
```
odr hashes.

llvm-svn: 187391
```
32d1531a
Elaborate a bit on the type unit and ODR conditional code. · f8542ec3
Eric Christopher authored Jul 29, 2013
```
llvm-svn: 187385
```
f8542ec3

Jul 29, 2013

Make file_status::getUniqueID const. · d123099a
Rafael Espindola authored Jul 29, 2013
```
llvm-svn: 187383
```
d123099a
Delete documentation for deleted options. · cb87e35e
Rafael Espindola authored Jul 29, 2013
```
llvm-svn: 187380
```
cb87e35e
Include st_dev to make the result of getUniqueID actually unique. · 7f822a93
Rafael Espindola authored Jul 29, 2013
```
This will let us use getUniqueID instead of st_dev directly on clang.

llvm-svn: 187378
```
7f822a93
Debug Info: enable verifier for testing cases. · 620e978f
Manman Ren authored Jul 29, 2013
```
llvm-svn: 187375
```
620e978f
[mips] Add comment and simplify function. · 52dd808b
Akira Hatanaka authored Jul 29, 2013
```
llvm-svn: 187371
```
52dd808b
Add the C source code to the test to make it easier to update when debug info changes. · 16e9dd4d
Nadav Rotem authored Jul 29, 2013
```
Thanks Eric.

llvm-svn: 187368
```
16e9dd4d
SLPVectorier: update the debug location for the new instructions. · d9c74cc6
Nadav Rotem authored Jul 29, 2013
```
llvm-svn: 187363
```
d9c74cc6
Debug Info: update testing cases to pass verifier. · e9a52e18
Manman Ren authored Jul 29, 2013
```
llvm-svn: 187362
```
e9a52e18

Use proper section suffix for COFF weak symbols · 7fdaee8f

Nico Rieck authored Jul 29, 2013

32-bit symbols have "_" as global prefix, but when forming the name of
COMDAT sections this prefix is ignored. The current behavior assumes that
this prefix is always present which is not the case for 64-bit and names
are truncated.

llvm-svn: 187356

7fdaee8f

Proper va_arg/va_copy lowering on win64 · 06d17c80

Nico Rieck authored Jul 29, 2013

Win64 uses CharPtrBuiltinVaList instead of X86_64ABIBuiltinVaList like
other 64-bit targets.

llvm-svn: 187355

06d17c80

Re-application of 187310. Re-enabling warning C4275 for MSVC 11 and up, but... · f5aacd34

Aaron Ballman authored Jul 29, 2013

Re-application of 187310.  Re-enabling warning C4275 for MSVC 11 and up, but not MSVC 10 since it is still required there.

llvm-svn: 187354

f5aacd34

Add support for the 's' operation to llvm-ar. · b6b5f52e

Rafael Espindola authored Jul 29, 2013

If no other operation is specified, 's' becomes an operation instead of an
modifier. The s operation just creates a symbol table. It is the same as
running ranlib.

We assume the archive was created by a sane ar (like llvm-ar or gnu ar) and
if the symbol table is present, then it is current. We use that to optimize
the most common case: a broken build system that thinks it has to run ranlib.

llvm-svn: 187353

b6b5f52e

MC: Support larger COFF string tables · 2c9c89b2

Nico Rieck authored Jul 29, 2013

Single-slash encoded entries do not require a terminating null. This bumps
the maximum table size from ~1MB to ~9.5MB.

llvm-svn: 187352

2c9c89b2

ExceptionDemo.cpp: Tweak a @param. [-Wdocumentation] · 1373b743
NAKAMURA Takumi authored Jul 29, 2013
```
llvm-svn: 187351
```
1373b743
Some Intel Penryn CPUs come with SSE4 disabled. Detect them as core 2. · fb34989a
Benjamin Kramer authored Jul 29, 2013
```
PR16721.

llvm-svn: 187350
```
fb34989a

Allow generation of vmla.f32 instructions when targeting Cortex-A15. The patch... · 91ddaa1b

Silviu Baranga authored Jul 29, 2013

Allow generation of vmla.f32 instructions when targeting Cortex-A15. The patch also adds the VFP4 feature to Cortex-A15 and fixes the DontUseFusedMAC predicate so that we can still generate vmla.f32 instructions on non-darwin targets with VFP4.

llvm-svn: 187349

91ddaa1b

test commit · 862b0451
Robert Lytton authored Jul 29, 2013
```
llvm-svn: 187348
```
862b0451

Teach the AllocaPromoter which is wrapped around the SSAUpdater · cd7c8cdf

Chandler Carruth authored Jul 29, 2013

infrastructure to do promotion without a domtree the same smarts about
looking through GEPs, bitcasts, etc., that I just taught mem2reg about.
This way, if SROA chooses to promote an alloca which still has some
noisy instructions this code can cope with them.

I've not used as principled of an approach here for two reasons:
1) This code doesn't really need it as we were already set up to zip
   through the instructions used by the alloca.
2) I view the code here as more of a hack, and hopefully a temporary one.

The SSAUpdater path in SROA is a real sore point for me. It doesn't make
a lot of architectural sense for many reasons:
- We're likely to end up needing the domtree anyways in a subsequent
  pass, so why not compute it earlier and use it.
- In the future we'll likely end up needing the domtree for parts of the
  inliner itself.
- If we need to we could teach the inliner to preserve the domtree. Part
  of the re-work of the pass manager will allow this to be very powerful
  even in large SCCs with many functions.
- Ultimately, computing a domtree has gotten significantly faster since
  the original SSAUpdater-using code went into ScalarRepl. We no longer
  use domfrontiers, and much of domtree is lazily done based on queries
  rather than eagerly.
- At this point keeping the SSAUpdater-based promotion saves a total of
  0.7% on a build of the 'opt' tool for me. That's not a lot of
  performance given the complexity!

So I'm leaving this a bit ugly in the hope that eventually we just
remove all of this nonsense.

I can't even readily test this because this code isn't reachable except
through SROA. When I re-instate the patch that fast-tracks allocas
already suitable for promotion, I'll add a testcase there that failed
before this change. Before that, SROA will fix any test case I give it.

llvm-svn: 187347

cd7c8cdf

Don't vectorize when the attribute NoImplicitFloat is used. · 750e42cb
Nadav Rotem authored Jul 29, 2013
```
llvm-svn: 187340
```
750e42cb
Fix -Wdocumentation warnings. · caa776be
Rafael Espindola authored Jul 28, 2013
```
llvm-svn: 187336
```
caa776be

Update comments for SSAUpdater to use the modern doxygen comment · 6b55dbea

Chandler Carruth authored Jul 28, 2013

standards for LLVM. Remove duplicated comments on the interface from the
implementation file (implementation comments are left there of course).
Also clean up, re-word, and fix a few typos and errors in the commenst
spotted along the way.

This is in preparation for changes to these files and to keep the
uninteresting tidying in a separate commit.

llvm-svn: 187335

6b55dbea

Jul 28, 2013

Remove use of sprintf added to X86 disassembler tablegen code. Send message... · 9469e906

Craig Topper authored Jul 28, 2013

Remove use of sprintf added to X86 disassembler tablegen code. Send message with instruction name to errs() instead and use a generic message for the llvm_unreachable. Consistent with other places in this file.

llvm-svn: 187333

9469e906

Partial revert of 187310; it seems MSVC 10 still spits out this warning, but MSVC 11 does not. · d1594bda
Aaron Ballman authored Jul 28, 2013
```
llvm-svn: 187331
```
d1594bda
Temporarily revert r187323 until I update SSAUpdater to match mem2reg. · d31370e0
Chandler Carruth authored Jul 28, 2013
```
I forgot that we had two totally independent things here. :: sigh ::

llvm-svn: 187327
```
d31370e0
fixed compilation issue · baf51e3e
Elena Demikhovsky authored Jul 28, 2013
```
llvm-svn: 187325
```
baf51e3e
Added encoding prefixes for KNL instructions (EVEX). · 003e7d73
Elena Demikhovsky authored Jul 28, 2013
```
Added 512-bit operands printing.
Added instruction formats for KNL instructions.

llvm-svn: 187324
```
003e7d73

Now that mem2reg understands how to cope with a slightly wider set of · 9d96100f

Chandler Carruth authored Jul 28, 2013

uses of an alloca, we can pre-compute promotability while analyzing an
alloca for splitting in SROA. That lets us short-circuit the common case
of a bunch of trivially promotable allocas. This cuts 20% to 30% off the
run time of SROA for typical frontend-generated IR sequneces I'm seeing.
It gets the new SROA to within 20% of ScalarRepl for such code. My
current benchmark for these numbers is PR15412, but it fits the general
pattern of IR emitted by Clang so it should be widely applicable.

llvm-svn: 187323

9d96100f

Thread DataLayout through the callers and into mem2reg. This will be · d5b806a2

Chandler Carruth authored Jul 28, 2013

useful in a subsequent patch, but causes an unfortunate amount of noise,
so I pulled it out into a separate patch.

llvm-svn: 187322

d5b806a2