  1. Sep 19, 2013
• Revert "Encapsulate PassManager debug flags to avoid static init and cxa_exit." · b5e1e6cc
      Andrew Trick authored
      Working on a better solution to this.
      
      This reverts commit 7d4e9934e7ca83094c5cf41346966c8350179ff2.
      
      llvm-svn: 190990
• Encapsulate PassManager debug flags to avoid static init and cxa_exit. · f33d6df8
      Andrew Trick authored
      This puts all the global PassManager debugging flags, like
      -print-after-all and -time-passes, behind a managed static. This
      eliminates their static initializers and, more importantly, exit-time
      destructors.
      
      The only behavioral change I anticipate is that tools need to
      initialize the PassManager before parsing the command line in order to
      export these options, which makes sense. Tools that already initialize
      the standard passes (opt/llc) don't need to do anything new.
      
      llvm-svn: 190974
• whitespace · dc073add
      Andrew Trick authored
      llvm-svn: 190973
• Fix two issues regarding GOT pointer (GP) setup. · d6aadc79
      Reed Kotler authored
1) Make sure that the first two instructions of the sequence cannot be
separated from each other; the linker requires that they be sequential.
If they are separated, the code can still work, but not in all cases:
the first instruction mostly involves the hi part of the pc-relative
offset, and that part changes slowly, so you would have to be at just
the right boundary for this to matter.
2) Make sure that this sequence begins on a longword boundary.
There appears to be a bug in binutils that makes some of these calculations
go wrong if the instruction sequence does not begin on a longword
boundary. This is being investigated with the appropriate binutils folks.
      
      llvm-svn: 190966
• Debug info: Get rid of the VLA indirection hack in FastISel. · 262bcf45
      Adrian Prantl authored
      Use the DIVariable::isIndirect() flag set by the frontend instead of
      guessing whether to set the machine location's indirection bit.
      Paired commit with CFE.
      
      llvm-svn: 190961
  2. Sep 18, 2013
  3. Sep 17, 2013
• COFF: Emit all MCSymbols rather than filtering out some of them · 3ea536fe
      Reid Kleckner authored
      In particular, this means we emit non-external symbols defined to
      variables, such as aliases or absolute addresses.
      
      This is needed to implement /safeseh, and it appears there was some
      confusion about what symbols to emit previously.
      
      llvm-svn: 190888
• COFF: Remove ExportSection, which has been dead since r114823 · 50689eb9
      Reid Kleckner authored
      llvm-svn: 190887
• Move variable into assert to avoid unused variable warning. · e7af7bd8
      Eric Christopher authored
      llvm-svn: 190886
• Cleanup handling of constant function casts. · e6952f28
      Matt Arsenault authored
Some of this code is no longer necessary since int<->ptr casts no
longer occur as of r187444.
      
      This also fixes handling vectors of pointers, and adds a bunch of new
      testcases for vectors and address spaces.
      
      llvm-svn: 190885
• [PowerPC] Add a FIXME. · bdae03f2
      Bill Schmidt authored
      Documenting a design choice to generate only medium model sequences for TLS
      addresses at this time.  Small and large code models could be supported if
      necessary.
      
      llvm-svn: 190883
• [PowerPC] Fix problems with large code model (PR17169). · bb381d70
      Bill Schmidt authored
      Large code model on PPC64 requires creating and referencing TOC entries when
      using the addis/ld form of addressing.  This was not being done in all cases.
      The changes in this patch to PPCAsmPrinter::EmitInstruction() fix this.  Two
      test cases are also modified to reflect this requirement.
      
      Fast-isel was not creating correct code for loading floating-point constants
      using large code model.  This also requires the addis/ld form of addressing.
      Previously we were using the addis/lfd shortcut which is only applicable to
      medium code model.  One test case is modified to reflect this requirement.
      
      llvm-svn: 190882
• Costmodel: Add support for horizontal vector reductions · cae8735a
      Arnold Schwaighofer authored
      Upcoming SLP vectorization improvements will want to be able to estimate costs
      of horizontal reductions. Add infrastructure to support this.
      
      We model reductions as a series of (shufflevector,add) tuples ultimately
      followed by an extractelement. For example, for an add-reduction of <4 x float>
      we could generate the following sequence:
      
       (v0, v1, v2, v3)
         \   \  /  /
           \  \  /
             +  +
      
       (v0+v2, v1+v3, undef, undef)
          \      /
       ((v0+v2) + (v1+v3), undef, undef)
      
       %rdx.shuf = shufflevector <4 x float> %rdx, <4 x float> undef,
                                 <4 x i32> <i32 2, i32 3, i32 undef, i32 undef>
       %bin.rdx = fadd <4 x float> %rdx, %rdx.shuf
       %rdx.shuf7 = shufflevector <4 x float> %bin.rdx, <4 x float> undef,
                                <4 x i32> <i32 1, i32 undef, i32 undef, i32 undef>
       %bin.rdx8 = fadd <4 x float> %bin.rdx, %rdx.shuf7
       %r = extractelement <4 x float> %bin.rdx8, i32 0
      
      This commit adds a cost model interface "getReductionCost(Opcode, Ty, Pairwise)"
      that will allow clients to ask for the cost of such a reduction (as backends
      might generate more efficient code than the cost of the individual instructions
summed up). This interface is exercised by the CostModel analysis pass, which
looks for reduction patterns like the one above (starting at extractelements)
and calls the cost model interface when it sees a matching sequence.
      
      We will also support a second form of pairwise reduction that is well supported
      on common architectures (haddps, vpadd, faddp).
      
       (v0, v1, v2, v3)
        \   /    \  /
       (v0+v1, v2+v3, undef, undef)
          \     /
       ((v0+v1)+(v2+v3), undef, undef, undef)
      
        %rdx.shuf.0.0 = shufflevector <4 x float> %rdx, <4 x float> undef,
              <4 x i32> <i32 0, i32 2 , i32 undef, i32 undef>
        %rdx.shuf.0.1 = shufflevector <4 x float> %rdx, <4 x float> undef,
              <4 x i32> <i32 1, i32 3, i32 undef, i32 undef>
        %bin.rdx.0 = fadd <4 x float> %rdx.shuf.0.0, %rdx.shuf.0.1
        %rdx.shuf.1.0 = shufflevector <4 x float> %bin.rdx.0, <4 x float> undef,
              <4 x i32> <i32 0, i32 undef, i32 undef, i32 undef>
        %rdx.shuf.1.1 = shufflevector <4 x float> %bin.rdx.0, <4 x float> undef,
              <4 x i32> <i32 1, i32 undef, i32 undef, i32 undef>
        %bin.rdx.1 = fadd <4 x float> %rdx.shuf.1.0, %rdx.shuf.1.1
        %r = extractelement <4 x float> %bin.rdx.1, i32 0
      
      llvm-svn: 190876
• SLPVectorizer: Don't vectorize phi nodes that use invoke values · 4a3dcaa1
      Arnold Schwaighofer authored
      We can't insert an insertelement after an invoke. We would have to split a
      critical edge. So when we see a phi node that uses an invoke we just give up.
      
      radar://14990770
      
      llvm-svn: 190871
• [InstCombiner] Slice a big load in two loads when the elements are next to each other in memory. · b8d672ef
Quentin Colombet authored
      
      The motivation was to get rid of truncate and shift right instructions that get
      in the way of paired load or floating point load.
Consider the following example:
      struct Complex {
        float real;
        float imm;
      };
      
When accessing a complex, llvm was generating a 64-bit load, and the imm field
was obtained by a trunc(lshr) sequence, resulting in poor code generation, at
least for x86.

The idea is to declare that two load instructions are the canonical form for
loading two arithmetic types that are next to each other in memory.
      
      Two scalar loads at a constant offset from each other are pretty
      easy to detect for the sorts of passes that like to mess with loads. 
      
      <rdar://problem/14477220>
      
      llvm-svn: 190870