Commits · c16f5dca27c693f934b12c948b7d53e98ce9111d · Roger Ferrer / llvm-epi

Mar 20, 2015
- Use the cached subtarget on the MachineFunction when the AsmPrinter · d83003ea
  Eric Christopher authored Mar 19, 2015
```
will have a MachineFunction, i.e. in places other than the module
level doInitialize/doFinalize.

llvm-svn: 232783
```
  d83003ea
- Use the cached subtarget off of the machine function. · 7585fb2d
  Eric Christopher authored Mar 19, 2015
```
llvm-svn: 232782
```
  7585fb2d
- move insert, extract, concat helper functions closer to related helper functions; NFCI · 803fb7c8
  Sanjay Patel authored Mar 19, 2015
```
llvm-svn: 232781
```
  803fb7c8
Mar 19, 2015

Fix a nasty bug in DAGCombine of STORE nodes. · db420123

Owen Anderson authored Mar 19, 2015

This is very related to the bug fixed in r174431.  The problem is that
SelectionDAG does not include alignment in the uniquing of loads and
stores.  When an otherwise no-op DAGCombine would increase the alignment
of a load or store, the original node would be returned (with the
alignment increased), which would cause the node not to be processed by
any further DAGCombines.

I don't have a direct testcase for this that manifests on an in-tree
target, but I did see some noise in the tests for other targets and have
updated them for it.

llvm-svn: 232780

db420123

Remove unused headers. · cf7b5f5f
Eric Christopher authored Mar 19, 2015
```
llvm-svn: 232777
```
cf7b5f5f

Add an MCSubtargetInfo variable to the TargetMachine. · 12cf76fe

Eric Christopher authored Mar 19, 2015

This enables us to remove calls to the subtarget from the TargetMachine
and with a small hack for backends that require global subtarget
information for module level code generation, e.g. mips abi flags, as
mentioned in a fixme in the code.

llvm-svn: 232776

12cf76fe

Add a TargetMachine local MCRegisterInfo and MCInstrInfo so that · 72e23a21
Eric Christopher authored Mar 19, 2015
```
they can be used without a subtarget in constructing subtarget
independent passes.

llvm-svn: 232775
```
72e23a21

WinEH: Make llvm.eh.actions emission match the EH docs · c759fe90

Reid Kleckner authored Mar 19, 2015

This switches the sense of the i32 values and updates the test cases.

We can also use CHECK-SAME to clean up some tests, and reduce the visual
noise from bitcasts.

llvm-svn: 232774

c759fe90

[X86, AVX] use blends instead of insert128 with index 0 · d5c2d287

Sanjay Patel authored Mar 19, 2015

Another case of x86-specific shuffle strength reduction:
avoid generating insert*128 instructions with index 0 because
they are slower than their non-lane-changing blend equivalents.

Shuffle lowering already catches most of these cases, but
the zero vector case and some other paths such as in the
modified test in vector-shuffle-256-v32.ll were getting
through.

Differential Revision: http://reviews.llvm.org/D8366

llvm-svn: 232773

d5c2d287

Verifier: Remove the separate -verify-di pass · ab58a568

Duncan P. N. Exon Smith authored Mar 19, 2015

Remove `DebugInfoVerifierLegacyPass` and the `-verify-di` pass.
Instead, call into the `DebugInfoVerifier` from inside
`VerifierLegacyPass::finalizeModule()`.  This better matches the logic
in `verifyModule()` (used by the new PassManager), avoids requiring two
separate passes to verify the IR, and makes the API for "add a pass to
verify the IR" simple.

Note: the `-verify-debug-info` flag still works (for now, at least;
eventually it might make sense to just remove it).

llvm-svn: 232772

ab58a568

Fix build failure. · 78240e06
Peter Collingbourne authored Mar 19, 2015
```
llvm-svn: 232771
```
78240e06

LowerBitSets: Avoid reusing byte set addresses. · 994ba3d2

Peter Collingbourne authored Mar 19, 2015

Each use of the byte array uses a different alias. This makes the
backend less likely to reuse previously computed byte array addresses,
improving the security of the CFI mechanism based on this pass.

Differential Revision: http://reviews.llvm.org/D8455

llvm-svn: 232770

994ba3d2

libLTO, llvm-lto, gold: Introduce flag for controlling optimization level. · 070843d6

Peter Collingbourne authored Mar 19, 2015

This change also introduces a link-time optimization level of 1. This
optimization level runs only the globaldce pass as well as cleanup passes for
passes that run at -O0, specifically simplifycfg which cleans up lowerbitsets.

http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20150316/266951.html

llvm-svn: 232769

070843d6

PassManagerBuilder: Remove effectively dead 'StripDebug' option · 0a93e2db

Duncan P. N. Exon Smith authored Mar 19, 2015

`StripDebug` was only used by tools/opt/opt.cpp in
`AddStandardLinkPasses()`, but opt.cpp adds the same pass based on its
command-line flag before it calls `AddStandardLinkPasses()`.  Stripping
debug info twice isn't very useful.

llvm-svn: 232765

0a93e2db

Switch lowering: extract NextBlock function. NFC. · b4db1420
Hans Wennborg authored Mar 19, 2015
```
llvm-svn: 232759
```
b4db1420
Unxfail test/CodeGen/Generic/vector.ll now passing on Hexagon · 02fd2945
Krzysztof Parzyszek authored Mar 19, 2015
```
llvm-svn: 232758
```
02fd2945
gold: Make powerpc support optional for the tests. · 152b9366
Peter Collingbourne authored Mar 19, 2015
```
Differential Revision: http://reviews.llvm.org/D8400

llvm-svn: 232744
```
152b9366

GlobalDCE: Improve performance for large modules containing comdats. · 0dbc7088

Peter Collingbourne authored Mar 19, 2015

When we encounter a global with a comdat, rather than iterating over
every global in the module to find globals in the same comdat, store the
members in a multimap. This effectively lowers the complexity to O(N log N),
improving performance significantly for large modules such as might be
encountered during LTO.

It looks like we used to do something like this until r219191.

No functional change.

Differential Revision: http://reviews.llvm.org/D8431

llvm-svn: 232743

0dbc7088

docs: Update llvm-cov docs for the -use-color flag · 7bf61d31
Justin Bogner authored Mar 19, 2015
```
llvm-svn: 232742
```
7bf61d31

Add support for __nvvm_reflect changes in libdevice in CUDA-7.0 · 9e8a0393

Artem Belevich authored Mar 19, 2015

Summary:
CUDA 7.0's libdevice uses slightly different IR to call __nvvm_reflect
and that triggers an assertion in nvvm_reflect optimization pass. This
change allows nvvm_reflect pass to deal with both old and new ways to
pass an argument to __nvvm_reflect.

Test Plan: ninja check-all

Reviewers: eliben, echristo

Subscribers: jholewinski, llvm-commits

Differential Revision: http://reviews.llvm.org/D8399

llvm-svn: 232732

9e8a0393

Fixing dependencies for native tablegen. · ca4f699c

Chris Bieneman authored Mar 19, 2015

The dependencies for cross-built tablegen were a bit confused. This fixes that. The following dependencies are now enforced:

(1) Tablegen tasks depend on the native tablegen
(2) Native tablegen depends on the cross-compiled tablegen

Although the native tablegen doesn't actually require the cross tablegen, having this dependency forces the native tablegen to rebuild whenever the cross tablegen changes.

llvm-svn: 232730

ca4f699c

Switch lowering: remove unnecessary ConstantInt casts. NFC. · 78325438
Hans Wennborg authored Mar 19, 2015
```
llvm-svn: 232729
```
78325438
[Hexagon] Add support for vector instructions · 42113347
Krzysztof Parzyszek authored Mar 19, 2015
```
llvm-svn: 232728
```
42113347

[CMake] Don't pass in MSVC warning flags as definitions. · 8aa3000d

Greg Bedwell authored Mar 19, 2015

NFC currently but required as a prerequisite for using
the Microsoft resource compiler in conjunction with
CMake's ninja generator, which knows how to filter flags
appropriately, but not definitions.

Differential Revision: http://reviews.llvm.org/D8188

llvm-svn: 232727

8aa3000d

[Hexagon] ENDLOOP is a non-reversible conditional branch · c6f19333
Krzysztof Parzyszek authored Mar 19, 2015
```
llvm-svn: 232725
```
c6f19333
Internalize PEI. NFC. · 717e973a
Benjamin Kramer authored Mar 19, 2015
```
llvm-svn: 232722
```
717e973a
[sparc] Small fix to r232719 to make 2007-12-17-InvokeAsm.ll pass on the buildbot. · b1fbacab
Daniel Sanders authored Mar 19, 2015
```
llvm-svn: 232720
```
b1fbacab

[sparc] Only support the 'm' inline assembly memory constraint. NFC. · f5d11100

Daniel Sanders authored Mar 19, 2015

Summary:
SPARC doesn't seem to support any additional constraints. Therefore remove
the target hook.

No functional change intended.

Reviewers: venkatra

Subscribers: llvm-commits

Differential Revision: http://reviews.llvm.org/D8214

llvm-svn: 232719

f5d11100

[InstCombine] Don't fold a GEP into itself through a PHI node · 5add63f2

Daniel Jasper authored Mar 19, 2015

This can only occur (I think) through the back-edge of the loop.

However, folding a GEP into itself means that the value of the previous
iteration needs to be stored in the meantime, thus requiring an
additional register variable to be live, but not actually achieving
anything (the gep still needs to be executed once per loop iteration).

The attached test case is derived from:
  typedef unsigned uint32;
  typedef unsigned char uint8;
  inline uint8 *f(uint32 value, uint8 *target) {
    while (value >= 0x80) {
      value >>= 7;
      ++target;
    }
    ++target;
    return target;
  }
  uint8 *g(uint32 b, uint8 *target) {
    target = f(b, f(42, target));
    return target;
  }

What happens is that the GEP stored in incptr2 is folded into itself
through the loop's back-edge and the phi-node stored in loopptr,
effectively incrementing the ptr by "2" in each iteration instead of "1".

In this case, it is actually increasing the number of GEPs required as
the GEP before the loop can't be folded away anymore. For comparison:

With this patch:
  define i8* @test4(i32 %value, i8* %buffer) {
  entry:
    %cmp = icmp ugt i32 %value, 127
    br i1 %cmp, label %loop.header, label %exit

  loop.header:                                      ; preds = %entry
    br label %loop.body

  loop.body:                                        ; preds = %loop.body, %loop.header
    %buffer.pn = phi i8* [ %buffer, %loop.header ], [ %loopptr, %loop.body ]
    %newval = phi i32 [ %value, %loop.header ], [ %shr, %loop.body ]
    %loopptr = getelementptr inbounds i8, i8* %buffer.pn, i64 1
    %shr = lshr i32 %newval, 7
    %cmp2 = icmp ugt i32 %newval, 16383
    br i1 %cmp2, label %loop.body, label %loop.exit

  loop.exit:                                        ; preds = %loop.body
    br label %exit

  exit:                                             ; preds = %loop.exit, %entry
    %0 = phi i8* [ %loopptr, %loop.exit ], [ %buffer, %entry ]
    %incptr3 = getelementptr inbounds i8, i8* %0, i64 2
    ret i8* %incptr3
  }

Without this patch:
  define i8* @test4(i32 %value, i8* %buffer) {
  entry:
    %incptr = getelementptr inbounds i8, i8* %buffer, i64 1
    %cmp = icmp ugt i32 %value, 127
    br i1 %cmp, label %loop.header, label %exit

  loop.header:                                      ; preds = %entry
    br label %loop.body

  loop.body:                                        ; preds = %loop.body, %loop.header
    %0 = phi i8* [ %buffer, %loop.header ], [ %loopptr, %loop.body ]
    %loopptr = phi i8* [ %incptr, %loop.header ], [ %incptr2, %loop.body ]
    %newval = phi i32 [ %value, %loop.header ], [ %shr, %loop.body ]
    %shr = lshr i32 %newval, 7
    %incptr2 = getelementptr inbounds i8, i8* %0, i64 2
    %cmp2 = icmp ugt i32 %newval, 16383
    br i1 %cmp2, label %loop.body, label %loop.exit

  loop.exit:                                        ; preds = %loop.body
    br label %exit

  exit:                                             ; preds = %loop.exit, %entry
    %ptr2 = phi i8* [ %incptr2, %loop.exit ], [ %incptr, %entry ]
    %incptr3 = getelementptr inbounds i8, i8* %ptr2, i64 1
    ret i8* %incptr3
  }

Review: http://reviews.llvm.org/D8245
llvm-svn: 232718

5add63f2

llvm-cov: Rename -color={always|never} to -use-color[=0] · 9deb1d40

Justin Bogner authored Mar 19, 2015

This is an ugly hack to fix the configure --enable-shared build. It
turns out that *every cl::opt in LLVM* shows up in *every tool* in
that configuration, which is hopelessly broken. This skirts around the
issue by not colliding with another option's name, for now.

I've also simplified the option implementation - the other "color"
option used cl::boolOrDefault and was much nicer than what I'd written
before.

llvm-svn: 232704

9deb1d40

Note that we don't support COFF on PPC. · 7f45440b
Rafael Espindola authored Mar 19, 2015
```
Should bring back the windows bots.

llvm-svn: 232701
```
7f45440b

llvm-cov: Continue trying to appease a bot · efa42782

Justin Bogner authored Mar 19, 2015

This bot doesn't like me. I don't know why:

    http://lab.llvm.org:8011/builders/clang-hexagon-elf/builds/24425

Move the color option enum's definition out of the function that
creates the cl::opt.

llvm-svn: 232700

efa42782

Split the object streamer callback in one per file format. · cd584a80

Rafael Espindola authored Mar 19, 2015

There are two main advantages to doing this

* Targets that only need to handle one of the formats specially don't have
  to worry about the others. For example, x86 now only registers a
  constructor for the COFF streamer.

* Changes to the arguments passed to one format constructor will not impact
  the other formats.

llvm-svn: 232699

cd584a80

llvm-cov: Try to appease a bot · 58e2492f

Justin Bogner authored Mar 19, 2015

The clang-hexagon elf bot was complaining that "Option 'color'
registered more than once!":

    http://lab.llvm.org:8011/builders/clang-hexagon-elf/builds/24425

I don't understand why this error is happening, and I don't see it on
any other bots or on my own machine, so I'm kind of grasping at
straws. Try using an unscoped enum and specifying a cl::init to see if
they help.

llvm-svn: 232698

58e2492f

SelectionDAGBuilder: update comment in HandlePHINodesInSuccessorBlocks. · 5b64657e

Hans Wennborg authored Mar 19, 2015

From what I can tell, the code is checking for PHIs that expect any value from
this block, not just constants.

llvm-svn: 232697

5b64657e

Do not track subregister liveness when it brings no benefits · a25e13aa

Matthias Braun authored Mar 19, 2015

Some subregisters are only to indicate different access sizes, while not
providing any way to actually divide the register up into multiple
disjunct parts. Avoid tracking subregister liveness in these cases as it
is not beneficial.

Differential Revision: http://reviews.llvm.org/D8429

llvm-svn: 232695

a25e13aa

llvm-cov: Only emit colour by default if the output is a tty · cfb53e49

Justin Bogner authored Mar 19, 2015

This replaces the -no-color flag with a -color={auto|always|never}
option, with auto as the default, which is much saner.

llvm-svn: 232693

cfb53e49

SelectionDAGIsel: Fix comment about terminators being "handled below". · 81cfeb19
Hans Wennborg authored Mar 19, 2015
```
That changed in r102128.

llvm-svn: 232692
```
81cfeb19
[CodeGenPrepare] Remove broken, dead, code. · 7bdd50d2
Quentin Colombet authored Mar 18, 2015
```
NFC.

llvm-svn: 232690
```
7bdd50d2
two or more, use a for. · 69244c3e
Rafael Espindola authored Mar 18, 2015
```
llvm-svn: 232688
```
69244c3e