Commits · aafe0918bc34ec7045002f90da1b70a3e10ed644 · Roger Ferrer / llvm-epi-0.8

Jun 29, 2012

Move llvm/Support/IRBuilder.h -> llvm/IRBuilder.h · aafe0918

Chandler Carruth authored Jun 29, 2012

This was always part of the VMCore library out of necessity -- it deals
entirely in the IR. The .cpp file in fact was already part of the VMCore
library. This is just a mechanical move.

I've tried to go through and re-apply the coding standard's preferred
header sort, but at 40-ish files, I may have gotten some wrong. Please
let me know if so.

I'll be committing the corresponding updates to Clang and Polly, and
Duncan has DragonEgg.

Thanks to Bill and Eric for giving the green light for this bit of cleanup.

llvm-svn: 159421

aafe0918

The DIBuilder class is just a wrapper around debug info creation · f799efde
Bill Wendling authored Jun 29, 2012
```
(a.k.a. MDNodes). The module doesn't belong in Analysis. Move it to the VMCore
instead.

llvm-svn: 159414
```
f799efde
make simplifyCFG erase invokes to readonly/readnone functions · b97a4e8b
Nuno Lopes authored Jun 28, 2012
```
llvm-svn: 159385
```
b97a4e8b
make instcombine produce calls to llvm.donothing instead of a random intrinsic · 9ac4661a
Nuno Lopes authored Jun 28, 2012
```
llvm-svn: 159384
```
9ac4661a

Jun 28, 2012

[asan] set a hard limit on the number of instructions instrumented pear each... · c387ca7b

Kostya Serebryany authored Jun 28, 2012

[asan] set a hard limit on the number of instructions instrumented pear each BB. This is (hopefully temporary) workaround for PR13225 

llvm-svn: 159344

c387ca7b

Precompute SCEV pointer analysis prior to instruction fusion in BBVectorize. · 918ca2b8

Hal Finkel authored Jun 28, 2012

When both a load/store and its address computation are being vectorized, it can
happen that the address-computation vectorization destroys SCEV's ability
to analyize the relative pointer offsets. As a result (like with the aliasing
analysis info), we need to precompute the necessary information prior to
instruction fusing.

This was found during stress testing (running through the test suite with a very
low required chain length); unfortunately, I don't have a small test case.

llvm-svn: 159332

918ca2b8

Remove a useless check in BBVectorize. · 0873d73c

Hal Finkel authored Jun 28, 2012

A shuffle mask will always be a constant, but I did not realize that
when I originally wrote the code.

llvm-svn: 159331

0873d73c

Allow BBVectorize to form non-2^n-length vectors. · f2dcb9a9

Hal Finkel authored Jun 28, 2012

The original algorithm only used recursive pair fusion of equal-length
types. This is now extended to allow pairing of any types that share
the same underlying scalar type. Because we would still generally
prefer the 2^n-length types, those are formed first. Then a second
set of iterations form the non-2^n-length types.

Also, a call to SimplifyInstructionsInBlock has been added after each
pairing iteration. This takes care of DCE (and a few other things)
that make the following iterations execute somewhat faster. For the
same reason, some of the simple shuffle-combination cases are now
handled internally.

There is some additional refactoring work to be done, but I've had
many requests for this feature, so additional refactoring will come
soon in future commits (as will additional test cases).

llvm-svn: 159330

f2dcb9a9

Refactor operation equivalence checking in BBVectorize by extending Instruction::isSameOperationAs. · 74e5225c

Hal Finkel authored Jun 28, 2012

Maintaining this kind of checking in different places is dangerous, extending
Instruction::isSameOperationAs consolidates this logic into one place. Here
I've added an optional flags parameter and two flags that are important for
vectorization: CompareIgnoringAlignment and CompareUsingScalarTypes.

llvm-svn: 159329

74e5225c

Move lib/Analysis/DebugInfo.cpp to lib/VMCore/DebugInfo.cpp and · e38859dc

Bill Wendling authored Jun 28, 2012

include/llvm/Analysis/DebugInfo.h to include/llvm/DebugInfo.h.

The reasoning is because the DebugInfo module is simply an interface to the
debug info MDNodes and has nothing to do with analysis.

llvm-svn: 159312

e38859dc

Jun 27, 2012

Revert r159136 due to PR13124. · a5886231

Matt Beaumont-Gay authored Jun 27, 2012

Original commit message:

If a constant or a function has linkonce_odr linkage and unnamed_addr, mark it
hidden. Being linkonce_odr guarantees that it is available in every dso that
needs it. Being a constant/function with unnamed_addr guarantees that the
copies don't have to be merged.

llvm-svn: 159272

a5886231

Some reassociate optimizations create new instructions, which they insert just · 514db117

Duncan Sands authored Jun 27, 2012

before the expression root. Any existing operators that are changed to use one
of them needs to be moved between it and the expression root, and recursively
for the operators using that one. When I rewrote RewriteExprTree I accidentally
inverted the logic, resulting in the compacting going down from operators to
operands rather than up from operands to the operators using them, oops. Fix
this, resolving PR12963.

llvm-svn: 159265

514db117

Remove a instcombine transform that (no longer?) makes sense: · 319be53a

Evan Cheng authored Jun 26, 2012

    // C - zext(bool) -> bool ? C - 1 : C
    if (ZExtInst *ZI = dyn_cast<ZExtInst>(Op1))
      if (ZI->getSrcTy()->isIntegerTy(1))
        return SelectInst::Create(ZI->getOperand(0), SubOne(C), C);

This ends up forming sext i1 instructions that codegen to terrible code. e.g.
int blah(_Bool x, _Bool y) {
  return (x - y) + 1;
}
=>
        movzbl  %dil, %eax
        movzbl  %sil, %ecx
        shll    $31, %ecx
        sarl    $31, %ecx
        leal    1(%rax,%rcx), %eax
        ret


Without the rule, llvm now generates:
        movzbl  %sil, %ecx
        movzbl  %dil, %eax
        incl    %eax
        subl    %ecx, %eax
        ret

It also helps with ARM (and pretty much any target that doesn't have a sext i1 :-).

The transformation was done as part of Eli's r75531. He has given the ok to
remove it.

rdar://11748024

llvm-svn: 159230

319be53a

Jun 26, 2012

Replacing zero-sized alloca's with a null pointer is too aggressive, instead · 8bc764ae

Duncan Sands authored Jun 26, 2012

merge all zero-sized alloca's into one, fixing c43204g from the Ada ACATS
conformance testsuite. What happened there was that a variable sized object
was being allocated on the stack, "alloca i8, i32 %size". It was then being
passed to another function, which tested that the address was not null (raising
an exception if it was) then manipulated %size bytes in it (load and/or store).
The optimizers cleverly managed to deduce that %size was zero (congratulations
to them, as it isn't at all obvious), which made the alloca zero size, causing
the optimizers to replace it with null, which then caused the check mentioned
above to fail, and the exception to be raised, wrongly. Note that no loads
and stores were actually being done to the alloca (the loop that does them is
executed %size times, i.e. is not executed), only the not-null address check.

llvm-svn: 159202

8bc764ae

revert my previous commit (r159173), since as Eli pointed out, it's perfectly... · 31b54a53
Nuno Lopes authored Jun 25, 2012
```
revert my previous commit (r159173), since as Eli pointed out, it's perfectly ok to mark realloc as noalias

llvm-svn: 159175
```
31b54a53

do not set realloc() as NotAlias, since it can return the same pointer. This... · 75eaa72d

Nuno Lopes authored Jun 25, 2012

do not set realloc() as NotAlias, since it can return the same pointer. This whole thing should be upgraded to use the MemoryBuiltin interface anyway..

llvm-svn: 159173

75eaa72d

Jun 25, 2012

Fix the objc_autoreleasedReturnValue optimization code to locate · 5f725cd1
Dan Gohman authored Jun 25, 2012
```
the call correctly even in the case where it is an invoke. This
fixes rdar://11714057.

llvm-svn: 159157
```
5f725cd1

improve optimization of invoke instructions: · 07594cba

Nuno Lopes authored Jun 25, 2012

 - simplifycfg:  invoke undef/null -> unreachable
 - instcombine:  invoke new  -> invoke expect(0, 0)  (an arbitrary NOOP intrinsic;  only done if the allocated memory is unused, of course)
 - verifier:  allow invoke of intrinsics  (to make the previous step work)

llvm-svn: 159146

07594cba

If a constant or a function has linkonce_odr linkage and unnamed_addr, mark it · 540c3d23

Rafael Espindola authored Jun 25, 2012

hidden. Being linkonce_odr guarantees that it is available in every dso that
needs it. Being a constant/function with unnamed_addr guarantees that the
copies don't have to be merged.

llvm-svn: 159136

540c3d23

The name (and comment describing) of llvm::GetFirstDebuigLocInBasicBlock no... · f0ad3606

Eli Bendersky authored Jun 25, 2012

The name (and comment describing) of llvm::GetFirstDebuigLocInBasicBlock no longer represents what the function does. Therefore, the function is removed and its functionality is folded into the only place in the code-base where it was being used.

llvm-svn: 159133

f0ad3606

Jun 24, 2012
- llvm/lib: [CMake] Add explicit dependency to intrinsics_gen. · 704de074
  NAKAMURA Takumi authored Jun 24, 2012
```
llvm-svn: 159112
```
  704de074
- Allow controlling vectorization of boolean values separately from other integer types. · 3099ce94
  Hal Finkel authored Jun 24, 2012
```
These are used as the result of comparisons, and often handled differently from larger integer types.

llvm-svn: 159111
```
  3099ce94
- Remove dyn_cast + dereference pattern by replacing it with a cast and changing · 0a045bbe
  Nick Lewycky authored Jun 24, 2012
```
the safety check to look for the same type we're going to actually cast to.
Fixes PR13180!

llvm-svn: 159110
```
  0a045bbe
- Tab to spaces. No functionality change. · b74ae9c5
  Nick Lewycky authored Jun 24, 2012
```
llvm-svn: 159104
```
  b74ae9c5
- Remove a dangling reference to a deleted instruction. Fixes PR13185! · bfb07fb5
  Nick Lewycky authored Jun 24, 2012
```
llvm-svn: 159096
```
  bfb07fb5
Jun 23, 2012

Allow BBVectorize to fuse compare instructions. · 4b06b1a0
Hal Finkel authored Jun 23, 2012
```
llvm-svn: 159088
```
4b06b1a0

Extend the IL for selecting TLS models (PR9788) · cbe34b4c

Hans Wennborg authored Jun 23, 2012

This allows the user/front-end to specify a model that is better
than what LLVM would choose by default. For example, a variable
might be declared as

  @x = thread_local(initialexec) global i32 42

if it will not be used in a shared library that is dlopen'ed.

If the specified model isn't supported by the target, or if LLVM can
make a better choice, a different model may be used.

llvm-svn: 159077

cbe34b4c

Optimized usage of new SwitchInst case values (IntegersSubset type) in... · 8e00efea

Stepan Dyatkovskiy authored Jun 23, 2012

Optimized usage of new SwitchInst case values (IntegersSubset type) in Local.cpp, Execution.cpp and BitcodeWriter.cpp.
I got about 1% of compile-time improvement on my machines (Ubuntu 11.10 i386 and Ubuntu 12.04 x64).

llvm-svn: 159076

8e00efea

BoundsChecking: attach debug info to traps to make my life a bit more sane · de8c6fb2
Nuno Lopes authored Jun 23, 2012
```
llvm-svn: 159055
```
de8c6fb2

Jun 22, 2012

Revert remaining part of r93200: "Disable folding sext(trunc(x)) -> x" · c5c4e96f

Jakob Stoklund Olesen authored Jun 22, 2012

This fixes PR5997.

These transforms were disabled because codegen couldn't deal with other
uses of trunc(x). This is now handled by the peephole pass.

This causes no regressions on x86-64.

llvm-svn: 159003

c5c4e96f

Fixed r158979. · a6c8cc30

Stepan Dyatkovskiy authored Jun 22, 2012

Original message:
Performance optimizations:
- SwitchInst: case values stored separately from Operands List. It allows to make faster access to individual case value numbers or ranges.
- Optimized IntItem, added APInt value caching.
- Optimized IntegersSubsetGeneric: added optimizations for cases when subset is single number or when subset consists from single numbers only.

llvm-svn: 158997

a6c8cc30

fix whitespace in my last commit. · 0b60ebbf
Nuno Lopes authored Jun 22, 2012
```
sorry for the churn :S  enough for today; going to sleep.

llvm-svn: 158953
```
0b60ebbf

remove extractMallocCallFromBitCast, since it was tailor maded for its sole... · 9792d683

Nuno Lopes authored Jun 22, 2012

remove extractMallocCallFromBitCast, since it was tailor maded for its sole user. Update GlobalOpt accordingly.

llvm-svn: 158952

9792d683

instcombine: disable optimization of 'invoke null/undef'. I'll move this... · 771e7bd4

Nuno Lopes authored Jun 21, 2012

instcombine: disable optimization of 'invoke null/undef'. I'll move this functionality to SimplifyCFG (since we cannot make changes to the CFG here).
Fixes the crashes with the attached test case

llvm-svn: 158951

771e7bd4

Look pass zext to strength reduce an udiv. Patch by David Majnemer. rdar://11721329 · 32c7cc8e
Evan Cheng authored Jun 21, 2012
```
llvm-svn: 158946
```
32c7cc8e

Jun 21, 2012

Add support for invoke to the MemoryBuiltin analysid. · dc6085e5

Nuno Lopes authored Jun 21, 2012

Update comments accordingly.

Make instcombine remove useless invokes to C++'s 'new' allocation function (test attached).

llvm-svn: 158937

dc6085e5

port the BoundsChecking patch to the new MemoryBuiltin API (i.e., remove most... · 0e967e01

Nuno Lopes authored Jun 21, 2012

port the BoundsChecking patch to the new MemoryBuiltin API (i.e., remove most of the code from here).
Remove the alloc_size.ll test until we settle on a metadata format that makes everyone happy..

llvm-svn: 158920

0e967e01

refactor the MemoryBuiltin analysis: · 55fff834

Nuno Lopes authored Jun 21, 2012

 - provide more extensive set of functions to detect library allocation functions (e.g., malloc, calloc, strdup, etc)
 - provide an API to compute the size and offset of an object pointed by

Move a few clients (GVN, AA, instcombine, ...) to the new API.
This implementation is a lot more aggressive than each of the custom implementations being replaced.

Patch reviewed by Nick Lewycky and Chandler Carruth, thanks.

llvm-svn: 158919

55fff834

Add a number of threshold arguments to the SRA pass. · 4e9012c2
Nadav Rotem authored Jun 21, 2012
```
A patch by Tom Stellard with minor changes.

llvm-svn: 158918
```
4e9012c2

Jun 20, 2012

replace usage of EmitGEPOffset() with TargetData::getIndexedOffset() when the... · 3fa32f24

Nuno Lopes authored Jun 20, 2012

replace usage of EmitGEPOffset() with TargetData::getIndexedOffset() when the GEP offset is known to be constant.
With this change, we avoid relying on the IR Builder to constant fold the operations.

No functionality change intended.

llvm-svn: 158829

3fa32f24