- Dec 13, 2010
-
Chris Lattner authored
when the wider type is legal.  This allows us to compile:

define zeroext i16 @test1(i16 zeroext %x) nounwind {
entry:
  %div = udiv i16 %x, 33
  ret i16 %div
}

into:

test1:                                  # @test1
        movzwl  4(%esp), %eax
        imull   $63551, %eax, %eax      # imm = 0xF83F
        shrl    $21, %eax
        ret

instead of:

test1:                                  # @test1
        movw    $-1985, %ax             # imm = 0xFFFFFFFFFFFFF83F
        mulw    4(%esp)
        andl    $65504, %edx            # imm = 0xFFE0
        movl    %edx, %eax
        shrl    $5, %eax
        ret

Implementing rdar://8760399 and example #4 from:
http://blog.regehr.org/archives/320

We should implement the same thing for [su]mul_hilo, but I don't have immediate plans to do this.

llvm-svn: 121696
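As a quick standalone check of the sequence above (an editor's C++ sketch, not part of the commit), the following exhaustively verifies that (x * 63551) >> 21, computed in 32 bits, matches x / 33 for every 16-bit input:

#include <cassert>
#include <cstdint>

int main() {
  // Mirrors the imull $63551 / shrl $21 pair from the generated code above.
  for (uint32_t x = 0; x <= 0xFFFF; ++x)
    assert(((x * 63551u) >> 21) == x / 33);
  return 0;
}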
-
Chris Lattner authored
'and' case. llvm-svn: 121695
-
Chandler Carruth authored
llvm-svn: 121694
-
Chris Lattner authored
I can track down a miscompile. This should bring the buildbots back to life. llvm-svn: 121693
-
Chandler Carruth authored
would return true if the initializer pointer union had *any* non-null pointer in it, even if the pointer wasn't one that would actually be returned via getInit(). This makes it more accurately model the logic of 'getInit() != NULL'. This still isn't completely satisfying. From a principled stance, I suspect we should make hasInit() and getInit() *always* return false and NULL (resp.) for ParmVarDecl. We shouldn't at the API level treat initializers and default arguments as the same thing. llvm-svn: 121692
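A minimal mock of the semantic point (an illustrative sketch only, not Clang's actual VarDecl or its pointer union; MockVarDecl, InitExpr, and OtherPayload are hypothetical names):

struct InitExpr {};
struct OtherPayload {};  // stands in for a union member that getInit() would not return

struct MockVarDecl {
  // Mimics a two-member pointer union; at most one is non-null.
  InitExpr *Init = nullptr;
  OtherPayload *Other = nullptr;

  InitExpr *getInit() const { return Init; }

  // Old behavior: any non-null member counted as "has an initializer",
  // even though getInit() could still return null.
  bool hasInitOld() const { return Init != nullptr || Other != nullptr; }

  // Fixed behavior: model exactly 'getInit() != NULL'.
  bool hasInit() const { return getInit() != nullptr; }
};

With only Other set, hasInitOld() answers true while getInit() is null; the fixed hasInit() always agrees with getInit().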
-
Chris Lattner authored
for each constant pool entry. Using WriteTypeSymbolic here takes time proportional to the size of the module, for each constant pool entry. This speeds up -verbose-asm llc on 252.eon (a random testcase at my disposal) from 4.4s to 2.137s. llc takes 2.11s with asm-verbose off, so this is now a pretty reasonable cost for verbose comments. llvm-svn: 121691
-
Chris Lattner authored
when simplifying, allowing them to be eagerly turned into switches.  This is the last step required to get "Example 7" from this blog post:
http://blog.regehr.org/archives/320

On X86, we now generate this machine code, which (to my eye) seems better than the ICC generated code:

_crud:                                  ## @crud
## BB#0:                                ## %entry
        cmpb    $33, %dil
        jb      LBB0_4
## BB#1:                                ## %switch.early.test
        addb    $-34, %dil
        cmpb    $58, %dil
        ja      LBB0_3
## BB#2:                                ## %switch.early.test
        movzbl  %dil, %eax
        movabsq $288230376537592865, %rcx ## imm = 0x400000017001421
        btq     %rax, %rcx
        jb      LBB0_4
LBB0_3:                                 ## %lor.rhs
        xorl    %eax, %eax
        ret
LBB0_4:                                 ## %lor.end
        movl    $1, %eax
        ret

llvm-svn: 121690
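The interesting piece of the code above is the range-check-plus-bit-test lowering: bias the character by 34 and test one bit of the 64-bit constant 0x400000017001421. Here is a C++ sketch of the same trick (illustrative only; inCharSet is a hypothetical name, not the SimplifyCFG or switch-lowering code):

#include <cassert>
#include <cstdint>

// True for c <= 32 and for the characters 34, 39, 44, 46, 58, 59, 60, 62
// and 92 -- the set tested by the machine code above.
static bool inCharSet(unsigned char c) {
  if (c <= 32)
    return true;                                // cmpb $33, %dil ; jb
  unsigned idx = c - 34u;                       // addb $-34, %dil
  if (idx > 58)
    return false;                               // cmpb $58, %dil ; ja
  const uint64_t Mask = 0x400000017001421ULL;   // one bit per character in the set
  return (Mask >> idx) & 1;                     // btq %rax, %rcx
}

int main() {
  for (int c = 0; c < 256; ++c) {
    bool expected = c <= 32 || c == 46 || c == 44 || c == 58 || c == 59 ||
                    c == 60 || c == 62 || c == 34 || c == 92 || c == 39;
    assert(inCharSet((unsigned char)c) == expected);
  }
  return 0;
}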
-
Chris Lattner authored
llvm-svn: 121689
-
Chris Lattner authored
per terminator kind. llvm-svn: 121688
-
Chris Lattner authored
llvm-svn: 121687
-
Chris Lattner authored
doing a cfg search for every block simplified. llvm-svn: 121686
-
Chris Lattner authored
llvm-svn: 121685
-
Chris Lattner authored
llvm-svn: 121684
-
Chris Lattner authored
getSinglePredecessor to simplify code. llvm-svn: 121683
-
Chris Lattner authored
llvm-svn: 121682
-
Chris Lattner authored
llvm-svn: 121681
-
Chris Lattner authored
'or sequence' that it doesn't understand.  This allows us to optimize something insane like this:

int crud (unsigned char c, unsigned x)
{
  if(((((((((( (int) c <= 32 || (int) c == 46) || (int) c == 44)
      || (int) c == 58) || (int) c == 59) || (int) c == 60)
      || (int) c == 62) || (int) c == 34) || (int) c == 92)
      || (int) c == 39) != 0)
    foo();
}

into:

define i32 @crud(i8 zeroext %c, i32 %x) nounwind ssp noredzone {
entry:
  %cmp = icmp ult i8 %c, 33
  br i1 %cmp, label %if.then, label %switch.early.test

switch.early.test:                                ; preds = %entry
  switch i8 %c, label %if.end [
    i8 39, label %if.then
    i8 44, label %if.then
    i8 58, label %if.then
    i8 59, label %if.then
    i8 60, label %if.then
    i8 62, label %if.then
    i8 46, label %if.then
    i8 92, label %if.then
    i8 34, label %if.then
  ]

by pulling the < comparison out ahead of the newly formed switch.

llvm-svn: 121680
-
Chris Lattner authored
llvm-svn: 121679
-
Chris Lattner authored
llvm-svn: 121678
-
Evan Cheng authored
llvm-svn: 121677
-
Chris Lattner authored
llvm-svn: 121676
-
Chris Lattner authored
llvm-svn: 121675
-
Chris Lattner authored
bootstrap buildbot tripped over. llvm-svn: 121674
-
Chris Lattner authored
llvm-svn: 121673
-
Chris Lattner authored
llvm-svn: 121672
-
Chris Lattner authored
or'd conditions.  Previously we'd compile something like this:

int crud (unsigned char c) {
  return c == 62 || c == 34 || c == 92;
}

into:

  switch i8 %c, label %lor.rhs [
    i8 62, label %lor.end
    i8 34, label %lor.end
  ]

lor.rhs:                                          ; preds = %entry
  %cmp8 = icmp eq i8 %c, 92
  br label %lor.end

lor.end:                                          ; preds = %entry, %entry, %lor.rhs
  %0 = phi i1 [ true, %entry ], [ %cmp8, %lor.rhs ], [ true, %entry ]
  %lor.ext = zext i1 %0 to i32
  ret i32 %lor.ext

which failed to merge the compare-with-92 into the switch.  With this patch we simplify this all the way to:

  switch i8 %c, label %lor.rhs [
    i8 62, label %lor.end
    i8 34, label %lor.end
    i8 92, label %lor.end
  ]

lor.rhs:                                          ; preds = %entry
  br label %lor.end

lor.end:                                          ; preds = %entry, %entry, %entry, %lor.rhs
  %0 = phi i1 [ true, %entry ], [ false, %lor.rhs ], [ true, %entry ], [ true, %entry ]
  %lor.ext = zext i1 %0 to i32
  ret i32 %lor.ext

which is much better for codegen's switch lowering stuff.  This kicks in 33 times on 176.gcc (for example) cutting 103 instructions off the generated code.

llvm-svn: 121671
-
Chris Lattner authored
llvm-svn: 121670
-
Chris Lattner authored
llvm-svn: 121669
-
Chris Lattner authored
location in simplifycfg. In the old days, SimplifyCFG was never run on the entry block, so we had to scan over all preds of the BB passed into simplifycfg to do this xform; now we can just check blocks ending with a condbranch. This avoids a scan over all preds of every simplified block, which should be a significant compile-time perf win on functions with lots of edges. No functionality change. llvm-svn: 121668
-
Chris Lattner authored
llvm-svn: 121667
-
Bill Wendling authored
class A<bit a, bits<3> x, bits<3> y> {
  bits<3> z;
  let z = !if(a, x, y);
}

The variable z will get the value of x when 'a' is 1 and the value of y when 'a' is 0.

llvm-svn: 121666
-
Chandler Carruth authored
cases. First, omit all builtin overloads when no non-record type is in the set of candidate types. Second, avoid arithmetic type overloads for non-arithmetic or enumeral types (counting vector types as arithmetic due to Clang extensions). When heavily using constructs such as STL's '<<' based stream logging, this can have a significant impact. One logging-heavy test case's compile time dropped by 10% with this. Self-host shows 1-2% improvement in compile time, but that's likely in the noise. llvm-svn: 121665
-
Chris Lattner authored
llvm-svn: 121664
-
Sean Callanan authored
very minor changes, changing how we get the target type from a TypedefType, adding a parameter to EnumDecl::Create(), and other minor tweaks. llvm-svn: 121663
-
Chris Lattner authored
llvm-svn: 121662
-
Bill Wendling authored
llvm-svn: 121661
-
Bill Wendling authored
llvm-svn: 121660
-
Chris Lattner authored
llvm-svn: 121659
-
Chris Lattner authored
llvm-svn: 121658
-
Chris Lattner authored
llvm-svn: 121657
-