Commits · 1152ca9101388e247450bfef72c90ca92d163f0d · Roger Ferrer / llvm-epi-0.8

May 23, 2008
- Tidy up BasicBlock::getFirstNonPHI, and change a bunch of places to · f96e1371
  Dan Gohman authored May 23, 2008
```
use it instead of duplicating its functionality.

llvm-svn: 51499
```
  f96e1371
May 16, 2008

API change for {BinaryOperator|CmpInst|CastInst}::create*() --> Create. Legacy... · e1f6e4b2

Gabor Greif authored May 16, 2008

API change for {BinaryOperator|CmpInst|CastInst}::create*() --> Create. Legacy interfaces will be in place for some time.  (Merge from use-diet branch.)

llvm-svn: 51200

e1f6e4b2

May 13, 2008

Clean up the use of static and anonymous namespaces. This turned up · d78c400b

Dan Gohman authored May 13, 2008

several things that were neither in an anonymous namespace nor static
but not intended to be global.

llvm-svn: 51017

d78c400b

May 08, 2008
- Improve pass documentation and comments. · 829046b0
  Gordon Henriksen authored May 08, 2008
```
Patch by Matthijs Kooijman!

llvm-svn: 50861
```
  829046b0
Apr 27, 2008

Implement a signficant optimization for inline asm: · 22379734

Chris Lattner authored Apr 27, 2008

When choosing between constraints with multiple options,
like "ir", test to see if we can use the 'i' constraint and
go with that if possible.  This produces more optimal ASM in
all cases (sparing a register and an instruction to load it),
and fixes inline asm like this:

void test () {
  asm volatile (" %c0 %1 " : : "imr" (42), "imr"(14));
}

Previously we would dump "42" into a memory location (which
is ok for the 'm' constraint) which would cause a problem
because the 'c' modifier is not valid on memory operands.

Isn't it great how inline asm turns 'missed optimization'
into 'compile failed'??

Incidentally, this was the todo in 
PowerPC/2007-04-24-InlineAsm-I-Modifier.ll

Please do NOT pull this into Tak.

llvm-svn: 50315

22379734

Move a bunch of inline asm code out of line. · 4793515a
Chris Lattner authored Apr 27, 2008
```
llvm-svn: 50313
```
4793515a

Apr 25, 2008

Remove the code from CodeGenPrepare that moved getresult instructions · ca95a5f4

Dan Gohman authored Apr 25, 2008

to the block that defines their operands. This doesn't work in the
case that the operand is an invoke, because invoke is a terminator
and must be the last instruction in a block.

Replace it with support in SelectionDAGISel for copying struct values
into sequences of virtual registers.

llvm-svn: 50279

ca95a5f4

Apr 06, 2008
- silence a warning when assertions are disabled. · a39cfc5c
  Chris Lattner authored Apr 06, 2008
```
llvm-svn: 49283
```
  a39cfc5c
Mar 21, 2008
- Handle getresult instructions in different basic blocks · a25dde6f
  Dan Gohman authored Mar 21, 2008
```
from their aggregate operands by moving the getresult
instructions.

llvm-svn: 48657
```
  a25dde6f
Mar 19, 2008
- Remove dead options. · a90fdc43
  Evan Cheng authored Mar 19, 2008
```
llvm-svn: 48556
```
  a90fdc43
Feb 26, 2008
- fix http://llvm.org/bugs/show_bug.cgi?id=2097 · aa261720
  Gabor Greif authored Feb 26, 2008
```
llvm-svn: 47615
```
  aa261720
- Fix for pr2093: direct operands aren't necessarily addresses, so don't · 666bbe34
  Eli Friedman authored Feb 26, 2008
```
try to simplify them.

llvm-svn: 47610
```
  666bbe34
- Fix PR2076. CodeGenPrepare now sinks address computation for inline asm memory · 1da25009
  Evan Cheng authored Feb 26, 2008
```
operands into inline asm block.

llvm-svn: 47589
```
  1da25009
Jan 20, 2008
- Make sure the caller doesn't use freed memory. · afa84da4
  Duncan Sands authored Jan 20, 2008
```
Fixes PR1935.

llvm-svn: 46203
```
  afa84da4
Dec 29, 2007
- Remove attribution from file headers, per discussion on llvmdev. · f3ebc3f3
  Chris Lattner authored Dec 29, 2007
```
llvm-svn: 45418
```
  f3ebc3f3
Dec 25, 2007

Don't break critical edges for single-bb loops, this helps with PR1877, though · ef1bbfc7

Chris Lattner authored Dec 25, 2007

it is only a partial fix. This change is noise for most programs, but
speeds up Shootout-C++/matrix by 20%, Ptrdist/ks by 24%, smg2000 by 8%,
hexxagon by 9%, bzip2 by 9% (not sure I trust this), ackerman by 13%, etc.

OTOH, it slows down Shootout/fib2 by 40% (I'll update PR1877 with this info).

llvm-svn: 45354

ef1bbfc7

Dec 24, 2007
- add a -backedge-hack llc-beta option to codegenprepare. · 62a806d5
  Chris Lattner authored Dec 24, 2007
```
When specified, don't split backedges of single-bb loops.
This helps address PR1877

llvm-svn: 45344
```
  62a806d5
Dec 13, 2007
- Fix typo. · 2011df4e
  Evan Cheng authored Dec 13, 2007
```
llvm-svn: 44997
```
  2011df4e
- Be extra careful with extension use optimation. Now turned on by default. · 37c36ed7
  Evan Cheng authored Dec 13, 2007
```
llvm-svn: 44981
```
  37c36ed7
Dec 12, 2007
- Don't muck with phi nodes; bug fixes. · 63d33cfd
  Evan Cheng authored Dec 12, 2007
```
llvm-svn: 44905
```
  63d33cfd
- Bug fix. Only safe to perform extension uses optimization if the source of... · 7bc89425
  Evan Cheng authored Dec 12, 2007
```
Bug fix. Only safe to perform extension uses optimization if the source of extension is also defined in the same BB as the extension.

llvm-svn: 44896
```
  7bc89425
Dec 06, 2007

If both result of the {s|z}xt and its source are live out, rewrite all uses of... · d3d8017b

Evan Cheng authored Dec 05, 2007

If both result of the {s|z}xt and its source are live out, rewrite all uses of the source with result of extension.

llvm-svn: 44643

d3d8017b

Nov 06, 2007
- fix const correctness, BB is const, so its predecessors are too · 8201a9bc
  Chris Lattner authored Nov 06, 2007
```
llvm-svn: 43780
```
  8201a9bc
Nov 01, 2007

Executive summary: getTypeSize -> getTypeStoreSize / getABITypeSize. · 44b8721d

Duncan Sands authored Nov 01, 2007

The meaning of getTypeSize was not clear - clarifying it is important
now that we have x86 long double and arbitrary precision integers.
The issue with long double is that it requires 80 bits, and this is
not a multiple of its alignment.  This gives a primitive type for
which getTypeSize differed from getABITypeSize.  For arbitrary precision
integers it is even worse: there is the minimum number of bits needed to
hold the type (eg: 36 for an i36), the maximum number of bits that will
be overwriten when storing the type (40 bits for i36) and the ABI size
(i.e. the storage size rounded up to a multiple of the alignment; 64 bits
for i36).

This patch removes getTypeSize (not really - it is still there but
deprecated to allow for a gradual transition).  Instead there is:

(1) getTypeSizeInBits - a number of bits that suffices to hold all
values of the type.  For a primitive type, this is the minimum number
of bits.  For an i36 this is 36 bits.  For x86 long double it is 80.
This corresponds to gcc's TYPE_PRECISION.

(2) getTypeStoreSizeInBits - the maximum number of bits that is
written when storing the type (or read when reading it).  For an
i36 this is 40 bits, for an x86 long double it is 80 bits.  This
is the size alias analysis is interested in (getTypeStoreSize
returns the number of bytes).  There doesn't seem to be anything
corresponding to this in gcc.

(3) getABITypeSizeInBits - this is getTypeStoreSizeInBits rounded
up to a multiple of the alignment.  For an i36 this is 64, for an
x86 long double this is 96 or 128 depending on the OS.  This is the
spacing between consecutive elements when you form an array out of
this type (getABITypeSize returns the number of bytes).  This is
TYPE_SIZE in gcc.

Since successive elements in a SequentialType (arrays, pointers
and vectors) need to be aligned, the spacing between them will be
given by getABITypeSize.  This means that the size of an array
is the length times the getABITypeSize.  It also means that GEP
computations need to use getABITypeSize when computing offsets.
Furthermore, if an alloca allocates several elements at once then
these too need to be aligned, so the size of the alloca has to be
the number of elements multiplied by getABITypeSize.  Logically
speaking this doesn't have to be the case when allocating just
one element, but it is simpler to also use getABITypeSize in this
case.  So alloca's and mallocs should use getABITypeSize.  Finally,
since gcc's only notion of size is that given by getABITypeSize, if
you want to output assembler etc the same as gcc then getABITypeSize
is the size you want.

Since a store will overwrite no more than getTypeStoreSize bytes,
and a read will read no more than that many bytes, this is the
notion of size appropriate for alias analysis calculations.

In this patch I have corrected all type size uses except some of
those in ScalarReplAggregates, lib/Codegen, lib/Target (the hard
cases).  I will get around to auditing these too at some point,
but I could do with some help.

Finally, I made one change which I think wise but others might
consider pointless and suboptimal: in an unpacked struct the
amount of space allocated for a field is now given by the ABI
size rather than getTypeStoreSize.  I did this because every
other place that reserves memory for a type (eg: alloca) now
uses getABITypeSize, and I didn't want to make an exception
for unpacked structs, i.e. I did it to make things more uniform.
This only effects structs containing long doubles and arbitrary
precision integers.  If someone wants to pack these types more
tightly they can always use a packed struct.

llvm-svn: 43620

44b8721d

Aug 02, 2007
- wrap some long lines. Major offenders that are left include · 27406944
  Chris Lattner authored Aug 02, 2007
```
gvn, gvnpre, dse, and predsimplify.  To see these, use:

  make check-line-length

llvm-svn: 40738
```
  27406944
Aug 01, 2007
- More explicit keywords. · 34d442f2
  Dan Gohman authored Aug 01, 2007
```
llvm-svn: 40673
```
  34d442f2
Jun 12, 2007
- Sink CmpInst's to their uses to reduce register pressure. · edfec0b5
  Dale Johannesen authored Jun 12, 2007
```
llvm-svn: 37554
```
  edfec0b5
May 08, 2007
- Don't generate branch to entry block. · 86e1dcf5
  Dale Johannesen authored May 08, 2007
```
llvm-svn: 36917
```
  86e1dcf5
May 06, 2007
- Fix typo in comment. · e7da2d6a
  Nick Lewycky authored May 06, 2007
```
llvm-svn: 36873
```
  e7da2d6a
May 03, 2007
- Drop 'const' · 8c78a0bf
  Devang Patel authored May 03, 2007
```
llvm-svn: 36662
```
  8c78a0bf
May 02, 2007

Use 'static const char' instead of 'static const int'. · e95c6ad8

Devang Patel authored May 02, 2007

Due to darwin gcc bug, one version of darwin linker coalesces
static const int, which defauts PassID based pass identification.

llvm-svn: 36652

e95c6ad8

May 01, 2007
- Do not use typeinfo to identify pass in pass manager. · 09f162ca
  Devang Patel authored May 01, 2007
```
llvm-svn: 36632
```
  09f162ca
Apr 25, 2007

Fix · d3208523

Devang Patel authored Apr 25, 2007

http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20070423/048376.html

llvm-svn: 36417

d3208523

Apr 14, 2007
- use an accessor to simplify code. · 164b7656
  Chris Lattner authored Apr 14, 2007
```
llvm-svn: 35979
```
  164b7656
Apr 13, 2007

Completely rewrite addressing-mode related sinking of code. In particular, · feee64e9

Chris Lattner authored Apr 13, 2007

this fixes problems where codegenprepare would sink expressions into load/stores
that are not valid, and fixes cases where it would miss important valid ones.

This fixes several serious codesize and perf issues, particularly on targets
with complex addressing modes like arm and x86.  For example, now we compile
CodeGen/X86/isel-sink.ll to:

_test:
        movl 8(%esp), %eax
        movl 4(%esp), %ecx
        cmpl $1233, %eax
        ja LBB1_2       #F
LBB1_1: #T
        movl $4, (%ecx,%eax,4)
        movl $141, %eax
        ret
LBB1_2: #F
        movl (%ecx,%eax,4), %eax
        ret

instead of:

_test:
        movl 8(%esp), %eax
        leal (,%eax,4), %ecx
        addl 4(%esp), %ecx
        cmpl $1233, %eax
        ja LBB1_2       #F
LBB1_1: #T
        movl $4, (%ecx)
        movl $141, %eax
        ret
LBB1_2: #F
        movl (%ecx), %eax
        ret

llvm-svn: 35970

feee64e9

Apr 10, 2007
- eliminate the last uses of some TLI methods. · 3e9690f9
  Chris Lattner authored Apr 09, 2007
```
llvm-svn: 35844
```
  3e9690f9
Apr 02, 2007

Various passes before isel split edges and do other CFG-restructuring changes. · c3748562

Chris Lattner authored Apr 02, 2007

isel has its own particular features that it wants in the CFG, in order to
reduce the number of times a constant is computed, etc.  Make sure that we
clean up the CFG before doing any other things for isel.  Doing so can
dramatically reduce the number of split edges and reduce the number of
places that constants get computed.  For example, this shrinks
CodeGen/Generic/phi-immediate-factoring.ll from 44 to 37 instructions on X86,
and from 21 to 17 MBB's in the output.  This is primarily a code size win,
not a performance win.

This implements CodeGen/Generic/phi-immediate-factoring.ll and PR1296.

llvm-svn: 35575

c3748562

Mar 31, 2007
- Split the sdisel code munging stuff out into its own opt-pass, CodeGenPrepare. · f2836d17
  Chris Lattner authored Mar 31, 2007
```
llvm-svn: 35528
```
  f2836d17