Commits · 81ee731852b4faa36fa321037d9833a9e6a646c7 · Roger Ferrer / llvm-epi-0.8

Dec 15, 2008

Add a testcase for GCC PR 23455, which lpre handles now. Add some · 81ee7318
Chris Lattner authored Dec 15, 2008
```
comments about why we're not getting other cases.

llvm-svn: 61032
```
81ee7318
Update generated files after nocapture syntax change. · bc07dbd1
Nick Lewycky authored Dec 15, 2008
```
llvm-svn: 61031
```
bc07dbd1

It turns out that "align 1" and unaligned are different. Add a bias to the · 8d2ea183

Nick Lewycky authored Dec 15, 2008

alignment attribute such that 0 means unaligned.

This will probably require a rebuild of llvm-gcc because of the change to
Attributes.h. If you see many test failures on "make check", please rebuild
your llvm-gcc.

llvm-svn: 61030

8d2ea183
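
The bias described in the commit above can be illustrated with a small sketch. It assumes the log2-encoded alignment field introduced in r61019 and stores log2(align) + 1, so that a stored 0 stays free to mean "no alignment specified" while "align 1" becomes a distinct non-zero value. The function names are hypothetical and are not taken from Attributes.h.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helpers, not the actual Attributes.h API.
// Stored field: 0 means unaligned, otherwise log2(alignment) + 1.
uint32_t encodeBiasedAlign(uint32_t align) {
  if (align == 0) return 0;  // no alignment attribute present
  assert((align & (align - 1)) == 0 && "alignment must be a power of two");
  uint32_t log2 = 0;
  while ((align >> log2) != 1) ++log2;  // position of the single set bit
  return log2 + 1;                      // bias keeps 0 free for "unaligned"
}

uint32_t decodeBiasedAlign(uint32_t field) {
  // 0 decodes back to "unaligned"; anything else undoes the bias.
  return field == 0 ? 0 : (1u << (field - 1));
}
```

Without the bias, "align 1" would encode as log2(1) = 0 and be indistinguishable from an absent attribute.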

Added support to LegalizeType for expanding the operands of scalar to vector · ac4e1209

Mon P Wang authored Dec 15, 2008

and insert vector element.  Modified extract vector element to extend the
result to match the expected promoted type.

llvm-svn: 61029

ac4e1209

gvn now hoists this load out of the hot non-call path. · 3c2c36b5
Chris Lattner authored Dec 15, 2008
```
llvm-svn: 61028
```
3c2c36b5

Enable Load PRE. This teaches GVN to push partially redundant loads up the · 0c68ae06

Chris Lattner authored Dec 15, 2008

CFG when there is exactly one predecessor where the load is not available.
This is designed to not increase code size but still eliminate partially
redundant loads. This fires 1765 times on 403.gcc even though it doesn't
do critical edge splitting yet (the most common reason for it to fail).

llvm-svn: 61027

0c68ae06
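
The "exactly one predecessor" condition from the commit above can be sketched as a small standalone check. This is only an illustration of the rule as stated in the message, not the GVN implementation; the function name and the vector-of-flags representation are assumptions made for the example.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical helper: available[i] is true when the loaded value is already
// available at the end of predecessor i of the block containing the load.
// Returns the index of the single predecessor that lacks the value, or -1
// when the load is fully redundant or missing in more than one predecessor.
int singleUnavailablePred(const std::vector<bool>& available) {
  int missing = -1;
  for (std::size_t i = 0; i < available.size(); ++i) {
    if (available[i]) continue;
    if (missing != -1) return -1;  // second miss: PRE would add more than one load
    missing = static_cast<int>(i);
  }
  return missing;
}
```

Inserting the load into only that one predecessor replaces one load with another, which is why the rule does not grow code size.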

if we have a phi translation failure of the start block, · 7ed5ccc5
Chris Lattner authored Dec 15, 2008
```
return *just* a clobber of the start block, not other 
random stuff as well.

llvm-svn: 61026
```
7ed5ccc5
Ifdef out some code that I didn't mean to enable by default yet. · 03aacbae
Owen Anderson authored Dec 15, 2008
```
llvm-svn: 61024
```
03aacbae

make GVN try to rename inputs to the resultant replaced values, which · 69131fd8

Chris Lattner authored Dec 15, 2008

cleans up the generated code a bit.  This should have the added benefit of
not randomly renaming functions/globals like my previous patch did. :)

llvm-svn: 61023

69131fd8

Implement initial support for PHI translation in memdep. This means that · ff9f3dba

Chris Lattner authored Dec 15, 2008

memdep keeps track of how PHIs affect the pointer in dep queries, which 
allows it to eliminate the load in cases like rle-phi-translate.ll, which
basically end up being:

BB1:
   X = load P
   br BB3
BB2:
   Y = load Q
   br BB3
BB3:
   R = phi [P] [Q]
   load R

turning "load R" into a phi of X/Y.  In addition to additional exposed
opportunities, this makes memdep safe in many cases that it wasn't before
(which is required for load PRE) and also makes it substantially more 
efficient.  For example, consider:


BB1:  // has many predecessors.
   P = some_operator()
   load P

In this example, previously memdep would scan all the predecessors of BB1
to see if they had something that would mustalias P.  In some cases (e.g.
test/Transforms/GVN/rle-must-alias.ll) it would actually find them and end
up eliminating something.  In many other cases though, it would scan and not
find anything useful.  MemDep now stops at a block if the pointer is defined
in that block and cannot be phi translated to predecessors.  This causes it
to miss the (rare) cases like rle-must-alias.ll, but makes it faster by not
scanning tons of stuff that is unlikely to be useful.  For example, this
speeds up GVN as a whole from 3.928s to 2.448s (60%)!  IMO, scalar GVN 
should be enhanced to simplify the rle-must-alias pointer base anyway, which
would allow the loads to be eliminated.

In the future, this should be enhanced to phi translate through geps and 
bitcasts as well (as indicated by FIXMEs) making memdep even more powerful.

llvm-svn: 61022

ff9f3dba
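
The BB1/BB2/BB3 example in the commit above can be modelled with a toy sketch. It assumes a simplified world in which blocks and values are plain strings and each block records which pointers it has already loaded; none of these types or names come from memdep itself.

```cpp
#include <map>
#include <string>
#include <utility>
#include <vector>

using Block = std::string;
using Value = std::string;

// A pointer PHI: for each predecessor block, the pointer flowing in from it.
struct Phi {
  std::map<Block, Value> incoming;
};

// loads[block][pointer] names the value already loaded from pointer in block.
using AvailableLoads = std::map<Block, std::map<Value, Value>>;

// Translate the queried pointer into each predecessor and look for a load of
// the translated pointer there. Returns the (predecessor, value) pairs for
// the replacement PHI, or an empty vector if any predecessor has no such load.
std::vector<std::pair<Block, Value>> translateLoadThroughPhi(
    const Phi& ptrPhi, const AvailableLoads& loads) {
  std::vector<std::pair<Block, Value>> result;
  for (const auto& entry : ptrPhi.incoming) {
    const Block& pred = entry.first;
    const Value& translatedPtr = entry.second;  // R becomes P in BB1, Q in BB2
    auto blockIt = loads.find(pred);
    if (blockIt == loads.end()) return {};
    auto loadIt = blockIt->second.find(translatedPtr);
    if (loadIt == blockIt->second.end()) return {};
    result.emplace_back(pred, loadIt->second);
  }
  return result;
}
```

With R = phi [P from BB1] [Q from BB2] and loads X and Y recorded in those blocks, the result pairs BB1 with X and BB2 with Y, which corresponds to turning "load R" into a phi of X/Y.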

Add support for slow-path GVN with full phi construction for scalars. This is... · bfe133e4

Owen Anderson authored Dec 15, 2008

Add support for slow-path GVN with full phi construction for scalars. This is disabled for now, as it actually pessimizes code in the absence
of phi translation for load elimination. This slows down GVN a bit, by about 2% on 403.gcc.

llvm-svn: 61021

bfe133e4

Fix whitespace in comment. · e88d388f

Nick Lewycky authored Dec 15, 2008

Remove TODO; icmp isn't a binary operator, so this function will never deal
with them.

llvm-svn: 61020

e88d388f

Introducing nocapture, a parameter attribute for pointers to indicate that the · ddffe620

Nick Lewycky authored Dec 15, 2008

callee will not introduce any new aliases of that pointer.

The attributes had all bits allocated already, so I decided to collapse
alignment. Alignment was previously stored as a 16-bit integer from bits 16 to
32 of the attribute, but it was required to be a power of 2. Now it's stored in
log2 encoded form in five bits from 16 to 21. That gives us 11 more bits of
space.

You may have already noticed that you only need four bits to encode a 16-bit
power of two, so why five bits? Because the AsmParser accepted 32-bit
alignments, even though we couldn't store them (they were silently discarded).
Now we can store them in memory, but not in the bitcode.

The bitcode format was already storing these as 64-bit VBR integers. So, the
bitcode format stays the same, keeping the alignment values stored as 16 bit
raw values. There's some hideous code in the reader and writer that deals with
this, waiting to be ripped out the moment we run out of bits again and have to
replace the parameter attributes table encoding.

llvm-svn: 61019

ddffe620
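
The log2 packing described in the commit above can be sketched as follows. The sketch assumes that "five bits from 16 to 21" means bits 16 through 20 of a 32-bit attribute word; the constant and function names are hypothetical and do not reproduce the actual attribute code.

```cpp
#include <cassert>
#include <cstdint>

// Assumed layout: alignment lives in bits 16..20 as log2(alignment).
constexpr unsigned kAlignShift = 16;
constexpr uint32_t kAlignMask = 0x1Fu << kAlignShift;

// Store a power-of-two alignment into the attribute word, leaving the
// other attribute bits untouched.
uint32_t packAlignment(uint32_t attrs, uint32_t align) {
  assert(align != 0 && (align & (align - 1)) == 0 &&
         "alignment must be a power of two");
  uint32_t log2 = 0;
  while ((align >> log2) != 1) ++log2;  // log2 is at most 31, so it fits in 5 bits
  return (attrs & ~kAlignMask) | (log2 << kAlignShift);
}

// Recover the alignment from the 5-bit field.
uint32_t unpackAlignment(uint32_t attrs) {
  return 1u << ((attrs & kAlignMask) >> kAlignShift);
}
```

Five bits cover exponents 0 through 31, which is what lets 32-bit alignments be represented, and shrinking the field from 16 bits to 5 is where the 11 freed bits come from.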

Dec 14, 2008

silence warning when asserts disabled. · a66e9f42
Chris Lattner authored Dec 14, 2008
```
llvm-svn: 61014
```
a66e9f42
silence warning when asserts disabled. · c3d36efb
Chris Lattner authored Dec 14, 2008
```
llvm-svn: 61013
```
c3d36efb
eliminate warning when asserts disabled. · f5eef9f6
Chris Lattner authored Dec 14, 2008
```
llvm-svn: 61012
```
f5eef9f6
Generalize GVN's phi construction routine to work for things other than loads. · e34c2399
Owen Anderson authored Dec 14, 2008
```
llvm-svn: 61009
```
e34c2399
Reapply r60997, this time without forgetting that · f312dc77
Duncan Sands authored Dec 14, 2008
```
target constants are allowed to have an illegal
type.

llvm-svn: 61006
```
f312dc77

Temporarily revert r60997. It was causing this failure: · e5af6f19

Bill Wendling authored Dec 13, 2008

Running /Users/void/llvm/llvm.src/test/CodeGen/Generic/dg.exp ...
FAIL: /Users/void/llvm/llvm.src/test/CodeGen/Generic/asm-large-immediate.ll
Failed with exit(1) at line 1
while running:  llvm-as < /Users/void/llvm/llvm.src/test/CodeGen/Generic/asm-large-immediate.ll |  llc | /usr/bin/grep 68719476738
Assertion failed: ((TypesNeedLegalizing || getTypeAction(VT) == Legal) && "Illegal type introduced after type legalization?"), function HandleOp, file /Users/void/llvm/llvm.src/lib/CodeGen/SelectionDAG/LegalizeDAG.cpp, line 493.
0   llc               0x0085392e char const* std::find<char const*, char>(char const*, char const*, char const&) + 98
1   llc               0x00853e63 llvm::sys::PrintStackTraceOnErrorSignal() + 593
2   libSystem.B.dylib 0x96cac09b _sigtramp + 43
3   libSystem.B.dylib 0xffffffff _sigtramp + 1765097359
4   libSystem.B.dylib 0x96d24ec2 raise + 26
5   libSystem.B.dylib 0x96d3447f abort + 73
6   libSystem.B.dylib 0x96d26063 __assert_rtn + 101
7   llc               0x004f9018 llvm::cast_retty<llvm::SubprogramDesc, llvm::DebugInfoDesc*>::ret_type llvm::cast<llvm::Sub
...

llvm-svn: 61001

e5af6f19

Dec 13, 2008

LegalizeDAG is not supposed to introduce illegal · 24092271
Duncan Sands authored Dec 13, 2008
```
types into the DAG if they were not already there.
Check this with an assertion.

llvm-svn: 60997
```
24092271
These messages should always be emitted when NDEBUG is unset, not when · 695bc778
Chris Lattner authored Dec 13, 2008
```
NDEBUG is unset and -debug is passed.

llvm-svn: 60986
```
695bc778

Temporarily revert r60973. It's inexplicably causing a failure when self-hosting LLVM: · 293b9181

Bill Wendling authored Dec 13, 2008

llvm[2]: Linking Release executable opt (without symbols)
...
Undefined symbols:
  "llvm::APFloat::IEEEsingle", referenced from:
      __ZN4llvm7APFloat10IEEEsingleE$non_lazy_ptr in libLLVMCore.a(Constants.o)
      __ZN4llvm7APFloat10IEEEsingleE$non_lazy_ptr in libLLVMCore.a(AsmWriter.o)
      __ZN4llvm7APFloat10IEEEsingleE$non_lazy_ptr in libLLVMCore.a(ConstantFold.o)
  "llvm::APFloat::IEEEdouble", referenced from:
      __ZN4llvm7APFloat10IEEEdoubleE$non_lazy_ptr in libLLVMCore.a(Constants.o)
      __ZN4llvm7APFloat10IEEEdoubleE$non_lazy_ptr in libLLVMCore.a(AsmWriter.o)
      __ZN4llvm7APFloat10IEEEdoubleE$non_lazy_ptr in libLLVMCore.a(ConstantFold.o)
ld: symbol(s) not found

This is in release mode. To replicate, compile llvm and llvm-gcc in optimized
mode. Then build llvm, in optimized mode, with the newly created compiler.

llvm-svn: 60977

293b9181

Fix getFieldAs() to use the parameter instead of 6. · 040688f1
Torok Edwin authored Dec 13, 2008
```
Add missing DIType constructor, needed by DIVariable::getType().

llvm-svn: 60976
```
040688f1
Remove assertion to allow promotion of a truncating store operand · 472cd640
Mon P Wang authored Dec 13, 2008
```
llvm-svn: 60975
```
472cd640
Added basic support for expanding VSETCC · f95bd207
Mon P Wang authored Dec 13, 2008
```
llvm-svn: 60974
```
f95bd207
make RLE preserve the name of the load that it replaces. This is just · 1e29f7c9
Chris Lattner authored Dec 13, 2008
```
a prettification of the IR.

llvm-svn: 60973
```
1e29f7c9

On big-endian machines it is wrong to do a full · b6f09933

Duncan Sands authored Dec 13, 2008

width register load followed by a truncating
store for the copy, since the load will not place
the value in the lower bits.  Probably partial
loads/stores can never happen here, but fix it
anyway.

llvm-svn: 60972

b6f09933
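
The byte-order problem named in the commit above can be reproduced with a small simulation. It assumes a 16-bit value sitting at the start of a 4-byte region and models the "full width load" by assembling the bytes by hand; the helper names are made up for the example.

```cpp
#include <cstdint>

// Assemble four bytes the way a big-endian 32-bit load would.
uint32_t loadBE32(const uint8_t* p) {
  return (static_cast<uint32_t>(p[0]) << 24) |
         (static_cast<uint32_t>(p[1]) << 16) |
         (static_cast<uint32_t>(p[2]) << 8) |
         static_cast<uint32_t>(p[3]);
}

// Assemble four bytes the way a little-endian 32-bit load would.
uint32_t loadLE32(const uint8_t* p) {
  return static_cast<uint32_t>(p[0]) |
         (static_cast<uint32_t>(p[1]) << 8) |
         (static_cast<uint32_t>(p[2]) << 16) |
         (static_cast<uint32_t>(p[3]) << 24);
}

// A truncating store keeps only the low 16 bits of the register.
uint16_t truncTo16(uint32_t v) {
  return static_cast<uint16_t>(v & 0xFFFFu);
}
```

For the value 0x1234 followed by two unrelated bytes 0xAA 0xBB, the big-endian full-width load yields 0x1234AABB, so truncation keeps 0xAABB and loses the value. With the little-endian layout 0x34 0x12 0xAA 0xBB the same sequence keeps 0x1234.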

Fix spelling. · 234b44ad
Misha Brukman authored Dec 13, 2008
```
llvm-svn: 60971
```
234b44ad

Dec 12, 2008

Do not print empty DW_AT_comp_dir. · 42828e81
Devang Patel authored Dec 12, 2008
```
llvm-svn: 60965
```
42828e81

When expanding unaligned loads and stores do not make · 8f352fe1

Duncan Sands authored Dec 12, 2008

use of illegal integer types: instead, use a stack slot
and copying via integer registers.  The existing code
is still used if the bitconvert is to a legal integer
type.

This fires on the PPC testcases 2007-09-08-unaligned.ll
and vec_misaligned.ll.  It looks like equivalent code
is generated with these changes, just permuted, but
it's hard to tell.

With these changes, nothing in LegalizeDAG produces
illegal integer types anymore.  This is a prerequisite
for removing the LegalizeDAG type legalization code.

While there I noticed that the existing code doesn't
handle trunc store of f64 to f32: it turns this into
an i64 store, which represents a 4 byte stack smash.
I added a FIXME about this.  Hopefully someone more
motivated than I am will take care of it.

llvm-svn: 60964

8f352fe1

- Use patterns instead of creating completely new instruction matching patterns, · c4499feb

Bill Wendling authored Dec 12, 2008

  which are identical to the original patterns.

- Change the multiply with overflow so that we distinguish between signed and
  unsigned multiplication. Currently, unsigned multiplication with overflow
  isn't working!

llvm-svn: 60963

c4499feb

Fix add/sub expansion: don't create ADD / SUB with two results (seems like... · 3270a1de

Evan Cheng authored Dec 12, 2008

Fix add/sub expansion: don't create ADD / SUB with two results (seems like everyone is doing this these days :-). Patch by Daniel M Gessel!

llvm-svn: 60958

3270a1de

Revert my re-instated reverted commit, fixes the bootstrap build on x86-64 linux. · 729bf137
Nick Lewycky authored Dec 12, 2008
```
llvm-svn: 60951
```
729bf137

When using a 4 byte jump table on a 64 bit machine, · e4bcb8e2

Duncan Sands authored Dec 12, 2008

do an extending load of the 4 bytes rather than a
potentially illegal (type) i32 load followed by a
sign extend.

llvm-svn: 60945

e4bcb8e2

Don't make use of an illegal type (i64) when · dd6f3dbd
Duncan Sands authored Dec 12, 2008
```
lowering f64 function arguments.

llvm-svn: 60944
```
dd6f3dbd
Added support for SELECT v8i8 v4i16 for X86 (MMX) · 9c2d26d2
Mon P Wang authored Dec 12, 2008
```
Added support for TRUNC v8i16 to v8i8 for X86 (MMX)

llvm-svn: 60916
```
9c2d26d2

Redo the arithmetic with overflow architecture. I was changing the semantics of · 1a317678

Bill Wendling authored Dec 12, 2008

ISD::ADD to emit an implicit EFLAGS. This was horribly broken. Instead, replace
the intrinsic with an ISD::SADDO node. Then custom lower that into an
X86ISD::ADD node with an associated SETCC that checks the correct condition code
(overflow or carry). Then that gets lowered into the correct X86::ADDOvf
instruction.

Similar for SUB and MUL instructions.

llvm-svn: 60915

1a317678
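
The distinction between the two condition codes mentioned in the commit above (overflow for signed arithmetic, carry for unsigned) can be shown with a portable sketch. This models only the arithmetic result plus flag, not the DAG lowering; the struct and function names are hypothetical.

```cpp
#include <cstdint>

// Result bits plus the flag that the SETCC would test.
struct FlaggedSum {
  uint32_t bits;
  bool flag;
};

// Signed add: the flag is set when the true sum does not fit in int32_t.
FlaggedSum signedAddOverflow(int32_t a, int32_t b) {
  int64_t wide = static_cast<int64_t>(a) + static_cast<int64_t>(b);
  bool overflow = wide < INT32_MIN || wide > INT32_MAX;
  return {static_cast<uint32_t>(wide), overflow};  // low 32 bits of the sum
}

// Unsigned add: the flag is set when the sum wraps past 2^32 (carry out).
FlaggedSum unsignedAddCarry(uint32_t a, uint32_t b) {
  uint32_t sum = a + b;    // wraps modulo 2^32
  return {sum, sum < a};   // a wrapped sum is smaller than either operand
}
```

The same bit pattern can set one flag and not the other: 0x7FFFFFFF + 1 overflows as a signed add but produces no carry as an unsigned add.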

Dec 11, 2008

Fix an 80 col. violation. · a52c3b4b
Evan Cheng authored Dec 11, 2008
```
llvm-svn: 60901
```
a52c3b4b
Sneaky, sneaky: move the -1 to the outside of the SMax. Reinstate the · 6a344e09
Nick Lewycky authored Dec 11, 2008
```
optimization of SGE/SLE with unit stride, now that it works properly.

llvm-svn: 60881
```
6a344e09
fix grammar, thanks Duncan! · 32bfb5de
Torok Edwin authored Dec 11, 2008
```
llvm-svn: 60875
```
32bfb5de