- Jan 07, 2011
-
Evan Cheng authored
Revert r122955. It seems using movups to lower memcpy can cause massive regression (even on Nehalem) in edge cases. I also didn't see any real performance benefit. llvm-svn: 123015
-
Bob Wilson authored
Also fix an off-by-one in SelectionDAGBuilder that was preventing shuffle vectors from being translated to EXTRACT_SUBVECTOR. Patch by Tim Northover. The test changes are needed to keep those spill-q tests from testing aligned spills and restores. If the only aligned stack objects are spill slots, we no longer realign the stack frame. Prior to this patch, an EXTRACT_SUBVECTOR was legalized by loading from the stack, which created an aligned frame index. Now, however, there is nothing except the spill slot in the stack frame, so I added an aligned alloca. llvm-svn: 122995
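For context, a minimal sketch (not from the commit, using Clang's vector extensions; type and function names are assumptions) of the kind of source-level shuffle that SelectionDAGBuilder can now recognize as an EXTRACT_SUBVECTOR instead of legalizing through the stack:

```c
/* Illustrative only: a shuffle that keeps the low lanes of a wider
 * vector is a subvector extract. */
typedef float v4f32 __attribute__((vector_size(16)));
typedef float v2f32 __attribute__((vector_size(8)));

v2f32 low_half(v4f32 v) {
    /* Clang's __builtin_shufflevector: indices 0,1 select the low half. */
    return __builtin_shufflevector(v, v, 0, 1);
}
```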
-
- Jan 06, 2011
-
Chris Lattner authored
llvm-svn: 122978
-
Bob Wilson authored
llvm-svn: 122970
-
Bob Wilson authored
llvm-svn: 122969
-
Bob Wilson authored
llvm-svn: 122968
-
Benjamin Kramer authored
llvm-svn: 122966
-
Rafael Espindola authored
Patch by Richard Smith. llvm-svn: 122962
-
Benjamin Kramer authored
llvm-svn: 122960
-
Benjamin Kramer authored
InstCombine: If we call llvm.objectsize on a malloc call we can replace it with the size passed to malloc. llvm-svn: 122959
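A hedged C-level sketch of the fold (the function name is illustrative; `__builtin_object_size` is the usual route to `llvm.objectsize`):

```c
#include <stdlib.h>

size_t known_size(void) {
    char *p = malloc(32);
    /* With this change, InstCombine can fold the objectsize query to the
     * constant argument passed to malloc, i.e. 32. */
    size_t n = __builtin_object_size(p, 0);
    free(p);
    return n;
}
```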
-
Benjamin Kramer authored
llvm-svn: 122957
-
Evan Cheng authored
The theory is it's still faster than a pair of movq / a quad of movl. This will probably hurt older chips like P4 but should run faster on current and future Intel processors. rdar://8817010 llvm-svn: 122955
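As a rough illustration (not from the commit), a fixed 16-byte copy is the kind of memcpy this lowering targets: a single unaligned 16-byte SSE move pair instead of two 8-byte moves (or four 4-byte moves in 32-bit mode):

```c
#include <string.h>

/* Illustrative only: a known 16-byte copy that the x86 backend may now
 * lower with a movups load/store. */
void copy16(void *dst, const void *src) {
    memcpy(dst, src, 16);
}
```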
-
Chris Lattner authored
llvm-svn: 122954
-
Chris Lattner authored
llvm-svn: 122953
-
Evan Cheng authored
etc. takes an option OptSize. If OptSize is true, it would return the inline limit for functions with attribute OptSize. llvm-svn: 122952
-
Bill Wendling authored
works only on MinGW32. On 64-bit, the function to call is "__chkstk". Patch by KS Sreeram! llvm-svn: 122934
-
Bill Wendling authored
beginning of the "main" function. The assembler complains about the invalid suffix for the 'call' instruction. The right instruction is "callq __main". Patch by KS Sreeram! llvm-svn: 122933
-
- Jan 05, 2011
-
Chris Lattner authored
llvm-svn: 122921
-
Chris Lattner authored
llvm-svn: 122920
-
Chris Lattner authored
llvm-svn: 122893
-
Wesley Peck authored
Commit 122778 broke DWARF debug output when using the MBlaze backend. Fixed by overriding TargetFrameInfo::getFrameIndexOffset to take into account the new frame index information. llvm-svn: 122889
-
- Jan 04, 2011
-
Jakob Stoklund Olesen authored
bundles in the pass. llvm-svn: 122833
-
Jakob Stoklund Olesen authored
The analysis will be needed by both the greedy register allocator and the X86FloatingPoint pass. It only needs to be computed once when the CFG doesn't change. This pass is very fast, usually showing up as 0.0% wall time. llvm-svn: 122832
-
Dale Johannesen authored
warning is overzealous but gcc is what it is.) llvm-svn: 122829
-
Andrew Trick authored
llvm-svn: 122794
-
Bill Wendling authored
llvm-svn: 122789
-
- Jan 03, 2011
-
Evan Cheng authored
prologue and epilogue if the adjustment is 8. Similarly, use pushl / popl if the adjustment is 4 in 32-bit mode. In the epilogue, takes care to pop to a caller-saved register that's not live at the exit (either return or tailcall instruction). rdar://8771137 llvm-svn: 122783
-
Wesley Peck authored
llvm-svn: 122778
-
- Jan 02, 2011
-
Benjamin Kramer authored
This allows us to compile: void test(char *s, int a) { __builtin_memset(s, a, 15); } into 1 mul + 3 stores instead of 3 muls + 3 stores. llvm-svn: 122710
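A rough C-level sketch of the idea (not the actual generated code): the byte value is splatted across a word with one multiply, and that value is then reused by every store, here covering 15 bytes with three overlapping stores:

```c
#include <stdint.h>
#include <string.h>

void memset15(char *s, int a) {
    uint64_t splat = (uint8_t)a * UINT64_C(0x0101010101010101);
    uint32_t splat32 = (uint32_t)splat;
    memcpy(s, &splat, 8);        /* bytes 0..7   */
    memcpy(s + 8, &splat32, 4);  /* bytes 8..11  */
    memcpy(s + 11, &splat32, 4); /* bytes 11..14 (overlaps byte 11) */
}
```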
-
Oscar Fuentes authored
llvm-svn: 122706
-
Chris Lattner authored
llvm-svn: 122700
-
Chris Lattner authored
header for now for memset/memcpy opportunities. It turns out that loop-rotate is successfully rotating loops, but *DOESN'T MERGE THE BLOCKS*, turning "for loops" into 2 basic block loops that loop-idiom was ignoring. With this fix, we form many *many* more memcpy and memsets than before, including on the "history" loops in the viterbi benchmark, which look like this: for (j=0; j<MAX_history; ++j) { history_new[i][j+1] = history[2*i][j]; } Transforming these loops into memcpy's speeds up the viterbi benchmark from 11.98s to 3.55s on my machine. Woo. llvm-svn: 122685
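For illustration, the memcpy that loop-idiom can now form for the history loop above would look roughly like this (assuming a byte element type; the signature and names exist only for the sketch):

```c
#include <string.h>

void copy_history_row(char **history_new, char **history,
                      int i, int MAX_history) {
    /* Replaces: for (j = 0; j < MAX_history; ++j)
     *               history_new[i][j+1] = history[2*i][j];  */
    memcpy(&history_new[i][1], &history[2*i][0], (size_t)MAX_history);
}
```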
-
- Jan 01, 2011
-
Chris Lattner authored
llvm-svn: 122676
-
Chris Lattner authored
llvm-svn: 122675
-
Rafael Espindola authored
llvm-svn: 122667
-
Anton Korobeynikov authored
earlyclobber stuff. This should fix PRs 2313 and 8157. Unfortunately, no testcase, since it'd be dependent on register assignments. llvm-svn: 122663
-
Duncan Sands authored
is the wrong hammer for this nail, and is probably right. llvm-svn: 122661
-
Duncan Sands authored
numbering, in which it considers (for example) "%a = add i32 %x, %y" and "%b = add i32 %x, %y" to be equal because the operands are equal and the result of the instructions only depends on the values of the operands. This has almost no effect (it removes 4 instructions from gcc-as-one-file), and perhaps slows down compilation: I measured a 0.4% slowdown on the large gcc-as-one-file testcase, but it wasn't statistically significant. llvm-svn: 122654
-
Che-Liang Chiou authored
llvm-svn: 122653
-
Che-Liang Chiou authored
llvm-svn: 122652
-