Commits · 07f0f7762972cd1fc3a58b24f56df52930da4d7b · Roger Ferrer / llvm-epi-0.8

Feb 03, 2012
- Incorporate suggestions Chad, Jakob and Evan's suggestions on r149957. · bb682450
  Lang Hames authored Feb 03, 2012
```
llvm-svn: 149655
```
  bb682450
- Narrow test further. Make bot and test happy. · 1df8cdc5
  Jim Grosbach authored Feb 03, 2012
```
llvm-svn: 149650
```
  1df8cdc5
- Tidy up. Trailing whitespace. · 7815f56b
  Jim Grosbach authored Feb 03, 2012
```
llvm-svn: 149649
```
  7815f56b
- Restrict InstCombine from converting varargs to or from fixed args. · e84ae7bf
  Jim Grosbach authored Feb 03, 2012
```
More targetted fix replacing d0e277d272d517ca1cda368267d199f0da7cad95.

llvm-svn: 149648
```
  e84ae7bf
- Revert "Disable InstCombine unsafe folding bitcasts of calls w/ varargs." · 0ab54184
  Jim Grosbach authored Feb 03, 2012
```
This reverts commit d0e277d272d517ca1cda368267d199f0da7cad95.

llvm-svn: 149647
```
  0ab54184
- Require non-NULL register masks. · 5e1ac45b
  Jakob Stoklund Olesen authored Feb 02, 2012
```
It doesn't seem worthwhile to give meaning to a NULL register mask
pointer. It complicates all the code using register mask operands.

llvm-svn: 149646
```
  5e1ac45b
Feb 02, 2012

Add pseudo-registers for pairs, triples, and quads of D registers. · caed1c93

Jakob Stoklund Olesen authored Feb 02, 2012

NEON loads and stores accept single and double spaced pairs, triples,
and quads of D registers.  This patch adds new register classes to
accurately model those constraints:

  Dn, Dn+1    Dn, Dn+2
  ----------------------
  DPair       DPairSpc
  DTriple     DTripleSpc
  DQuad       DQuadSpc

Also extend the existing QQ and QQQQ register classes to contains all Q
pairs and quads instead of just the aligned ones.

These new register classes will make it possible to accurately model
constraints on NEON loads and stores, and we can get rid of all the NEON
pseudo-instructions.  The late scheduler will be able to accurately
model instruction dependencies from the explicit operands.

This more than doubles the number of ARM registers, but the backend
passes are quite good at handling this. The llc -O0 compile time only
regresses by 1.5%.  Future work on register mask operands will recover
this regression.

llvm-svn: 149640

caed1c93

BBVectorize: Simplify code, no functionality change. · f61f60d9
Benjamin Kramer authored Feb 02, 2012
```
Also silences warnings about bodyless for loops.

llvm-svn: 149612
```
f61f60d9

Minor changes from review. · 8cf51b87

Hal Finkel authored Feb 02, 2012

As suggested by Nick Lewycky, the tree traversal queues have been changed to SmallVectors and the associated loops have been rotated. Also, an 80-col violation was fixed.

llvm-svn: 149607

8cf51b87

Minor change in signature of the getZeroVector() · 6fbb4d28
Elena Demikhovsky authored Feb 02, 2012
```
llvm-svn: 149601
```
6fbb4d28
Optimization for SIGN_EXTEND operation on AVX. · fb44980b
Elena Demikhovsky authored Feb 02, 2012
```
Special handling was added for v4i32 -> v4i64 and v8i16 -> v8i32
extensions.

llvm-svn: 149600
```
fb44980b
Unbreak the MSVC build. · 26f302d5
Francois Pichet authored Feb 02, 2012
```
llvm-svn: 149599
```
26f302d5

Re-apply the coalescer fix from r149147. Commit r149597 should have fixed the... · a808dc45

Lang Hames authored Feb 02, 2012

Re-apply the coalescer fix from r149147. Commit r149597 should have fixed the llvm-gcc and clang self-host issues.

llvm-svn: 149598

a808dc45

Set EFLAGS correctly in EmitLoweredSelect on X86. · 0269caaf
Lang Hames authored Feb 02, 2012
```
llvm-svn: 149597
```
0269caaf
Break as soon as the MustMapCurValNos flag is set - no need to reiterate. · 4d04f753
Lang Hames authored Feb 02, 2012
```
llvm-svn: 149596
```
4d04f753

Vectorize long blocks in groups. · 0f3298e8

Hal Finkel authored Feb 02, 2012

Long basic blocks with many candidate pairs (such as in the SHA implementation in Perl 5.14; thanks to Roman Divacky for the example) used to take an unacceptably-long time to compile. Instead, break long blocks into groups so that no group has too many candidate pairs.

llvm-svn: 149595

0f3298e8

PR11868. The previous loop in LiveIntervals::join would sometimes fall over if · 3a20bc36

Lang Hames authored Feb 02, 2012

more than two adjacent ranges needed to be merged. The new version should be
able to handle an arbitrary sequence of adjancent ranges.

llvm-svn: 149588

3a20bc36

Set the correct stack pointer register. · 961883c1
Akira Hatanaka authored Feb 02, 2012
```
 

llvm-svn: 149585
```
961883c1
Expand EHSELECTION and EHSELECTION nodes. Set the correct exception pointer and · f029537e
Akira Hatanaka authored Feb 02, 2012
```
selector registers.
 

llvm-svn: 149584
```
f029537e
Add DWARF numbers of 64-bit registers. · d9fef177
Akira Hatanaka authored Feb 02, 2012
```
 

llvm-svn: 149583
```
d9fef177
Typo · c1a6f981
Pete Cooper authored Feb 01, 2012
```
llvm-svn: 149562
```
c1a6f981
Fix the cmake build · 77295818
Rafael Espindola authored Feb 01, 2012
```
llvm-svn: 149561
```
77295818

Instruction scheduling itinerary for Intel Atom. · 8523b16f

Andrew Trick authored Feb 01, 2012

Adds an instruction itinerary to all x86 instructions, giving each a default latency of 1, using the InstrItinClass IIC_DEFAULT.

Sets specific latencies for Atom for the instructions in files X86InstrCMovSetCC.td, X86InstrArithmetic.td, X86InstrControl.td, and X86InstrShiftRotate.td. The Atom latencies for the remainder of the x86 instructions will be set in subsequent patches.

Adds a test to verify that the scheduler is working.

Also changes the scheduling preference to "Hybrid" for i386 Atom, while leaving x86_64 as ILP.

Patch by Preston Gurd!

llvm-svn: 149558

8523b16f

Move ARM subreg index compositions to the SubRegIndex itself. · c7024a48
Jakob Stoklund Olesen authored Feb 01, 2012
```
llvm-svn: 149557
```
c7024a48

Feb 01, 2012

fix cmake · 3441597f
Andrew Trick authored Feb 01, 2012
```
llvm-svn: 149553
```
3441597f
Avoid creating an extract element to an illegal type after LegalizeTypes has run. · 9f052066
Mon P Wang authored Feb 01, 2012
```
llvm-svn: 149548
```
9f052066

VLIW specific scheduler framework that utilizes deterministic finite automaton (DFA). · d06df96a

Andrew Trick authored Feb 01, 2012

This new scheduler plugs into the existing selection DAG scheduling framework. It is a top-down critical path scheduler that tracks register pressure and uses a DFA for pipeline modeling.

Patch by Sergei Larin!

llvm-svn: 149547

d06df96a

Tidy up. · e273cb08
Chad Rosier authored Feb 01, 2012
```
llvm-svn: 149521
```
e273cb08
Passing AVX 256-bit structures in Win64 was wrong. · 824eed70
Elena Demikhovsky authored Feb 01, 2012
```
Fixed Win64 calling conventions.

llvm-svn: 149494
```
824eed70
Shortened code in shuffle masks · 34cca175
Elena Demikhovsky authored Feb 01, 2012
```
llvm-svn: 149493
```
34cca175
Optimization for "truncate" operation on AVX. · 0e48c70b
Elena Demikhovsky authored Feb 01, 2012
```
Truncating v4i64 -> v4i32 and v8i32 -> v8i16 may be done with set of shuffles.

llvm-svn: 149485
```
0e48c70b

SwitchInst refactoring. · 513aaa56

Stepan Dyatkovskiy authored Feb 01, 2012

The purpose of refactoring is to hide operand roles from SwitchInst user (programmer). If you want to play with operands directly, probably you will need lower level methods than SwitchInst ones (TerminatorInst or may be User). After this patch we can reorganize SwitchInst operands and successors as we want.

What was done:

1. Changed semantics of index inside the getCaseValue method:
getCaseValue(0) means "get first case", not a condition. Use getCondition() if you want to resolve the condition. I propose don't mix SwitchInst case indexing with low level indexing (TI successors indexing, User's operands indexing), since it may be dangerous.
2. By the same reason findCaseValue(ConstantInt*) returns actual number of case value. 0 means first case, not default. If there is no case with given value, ErrorIndex will returned.
3. Added getCaseSuccessor method. I propose to avoid usage of TerminatorInst::getSuccessor if you want to resolve case successor BB. Use getCaseSuccessor instead, since internal SwitchInst organization of operands/successors is hidden and may be changed in any moment.
4. Added resolveSuccessorIndex and resolveCaseIndex. The main purpose of these methods is to see how case successors are really mapped in TerminatorInst.
4.1 "resolveSuccessorIndex" was created if you need to level down from SwitchInst to TerminatorInst. It returns TerminatorInst's successor index for given case successor.
4.2 "resolveCaseIndex" converts low level successors index to case index that curresponds to the given successor.

Note: There are also related compatability fix patches for dragonegg, klee, llvm-gcc-4.0, llvm-gcc-4.2, safecode, clang.
llvm-svn: 149481

513aaa56

Add pass printer passes in the right place. · cbc845f9

Andrew Trick authored Feb 01, 2012

The pass pointer should never be referenced after sending it to
schedulePass(), which may delete the pass. To fix this bug I had to
clean up the design leading to more goodness.

You may notice now that any non-analysis pass is printed. So things like loop-simplify and lcssa show up, while target lib, target data, alias analysis do not show up. Normally, analysis don't mutate the IR, but you can now check this by using both -print-after and -print-before. The effects of analysis will now show up in between the two.

The llc path is still in bad shape. But I'll be improving it in my next checkin. Meanwhile, print-machineinstrs still works the same way. With print-before/after, many llc passes that were not printed before now are, some of these should be converted to analysis. A few very important passes, isel and scheduler, are not properly initialized, so not printed.

llvm-svn: 149480

cbc845f9

Don't create VBROADCAST nodes if any nodes use the chain result from the load. Fixes PR11900. · 9cdb8bdf
Craig Topper authored Feb 01, 2012
```
llvm-svn: 149478
```
9cdb8bdf
BBVectorize.cpp: Try to fix MSVC build. map::iterator and multimap::iterator are incompatible. · e1d61f66
NAKAMURA Takumi authored Feb 01, 2012
```
llvm-svn: 149475
```
e1d61f66
A few of the changes suggested in code review (by Nick Lewycky) · 8a3aebe5
Hal Finkel authored Feb 01, 2012
```
llvm-svn: 149472
```
8a3aebe5
Revert Chris' commits up to r149348 that started causing VMCoreTests unit test to fail. · 17c981a4
Argyrios Kyrtzidis authored Feb 01, 2012
```
These are:

r149348
r149351
r149352
r149354
r149356
r149357
r149361
r149362
r149364
r149365

llvm-svn: 149470
```
17c981a4

Add a basic-block autovectorization pass. · c34e5113

Hal Finkel authored Feb 01, 2012

This is the initial checkin of the basic-block autovectorization pass along with some supporting vectorization infrastructure.
Special thanks to everyone who helped review this code over the last several months (especially Tobias Grosser).

llvm-svn: 149468

c34e5113

Disable InstCombine unsafe folding bitcasts of calls w/ varargs. · 9fa04815

Jim Grosbach authored Feb 01, 2012

Changing arguments from being passed as fixed to varargs is unsafe, as
the ABI may require they be handled differently (stack vs. register, for
example).

Remove two tests which rely on the bitcast being folded into the direct
call, which is exactly the transformation that's unsafe.

llvm-svn: 149457

9fa04815

Tidy up. One more return type mismatch fix. · a2147ce3
Jim Grosbach authored Jan 31, 2012
```
llvm-svn: 149452
```
a2147ce3