Commits · 6096d5a279e85499e00fb207f9495c9f7f48f9c3 · Roger Ferrer / llvm-epi-0.8

Mar 26, 2010
- Debug info shouldn't affect kills. · 6096d5a2
  Dale Johannesen authored Mar 26, 2010
```
llvm-svn: 99637
```
  6096d5a2
- Add a paragraph describing how to extract line number information. · 5bc1c4bd
  Devang Patel authored Mar 26, 2010
```
llvm-svn: 99636
```
  5bc1c4bd
- http://englishplus.com/grammar/00000296.htm · 66de0493
  Gabor Greif authored Mar 26, 2010
```
llvm-svn: 99635
```
  66de0493
- Fix SmallVector's insert to handle non-random-access iterators. · 42e77df7
  Dan Gohman authored Mar 26, 2010
```
llvm-svn: 99633
```
  42e77df7
- vldm/vstm can only do up to 16 double-word registers at a time. · bf59859b
  Jim Grosbach authored Mar 26, 2010
```
Radar 7797856

llvm-svn: 99630
```
  bf59859b
- Add N3RegFrm to represent "NEON 3 vector register format" instructions. · 8fc94d63
  Johnny Chen authored Mar 26, 2010
```
Examples are VABA (Vector Absolute Difference and Accumulate), VABAL (Vector
Absolute Difference and Accumulate Long), and VABD (Vector Absolute Difference).

llvm-svn: 99628
```
  8fc94d63
- Do not sibcall if stack needs to be dynamically aligned. · 3365fb14
  Evan Cheng authored Mar 26, 2010
```
llvm-svn: 99620
```
  3365fb14
- Allow trivial sibcall of vararg callee when no arguments are being passed. · 00a620c6
  Evan Cheng authored Mar 26, 2010
```
llvm-svn: 99598
```
  00a620c6
- LiveVariables should clear kill / dead markers first. This allows us to remove... · eb50ac5c
  Evan Cheng authored Mar 26, 2010
```
LiveVariables should clear kill / dead markers first. This allows us to remove a hack in the scheduler.

llvm-svn: 99597
```
  eb50ac5c
- Add N2RegVShLFrm and N2RegVShRFrm formats so that the disassembler can easily · 5d4e917d
  Johnny Chen authored Mar 26, 2010
```
dispatch to the appropriate routines to handle the different interpretations of
the shift amount encoded in the imm6 field.  The Vd, Vm fields are interpreted
the same between the two, though.

See, for example, A8.6.367 VQSHL, VQSHLU (immediate) for N2RegVShLFrm format and
A8.6.368 VQSHRN, VQSHRUN for N2RegVShRFrm format.

llvm-svn: 99590
```
  5d4e917d
- Avoid leaking argv and env arrays from lli. · bfd38abb
  Jeffrey Yasskin authored Mar 26, 2010
```
llvm-svn: 99589
```
  bfd38abb
- Ignore debug intrinsics in yet more places. · d42e09d9
  Dan Gohman authored Mar 26, 2010
```
llvm-svn: 99580
```
  d42e09d9
- Try trivial remat before the coalescer gives up on a vr / physreg coalescing... · 7b4a1a22
  Evan Cheng authored Mar 26, 2010
```
Try trivial remat before the coalescer gives up on a vr / physreg coalescing for fear of tying up a physical register.

llvm-svn: 99575
```
  7b4a1a22
- Handle DEBUG_VALUE in this pass. · 5d99d7fe
  Dale Johannesen authored Mar 26, 2010
```
llvm-svn: 99573
```
  5d99d7fe
- switch the flag for using NEON for SP floating point to a subtarget 'feature'. · 71fcb4fe
  Jim Grosbach authored Mar 25, 2010
```
Re-commit. This time complete with testsuite updates.

llvm-svn: 99570
```
  71fcb4fe
- need to fix 'make check' tests first. revert for a moment. · 42bb89c7
  Jim Grosbach authored Mar 25, 2010
```
llvm-svn: 99569
```
  42bb89c7
- switch the flag for using NEON for SP floating point to a subtarget 'feature' · 7fce4e39
  Jim Grosbach authored Mar 25, 2010
```
llvm-svn: 99568
```
  7fce4e39
- rename pred_const_iterator to const_pred_iterator for consistency's sake · 6c6b2fd2
  Gabor Greif authored Mar 25, 2010
```
llvm-svn: 99567
```
  6c6b2fd2
- Removed instruction class NI from ARMInstrFormats.td. · a3617ec8
  Johnny Chen authored Mar 25, 2010
```
It doesn't seem to be used anywhere.

llvm-svn: 99566
```
  a3617ec8
- switch the use-vml[as] instructions flag to a subtarget 'feature' · a43386ba
  Jim Grosbach authored Mar 25, 2010
```
llvm-svn: 99565
```
  a43386ba
- rename use_const_iterator to const_use_iterator for consistency's sake · c78d720f
  Gabor Greif authored Mar 25, 2010
```
llvm-svn: 99564
```
  c78d720f
Mar 25, 2010

llvm-mc: Add a -mc-relax-all option, which relaxes every fixup. We always need · d821f4ac

Daniel Dunbar authored Mar 25, 2010

exactly two passes in that case, and don't ever need to recompute any layout,
so this is a nice baseline for relaxation performance.

llvm-svn: 99563

d821f4ac

Add NVDupLnFrm and change NVDupLane class to use that format. · 91d27744
Johnny Chen authored Mar 25, 2010
```
llvm-svn: 99557
```
91d27744
ARM cortex-a8 doesn't do vmla/vmls well. disable them by default for that cpu · 4b3b2ef6
Jim Grosbach authored Mar 25, 2010
```
llvm-svn: 99549
```
4b3b2ef6
Add NVCVTFrm (NEON Convert with fractional bits immediate) and modify N2VImm to · d82f9002
Johnny Chen authored Mar 25, 2010
```
expect a Format arg.  N2VCvtD/N2VCvtQ are modified to use the NVCVTFrm format.

llvm-svn: 99548
```
d82f9002
Add nounwind. · dbcf861a
Evan Cheng authored Mar 25, 2010
```
llvm-svn: 99546
```
dbcf861a
Code clean up. · 510bda20
Evan Cheng authored Mar 25, 2010
```
llvm-svn: 99544
```
510bda20

MC: Stop restarting layout on every relaxation. · 6432bd74

Daniel Dunbar authored Mar 25, 2010

 - Still O(N^2), just a faster form, and now it's the MCAsmLayout's fault.

On the .s I am tuning against (combine.s from 403.gcc):
--
ddunbar@lordcrumb:MC$ diff stats-before.txt stats-after.txt
5,10c5,10
<    1728 assembler - Number of assembler layout and relaxation steps
<    7707 assembler - Number of emitted assembler fragments
<  120588 assembler - Number of emitted object file bytes
< 2233448 assembler - Number of evaluated fixups
<    1727 assembler - Number of relaxed instructions
< 6723845 mcexpr    - Number of MCExpr evaluations
---
>      3 assembler - Number of assembler layout and relaxation steps
>   7707 assembler - Number of emitted assembler fragments
> 120588 assembler - Number of emitted object file bytes
>  14796 assembler - Number of evaluated fixups
>   1727 assembler - Number of relaxed instructions
>  67889 mcexpr    - Number of MCExpr evaluations
--
Feel free to LOL at the -before numbers, if you like.

I am a little surprised we make more than 2 relaxation passes. It's pretty
trivial for us to do relaxation out-of-order if that would give a speedup.

llvm-svn: 99543

6432bd74
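As a rough illustration of why restarting layout after every relaxation is so costly, the toy model below (Python, not LLVM code) counts layout passes under both strategies. The `Jump` class, the 2-byte and 5-byte jump sizes, and the signed 8-bit displacement range are assumptions made for this sketch; the real assembler works on fragments and fixups. The -before stats above show 1728 layout steps for 1727 relaxed instructions, which is consistent with one restart per relaxation.

```python
from dataclasses import dataclass

SHORT, LONG = 2, 5  # assumed sizes in bytes of a short and a relaxed (long) jump


@dataclass
class Jump:
    target: int            # index of the fragment this jump branches to
    relaxed: bool = False

    @property
    def size(self) -> int:
        return LONG if self.relaxed else SHORT


def layout(frags):
    """Assign a start offset to each fragment; the last entry is the total size."""
    offsets, pos = [], 0
    for f in frags:
        offsets.append(pos)
        pos += f.size
    offsets.append(pos)
    return offsets


def needs_relaxation(i, frags, offsets):
    """A short jump needs relaxing when its displacement does not fit in 8 bits."""
    f = frags[i]
    if f.relaxed:
        return False
    disp = offsets[f.target] - offsets[i + 1]  # measured from the end of the jump
    return not -128 <= disp <= 127


def relax_restarting(frags):
    """Redo the whole layout after every single relaxation."""
    steps = 0
    while True:
        offsets = layout(frags)
        steps += 1
        for i in range(len(frags)):
            if needs_relaxation(i, frags, offsets):
                frags[i].relaxed = True
                break          # restart layout immediately
        else:
            return steps       # nothing left to relax


def relax_batched(frags):
    """Relax every out-of-range jump found in a pass, then lay out again."""
    steps = 0
    while True:
        offsets = layout(frags)
        steps += 1
        changed = False
        for i in range(len(frags)):
            if needs_relaxation(i, frags, offsets):
                frags[i].relaxed = True
                changed = True
        if not changed:
            return steps
```

With 200 jumps that all target the end of the section (`[Jump(target=200) for _ in range(200)]`), `relax_restarting` takes 137 layout passes (136 relaxations plus one final clean pass) while `relax_batched` takes 2, and both relax the same jumps. Batching is safe in this model because jumps only ever grow, so a jump that is out of range under stale offsets stays out of range.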

Fix -Asserts warning, again. · d919276b
Daniel Dunbar authored Mar 25, 2010
```
llvm-svn: 99542
```
d919276b
Tag SSE2 integer instructions as SSEPackedInt. · 3758ff91
Jakob Stoklund Olesen authored Mar 25, 2010
```
llvm-svn: 99540
```
3758ff91

Teach TableGen to understand X.Y notation in the TSFlagsFields strings. · f8d7eda6

Jakob Stoklund Olesen authored Mar 25, 2010

Remove much horribleness from X86InstrFormats as a result. Similar
simplifications are probably possible for other targets.

llvm-svn: 99539

f8d7eda6

fix a valgrind error on copy-constructor-synthesis.cpp, which is caused when · fc4ec253

Chris Lattner authored Mar 25, 2010

the custom insertion hook deletes the instruction, then we try to set dead
flags on it.  Neither the code that I added nor the code that was there 
before was safe.

llvm-svn: 99538

fc4ec253

Remove an unused option. · a1d0a027
Evan Cheng authored Mar 25, 2010
```
llvm-svn: 99537
```
a1d0a027
MC: Simplify main section layout process by moving alignment into LayoutSection. · 0ba6a671
Daniel Dunbar authored Mar 25, 2010
```
llvm-svn: 99529
```
0ba6a671
MC: Sink Section address assignment into LayoutSection. · 25d114b2
Daniel Dunbar authored Mar 25, 2010
```
llvm-svn: 99528
```
25d114b2

Add a late SSEDomainFix pass that twiddles SSE instructions to avoid domain crossings. · 49e121d5

Jakob Stoklund Olesen authored Mar 25, 2010

On Nehalem and newer CPUs there is a 2 cycle latency penalty on using a register
in a different domain than where it was defined. Some instructions have
equivalents for different domains, like por/orps/orpd.

The SSEDomainFix pass tries to minimize the number of domain crossings by
changing between equivalent opcodes where possible.

This is a work in progress, in particular the pass doesn't do anything yet. SSE
instructions are tagged with their execution domain in TableGen using the last
two bits of TSFlags. Note that not all instructions are tagged correctly. Life
just isn't that simple.

The SSE execution domain issue is very similar to the ARM NEON/VFP pipeline
issue handled by NEONMoveFixPass. This pass may become target independent to
handle both.

llvm-svn: 99524

49e121d5
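The idea of swapping equivalent opcodes to avoid domain crossings can be sketched with a toy model. This is not the pass from the commit (which, as noted, does nothing yet); the `DOMAIN` and `EQUIVALENTS` tables, the domain labels, the tuple-based instruction format, and the greedy backward-looking heuristic are all assumptions made for illustration.

```python
# Hypothetical tables: execution domain of each opcode, and groups of opcodes
# that compute the same result in different domains (por/orps/orpd).
DOMAIN = {
    "por": "int", "orps": "ps", "orpd": "pd",
    "paddd": "int", "addps": "ps", "addpd": "pd",
}
EQUIVALENTS = [{"por", "orps", "orpd"}]


def alternatives(opcode):
    """Return the set of opcodes interchangeable with `opcode` (itself included)."""
    for group in EQUIVALENTS:
        if opcode in group:
            return group
    return {opcode}


def count_crossings(program):
    """Count register uses whose domain differs from the defining instruction's.

    `program` is a list of (opcode, dest, sources) tuples; registers that are
    never defined in the program (live-ins) are ignored.
    """
    def_domain, crossings = {}, 0
    for opcode, dest, sources in program:
        dom = DOMAIN[opcode]
        crossings += sum(1 for s in sources
                         if s in def_domain and def_domain[s] != dom)
        def_domain[dest] = dom
    return crossings


def fix_domains(program):
    """Greedy pass: pick the equivalent opcode matching the most input domains."""
    def_domain, out = {}, []
    for opcode, dest, sources in program:
        src_domains = [def_domain[s] for s in sources if s in def_domain]
        best = opcode
        best_hits = sum(d == DOMAIN[opcode] for d in src_domains)
        for alt in sorted(alternatives(opcode)):
            hits = sum(d == DOMAIN[alt] for d in src_domains)
            if hits > best_hits:
                best, best_hits = alt, hits
        out.append((best, dest, sources))
        def_domain[dest] = DOMAIN[best]
    return out
```

For example, a `por` that combines the results of two `addps` instructions and feeds a third causes three crossings in this model; `fix_domains` rewrites it to `orps` and the count drops to zero. A greedy pass like this only looks at producers, so it can miss cases where the consumers should decide the domain.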

Added a new instruction class NVDupLane to be inherited by VDUPLND and VDUPLNQ, · 45ab3f3c
Johnny Chen authored Mar 25, 2010
```
instead of the current N2V.  The format of NVDupLane instances is currently set
to NEONFrm.

llvm-svn: 99518
```
45ab3f3c
Reapply Kevin's change 94440, now that Chris has fixed the limitation on · e543e7fc
Bob Wilson authored Mar 25, 2010
```
opcode values fitting in one byte (svn r99494).

llvm-svn: 99514
```
e543e7fc
Sketch a few Clang release notes. · eb4bc7ff
Daniel Dunbar authored Mar 25, 2010
```
llvm-svn: 99512
```
eb4bc7ff
Add comment. · 95cd4b9c
Devang Patel authored Mar 25, 2010
```
llvm-svn: 99507
```
95cd4b9c