Commits · a5beaf6958154b9ad6041705decbc10984227a2a · Roger Ferrer / llvm-epi-0.8

Mar 26, 2010
- Add N2RegVShLFrm and N2RegVShRFrm formats so that the disassembler can easily · 5d4e917d
  Johnny Chen authored Mar 26, 2010
```
dispatch to the appropriate routines to handle the different interpretations of
the shift amount encoded in the imm6 field.  The Vd, Vm fields are interpreted
the same between the two, though.

See, for example, A8.6.367 VQSHL, VQSHLU (immediate) for N2RegVShLFrm format and
A8.6.368 VQSHRN, VQSHRUN for N2RegVShRFrm format.

llvm-svn: 99590
```
  5d4e917d
- Avoid leaking argv and env arrays from lli. · bfd38abb
  Jeffrey Yasskin authored Mar 26, 2010
```
llvm-svn: 99589
```
  bfd38abb
- Ignore debug intrinsics in yet more places. · d42e09d9
  Dan Gohman authored Mar 26, 2010
```
llvm-svn: 99580
```
  d42e09d9
- Try trivial remat before the coalescer gives up on a vr / physreg coalescing... · 7b4a1a22
  Evan Cheng authored Mar 26, 2010
```
Try trivial remat before the coalescer gives up on a vr / physreg coalescing for fear of tying up a physical register.

llvm-svn: 99575
```
  7b4a1a22
- Handle DEBUG_VALUE in this pass. · 5d99d7fe
  Dale Johannesen authored Mar 26, 2010
```
llvm-svn: 99573
```
  5d99d7fe
- switch the flag for using NEON for SP floating point to a subtarget 'feature'. · 71fcb4fe
  Jim Grosbach authored Mar 25, 2010
```
Re-commit. This time complete with testsuite updates.

llvm-svn: 99570
```
  71fcb4fe
- need to fix 'make check' tests first. revert for a moment. · 42bb89c7
  Jim Grosbach authored Mar 25, 2010
```
llvm-svn: 99569
```
  42bb89c7
- switch the flag for using NEON for SP floating point to a subtarget 'feature' · 7fce4e39
  Jim Grosbach authored Mar 25, 2010
```
llvm-svn: 99568
```
  7fce4e39
- rename pred_const_iterator to const_pred_iterator for consistency's sake · 6c6b2fd2
  Gabor Greif authored Mar 25, 2010
```
llvm-svn: 99567
```
  6c6b2fd2
- Removed instruction class NI from ARMInstrFormats.td. · a3617ec8
  Johnny Chen authored Mar 25, 2010
```
It doesn't seem to be used anywhere.

llvm-svn: 99566
```
  a3617ec8
- switch the use-vml[as] instructions flag to a subtarget 'feature' · a43386ba
  Jim Grosbach authored Mar 25, 2010
```
llvm-svn: 99565
```
  a43386ba
- rename use_const_iterator to const_use_iterator for consistency's sake · c78d720f
  Gabor Greif authored Mar 25, 2010
```
llvm-svn: 99564
```
  c78d720f
Mar 25, 2010

llvm-mc: Add a -mc-relax-all option, which relaxes every fixup. We always need · d821f4ac

Daniel Dunbar authored Mar 25, 2010

exactly two passes in that case, and don't ever need to recompute any layout,
so this is a nice baseline for relaxation performance.

llvm-svn: 99563

d821f4ac

Add NVDupLnFrm and change NVDupLane class to use that format. · 91d27744
Johnny Chen authored Mar 25, 2010
```
llvm-svn: 99557
```
91d27744
ARM cortex-a8 doesn't do vmla/vmls well. disable them by default for that cpu · 4b3b2ef6
Jim Grosbach authored Mar 25, 2010
```
llvm-svn: 99549
```
4b3b2ef6
Add NVCVTFrm (NEON Convert with fractional bits immediate) and modify N2VImm to · d82f9002
Johnny Chen authored Mar 25, 2010
```
expect a Format arg.  N2VCvtD/N2VCvtQ are modified to use the NVCVTFrm format.

llvm-svn: 99548
```
d82f9002
Code clean up. · 510bda20
Evan Cheng authored Mar 25, 2010
```
llvm-svn: 99544
```
510bda20

MC: Stop restarting layout on every relaxation. · 6432bd74

Daniel Dunbar authored Mar 25, 2010

 - Still O(N^2), just a faster form, and now its the MCAsmLayout's fault.

On the .s I am tuning against (combine.s from 403.gcc):
--
ddunbar@lordcrumb:MC$ diff stats-before.txt stats-after.txt
5,10c5,10
<    1728 assembler - Number of assembler layout and relaxation steps
<    7707 assembler - Number of emitted assembler fragments
<  120588 assembler - Number of emitted object file bytes
< 2233448 assembler - Number of evaluated fixups
<    1727 assembler - Number of relaxed instructions
< 6723845 mcexpr    - Number of MCExpr evaluations
---
>      3 assembler - Number of assembler layout and relaxation steps
>   7707 assembler - Number of emitted assembler fragments
> 120588 assembler - Number of emitted object file bytes
>  14796 assembler - Number of evaluated fixups
>   1727 assembler - Number of relaxed instructions
>  67889 mcexpr    - Number of MCExpr evaluations
--
Feel free to LOL at the -before numbers, if you like.

I am a little surprised we make more than 2 relaxation passes. It's pretty
trivial for us to do relaxation out-of-order if that would give a speedup.

llvm-svn: 99543

6432bd74

Fix -Asserts warning, again. · d919276b
Daniel Dunbar authored Mar 25, 2010
```
llvm-svn: 99542
```
d919276b
Tag SSE2 integer instructions as SSEPackedInt. · 3758ff91
Jakob Stoklund Olesen authored Mar 25, 2010
```
llvm-svn: 99540
```
3758ff91

Teach TableGen to understand X.Y notation in the TSFlagsFields strings. · f8d7eda6

Jakob Stoklund Olesen authored Mar 25, 2010

Remove much horribleness from X86InstrFormats as a result. Similar
simplifications are probably possible for other targets.

llvm-svn: 99539

f8d7eda6

fix a valgrind error on copy-constructor-synthesis.cpp, which is caused when · fc4ec253

Chris Lattner authored Mar 25, 2010

the custom insertion hook deletes the instruction, then we try to set dead
flags on it.  Neither the code that I added nor the code that was there 
before was safe.

llvm-svn: 99538

fc4ec253

Remove an unused option. · a1d0a027
Evan Cheng authored Mar 25, 2010
```
llvm-svn: 99537
```
a1d0a027
MC: Simplify main section layout process by moving alignment into LayoutSection. · 0ba6a671
Daniel Dunbar authored Mar 25, 2010
```
llvm-svn: 99529
```
0ba6a671
MC: Sink Section address assignment into LayoutSection. · 25d114b2
Daniel Dunbar authored Mar 25, 2010
```
llvm-svn: 99528
```
25d114b2

Add a late SSEDomainFix pass that twiddles SSE instructions to avoid domain crossings. · 49e121d5

Jakob Stoklund Olesen authored Mar 25, 2010

On Nehalem and newer CPUs there is a 2 cycle latency penalty on using a register
in a different domain than where it was defined. Some instructions have
equvivalents for different domains, like por/orps/orpd.

The SSEDomainFix pass tries to minimize the number of domain crossings by
changing between equvivalent opcodes where possible.

This is a work in progress, in particular the pass doesn't do anything yet. SSE
instructions are tagged with their execution domain in TableGen using the last
two bits of TSFlags. Note that not all instructions are tagged correctly. Life
just isn't that simple.

The SSE execution domain issue is very similar to the ARM NEON/VFP pipeline
issue handled by NEONMoveFixPass. This pass may become target independent to
handle both.

llvm-svn: 99524

49e121d5

Added a new instruction class NVDupLane to be inherited by VDUPLND and VDUPLNQ, · 45ab3f3c
Johnny Chen authored Mar 25, 2010
```
instead of the current N2V.  Format of NVDupLane instances are set to NEONFrm
currently.

llvm-svn: 99518
```
45ab3f3c
Reapply Kevin's change 94440, now that Chris has fixed the limitation on · e543e7fc
Bob Wilson authored Mar 25, 2010
```
opcode values fitting in one byte (svn r99494).

llvm-svn: 99514
```
e543e7fc
Add comment. · 95cd4b9c
Devang Patel authored Mar 25, 2010
```
llvm-svn: 99507
```
95cd4b9c
MC/Mach-O: Switch to MCSectionData::getOrdinal. · 95145974
Daniel Dunbar authored Mar 25, 2010
```
llvm-svn: 99504
```
95145974

Scheduler assumes SDDbgValue nodes are in source order. That's true currently.... · 1889440b

Evan Cheng authored Mar 25, 2010

Scheduler assumes SDDbgValue nodes are in source order. That's true currently. But add an assertion to verify it.

llvm-svn: 99501

1889440b

MC: Explicity track section and fragment ordinals. · 41088026
Daniel Dunbar authored Mar 25, 2010
```
llvm-svn: 99500
```
41088026
Fix -Asserts warning. · eaa792f0
Daniel Dunbar authored Mar 25, 2010
```
llvm-svn: 99499
```
eaa792f0

Change tblgen to emit FOOISD opcode names as two · 552dddc5

Chris Lattner authored Mar 25, 2010

bytes instead of one byte.  This is important because
we're running up to too many opcodes to fit in a byte
and it is aggrevated by FIRST_TARGET_MEMORY_OPCODE
making the numbering sparse.  This just bites the
bullet and bloats out the table.  In practice, this
increases the size of the x86 isel table from 74.5K
to 76K.  I think we'll cope :)

This fixes rdar://7791648

llvm-svn: 99494

552dddc5

Include isFunctionLocal while calculating folding node set profile for a MDNode. · 44147119
Devang Patel authored Mar 25, 2010
```
llvm-svn: 99490
```
44147119
Remove a fixme that doesn't make sense any more. · 08b3364c
Evan Cheng authored Mar 25, 2010
```
llvm-svn: 99489
```
08b3364c
fix PR6642, GVN forwarding from memset to load of the base of the memset. · 05638049
Chris Lattner authored Mar 25, 2010
```
llvm-svn: 99488
```
05638049
Make sure SDDbgValue.Invalid is initialized to false by all the constructors. · 7f0b16a2
Evan Cheng authored Mar 25, 2010
```
llvm-svn: 99487
```
7f0b16a2

eliminate a bunch more parallels now that scheduling · 23bf99a9

Chris Lattner authored Mar 25, 2010

handles dead implicit results more aggressively.  More
to come, I think this is now just a data entry problem.

llvm-svn: 99486

23bf99a9

Make the NDEBUG assertion stronger and more clear what is · 4690af85

Chris Lattner authored Mar 25, 2010

happening.

Enhance scheduling to set the DEAD flag on implicit defs
more aggressively.  Before, we'd set an implicit def operand
to dead if it were present in the SDNode corresponding to
the machineinstr but had no use.  Now we do it in this case
AND if the implicit def does not exist in the SDNode at all.

This exposes a couple of problems: one is the FIXME, which
causes a live intervals crash on CodeGen/X86/sibcall.ll.
The second is that it makes machinecse and licm more 
aggressive (which is a good thing) but also exposes a case
where licm hoists a set0 and then it doesn't get resunk.

Talking to codegen folks about both these issues, but I need
this patch in in the meantime.

llvm-svn: 99485

4690af85