  1. Mar 29, 2013
      Skip moving call address loading into callseq when targets prefer register indirect call. · 96b42608
      Michael Liao authored
      To enable a load of a call address to be folded with that call, the
      load is moved from outside the callseq into the callseq. Such a move
      adds a non-glued node (the load) into a glued sequence. This
      non-glued load is only removed when DAG selection folds it into a
      memory-form call instruction. When that instruction selection is
      disabled, it breaks DAG scheduling.
      
      To prevent that, the move is disabled when the target favors
      register-indirect calls.
      
      The previous workaround, which disabled CALL32m/CALL64m instruction
      selection, is removed.
      
      llvm-svn: 178308
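
For illustration, a minimal C++ sketch of what this folding means at the instruction level; the symbol names are hypothetical, and the two possible code sequences appear as comments.

```cpp
// Illustrative only: calling through a function pointer held in memory.
// The compiler may select a memory-form call (CALL64m), folding the load
// into the call, or a register-indirect call (CALL64r) preceded by an
// explicit load. The commit above disables the fold on targets that
// prefer the register-indirect form.
#include <cstdio>

static void callee() { std::puts("called"); }

// Hypothetical function pointer living in memory.
static void (*fptr)() = callee;

int main() {
  fptr(); // folded:   call qword ptr [rip + fptr]
          // unfolded: mov rax, qword ptr [rip + fptr]
          //           call rax
  return 0;
}
```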
  2. Mar 26, 2013
  3. Aug 24, 2012
  4. Jul 05, 2012
  5. May 09, 2012
  6. Apr 11, 2012
  7. Feb 27, 2012
  8. Feb 18, 2012
  9. Feb 16, 2012
      Use the same CALL instructions for Windows as for everything else. · 97e3115d
      Jakob Stoklund Olesen authored
      The different calling conventions and call-preserved registers are
      represented with regmask operands that are added dynamically.
      
      llvm-svn: 150708
      Enable register mask operands for x86 calls. · 8a450cb2
      Jakob Stoklund Olesen authored
      Call instructions no longer have a list of 43 call-clobbered registers.
      Instead, they get a single register mask operand with a bit vector of
      call-preserved registers.
      
      This saves a lot of memory, 42 x 32 bytes = 1344 bytes per call
      instruction, and it speeds up building call instructions because those
      43 imp-def operands no longer need to be added to use-def lists (and
      removed, shifted, and re-added for every explicit call operand).
      
      Passes like LiveVariables, LiveIntervals, RAGreedy, PEI, and
      BranchFolding are significantly faster because they can deal with call
      clobbers in bulk.
      
      Overall, clang -O2 is between 0% and 8% faster, uniformly distributed
      depending on call density in the compiled code.  Debug builds using
      clang -O0 are 0% - 3% faster.
      
      I have verified that this patch doesn't change the assembly generated
      for the LLVM nightly test suite when building with -disable-copyprop
      and -disable-branch-fold.
      
      Branch folding behaves slightly differently in a few cases because call
      instructions have different hash values now.
      
      Copy propagation flushes its data structures when it crosses a register
      mask operand. This causes it to leave a few dead copies behind, on the
      order of 20 instructions across the entire nightly test suite, including
      SPEC. Fixing this properly would require the pass to use different data
      structures.
      
      llvm-svn: 150638
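
To make the operand's shape concrete, here is a minimal sketch (hypothetical names, not LLVM's MachineOperand API) of a register mask as a packed bit vector, where a set bit means the register is preserved across the call:

```cpp
// A sketch, not LLVM's API: a register mask as a bit vector of
// call-preserved registers. Bit set = preserved, bit clear = clobbered,
// so a pass can answer "does this call clobber reg R?" in O(1) instead
// of scanning dozens of implicit-def operands.
#include <cstdint>
#include <cstdio>

constexpr unsigned NumRegs = 128;               // hypothetical register count
using RegMask = uint32_t[(NumRegs + 31) / 32];  // packed bit vector

static bool isPreservedAcrossCall(const RegMask mask, unsigned reg) {
  return (mask[reg / 32] >> (reg % 32)) & 1u;
}

int main() {
  RegMask mask = {};                 // all registers clobbered by default
  mask[40 / 32] |= 1u << (40 % 32);  // mark hypothetical reg 40 as preserved
  std::printf("reg 40: %s\n",
              isPreservedAcrossCall(mask, 40) ? "preserved" : "clobbered");
  std::printf("reg 41: %s\n",
              isPreservedAcrossCall(mask, 41) ? "preserved" : "clobbered");
}
```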
  10. Feb 02, 2012
      Instruction scheduling itinerary for Intel Atom. · 8523b16f
      Andrew Trick authored
      Adds an instruction itinerary to all x86 instructions, giving each a default latency of 1, using the InstrItinClass IIC_DEFAULT.
      
      Sets specific latencies for Atom for the instructions in files X86InstrCMovSetCC.td, X86InstrArithmetic.td, X86InstrControl.td, and X86InstrShiftRotate.td. The Atom latencies for the remainder of the x86 instructions will be set in subsequent patches.
      
      Adds a test to verify that the scheduler is working.
      
      Also changes the scheduling preference to "Hybrid" for i386 Atom, while leaving x86_64 as ILP.
      
      Patch by Preston Gurd!
      
      llvm-svn: 149558
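
As a rough sketch of the itinerary idea (hypothetical names, not TableGen's actual machinery): each instruction carries an itinerary class, and a subtarget maps classes to latencies, falling back to the default of 1.

```cpp
// A sketch, not LLVM's itinerary machinery: instructions are bucketed by
// itinerary class, and a subtarget maps each class to a latency. Classes
// the subtarget does not override keep the default latency of 1.
#include <cstdio>
#include <map>

enum ItinClass { IIC_DEFAULT, IIC_ALU_MEM, IIC_SHIFT };  // hypothetical classes

struct SubtargetItineraries {
  std::map<ItinClass, unsigned> latency;  // per-subtarget overrides
  unsigned getLatency(ItinClass ic) const {
    auto it = latency.find(ic);
    return it == latency.end() ? 1 : it->second;  // default latency is 1
  }
};

int main() {
  SubtargetItineraries atom;
  atom.latency[IIC_ALU_MEM] = 5;  // hypothetical Atom-specific latency
  std::printf("alu-mem: %u cycles\n", atom.getLatency(IIC_ALU_MEM));
  std::printf("shift:   %u cycles\n", atom.getLatency(IIC_SHIFT));  // default
}
```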
  11. Jan 26, 2012
      Handle call-clobbered ymm registers on Win64. · fc9dce25
      Jakob Stoklund Olesen authored
      The Win64 calling convention has xmm6-15 as callee-saved while still
      clobbering all ymm registers.
      
      Add a YMM_HI_6_15 pseudo-register that aliases the clobbered part of the
      ymm registers, and mark it as call-clobbered. This allows xmm
      registers to stay live across calls.
      
      This hack wouldn't be necessary with RegisterMask operands representing
      the call clobbers, but they are not quite operational yet.
      
      llvm-svn: 149088
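
A small sketch of the convention being modeled (the function below is hypothetical, not LLVM code): for ymm6-ymm15, the low 128 bits survive a Win64 call while the high 128 bits do not.

```cpp
// A sketch of the Win64 rule the pseudo-register encodes: the xmm half
// (low 128 bits) of ymm6-ymm15 is callee-saved, but the high 128 bits
// of every ymm register are clobbered by a call.
#include <cstdio>

static bool preservedAcrossWin64Call(unsigned ymmIndex, bool highHalf) {
  // Only the low halves of ymm6..ymm15 are preserved.
  return !highHalf && ymmIndex >= 6 && ymmIndex <= 15;
}

int main() {
  std::printf("xmm7 (low half of ymm7): %d\n", preservedAcrossWin64Call(7, false));  // 1
  std::printf("high half of ymm7:       %d\n", preservedAcrossWin64Call(7, true));   // 0
  std::printf("xmm3 (low half of ymm3): %d\n", preservedAcrossWin64Call(3, false));  // 0
}
```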
  12. Jan 20, 2012
  13. Mar 24, 2011
  14. Jan 26, 2011
  15. Jan 03, 2011
  16. Nov 30, 2010
  17. Nov 12, 2010
  18. Oct 18, 2010
      Added a handful of x86-32 instructions that were missing so that llvm-mc would · b9783dd9
      Kevin Enderby authored
      be more complete. These are only expected to be used by llvm-mc with
      assembly source, so there is no pattern, [], in the .td files. Most are
      being added to X86InstrInfo.td, as Chris suggested, and only comments
      about register uses are added. Suggestions are welcome on the .td
      changes, as I'm not sure about every detail of the x86 records. More
      missing instructions will be coming.
      
      llvm-svn: 116716
  19. Oct 05, 2010