Commits · 08704349da5aed388fa8694775593f69fd108070 · Roger Ferrer / llvm-epi-0.8

Apr 28, 2011

Rename getPersonalityPICSymbol to getCFIPersonalitySymbol, document it, and · 08704349

Rafael Espindola authored Apr 27, 2011

give it a bit more responsibility. Also implement it for MachO.

If hacked to use cfi, 32 bit MachO will produce

.cfi_personality 155, L___gxx_personality_v0$non_lazy_ptr

and 64 bit will produce

.cfi_personality ___gxx_personality_v0

The general idea is that .cfi_personality gets passed the final symbol. It is
up to codegen to produce it if using indirect representation (like 32 bit
MachO), but it is up to MC to decide which relocations to create.

llvm-svn: 130341

08704349

Make the fast-isel code for literal 0.0 a bit shorter/faster, since 0.0 is... · 406c471b

Eli Friedman authored Apr 27, 2011

Make the fast-isel code for literal 0.0 a bit shorter/faster, since 0.0 is common.  rdar://problem/9303592 .

llvm-svn: 130338

406c471b

Apr 27, 2011
- Fix a bug in the case that there is no add or subtract symbol and the offset · 886894cb
  Kevin Enderby authored Apr 27, 2011
```
value is zero so it does not add a NULL expr operand.

llvm-svn: 130330
```
  886894cb
- Revert r130178. It turned out not to be the optimal path to emit complex location expressions. · e3745fdc
  Devang Patel authored Apr 27, 2011
```
llvm-svn: 130326
```
  e3745fdc
- Refactor out code to fast-isel a memcpy operation with a small constant · bcc69141
  Eli Friedman authored Apr 27, 2011
```
length.  (I'm planning to use this to implement byval.)

llvm-svn: 130274
```
  bcc69141
- Fix an edge case involving branches in fast-isel on x86. · 0eea0293
  Eli Friedman authored Apr 27, 2011
```
rdar://problem/9303306 .

llvm-svn: 130272
```
  0eea0293
Apr 26, 2011

Transform: "icmp eq (trunc (lshr(X, cst1))), cst" to "icmp (and X, mask), cst" · 1b06c716

Chris Lattner authored Apr 26, 2011

when X has multiple uses.  This is useful for exposing secondary optimizations,
but the X86 backend isn't ready for this when X has a single use.  For example,
this can disable load folding.

This is inching towards resolving PR6627.

llvm-svn: 130238

1b06c716
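
As a rough illustration of why the rewrite in 1b06c716 preserves the comparison, the sketch below models a 64-bit X truncated to 8 bits. The function names and the fixed widths are assumptions made for this example and are not taken from the patch.

```cpp
#include <cassert>
#include <cstdint>

// Original form: icmp eq (trunc (lshr X, Shift) to i8), Cst
bool cmpTruncShift(uint64_t X, unsigned Shift, uint8_t Cst) {
  assert(Shift <= 56 && "the 8 compared bits must lie inside the 64-bit value");
  return static_cast<uint8_t>(X >> Shift) == Cst;
}

// Rewritten form: icmp eq (and X, Mask), (Cst << Shift)
// The mask selects exactly the bits that the shift and truncate would keep.
bool cmpMasked(uint64_t X, unsigned Shift, uint8_t Cst) {
  assert(Shift <= 56 && "the 8 compared bits must lie inside the 64-bit value");
  const uint64_t Mask = uint64_t{0xFF} << Shift;
  return (X & Mask) == (uint64_t{Cst} << Shift);
}
```

Both functions inspect the same eight bits of X, so they agree for every input. The masked form works on X directly, without the shift and truncate.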

ARM and Thumb2 support for atomic MIN/MAX/UMIN/UMAX loads. · d4b733e4
Jim Grosbach authored Apr 26, 2011
```
rdar://9326019

llvm-svn: 130234
```
d4b733e4

Add a TRI::getLargestLegalSuperClass hook to provide an upper limit on register class inflation. · 803a2000

Jakob Stoklund Olesen authored Apr 26, 2011

The hook will be used by the register allocator when recomputing register
classes after removing constraints.

Thumb1 code doesn't allow anything larger than tGPR, and x86 needs to ensure
that the spill size doesn't change.

llvm-svn: 130228

803a2000

Print all the moves at a given label instead of just the first one. · 80cb3cb1
Rafael Espindola authored Apr 26, 2011
```
Remove previous DwarfCFI hack.

llvm-svn: 130187
```
80cb3cb1

Let dwarf writer allocate extra space in the debug location expression. This... · cae2fbd6

Devang Patel authored Apr 26, 2011

Let dwarf writer allocate extra space in the debug location expression. This space, if requested, will be used for complex addresses of the Blocks' variables.

llvm-svn: 130178

cae2fbd6

Apr 25, 2011
- add a missed bitfield instcombine. · 6e298924
  Chris Lattner authored Apr 25, 2011
```
llvm-svn: 130137
```
  6e298924
- Lower BlockAddress node when relocation-model is static. · 0e7ee666
  Akira Hatanaka authored Apr 25, 2011
```
llvm-svn: 130131
```
  0e7ee666
- Remove some hard coded CR-LFs. Some of these were the entire files, one of · 9b73c8e2
  Chandler Carruth authored Apr 25, 2011
```
these was just one line of a file. Explicitly set the eol-style property on the
files to try and ensure this fix stays.

llvm-svn: 130125
```
  9b73c8e2
- Fix comment typo. Noticed by Liu. · 56ca6292
  Duncan Sands authored Apr 25, 2011
```
llvm-svn: 130120
```
  56ca6292
Apr 24, 2011
- Fix Target/ARM/Thumb1FrameLowering.h header guard. · 5519ff9d
  Sebastian Redl authored Apr 24, 2011
```
llvm-svn: 130097
```
  5519ff9d
Apr 23, 2011

Remove unused STL header includes. · 1a180156
Jay Foad authored Apr 23, 2011
```
llvm-svn: 130068
```
1a180156
Silence an overzealous uninitialized variable warning from GCC. · 3db05465
Benjamin Kramer authored Apr 23, 2011
```
llvm-svn: 130053
```
3db05465

Thumb2 and ARM add/subtract with carry fixes. · 0ed5778a

Andrew Trick authored Apr 23, 2011

Fixes Thumb2 ADCS and SBCS lowering: <rdar://problem/9275821>.
t2ADCS/t2SBCS are now pseudo instructions, consistent with ARM, so the
assembly printer correctly prints the 's' suffix.

Fixes Thumb2 adde -> SBC matching to check for live/dead carry flags.

Fixes the internal ARM machine opcode mnemonic for ADCS/SBCS.
Fixes ARM SBC lowering to check for live carry (potential bug).

llvm-svn: 130048

0ed5778a

whitespace · 1a1f8d46
Andrew Trick authored Apr 23, 2011
```
llvm-svn: 130046
```
1a1f8d46

Apr 22, 2011

Disassembly of A8.6.59 LDR (literal) Encoding T1 (16-bit thumb instruction) should · 57c89286
Johnny Chen authored Apr 22, 2011
```
print out ldr, not ldr.n.

rdar://problem/9267772

llvm-svn: 130008
```
57c89286

DAGCombine: fold "(zext x) == C" into "x == (trunc C)" if the trunc is lossless. · 341c11da

Benjamin Kramer authored Apr 22, 2011

On x86 this allows folding a load into the cmp, greatly reducing register pressure.
  movzbl	(%rdi), %eax
  cmpl	$47, %eax
->
  cmpb	$47, (%rdi)

This shaves 8k off gcc.o on i386. I'll leave applying the patch in README.txt to Chris :)

llvm-svn: 130005

341c11da
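
The "lossless" condition in 341c11da can be sketched for the i8-to-i32 case shown in the assembly above. The helper names here are hypothetical; the actual DAGCombine code works on arbitrary integer widths.

```cpp
#include <cassert>
#include <cstdint>

// True when truncating C to 8 bits and zero-extending it again gives C back,
// i.e. the truncation loses no information.
bool truncIsLossless(uint32_t C) {
  return static_cast<uint32_t>(static_cast<uint8_t>(C)) == C;
}

// Wide form: (zext i8 X to i32) == C
bool wideCompare(uint8_t X, uint32_t C) {
  return static_cast<uint32_t>(X) == C;
}

// Narrow form: X == (trunc C to i8); only valid when the truncation is lossless.
bool narrowCompare(uint8_t X, uint32_t C) {
  assert(truncIsLossless(C) && "fold is only valid for lossless truncation");
  return X == static_cast<uint8_t>(C);
}
```

For C = 47, as in the example, the truncation is lossless and both comparisons agree for every 8-bit X. For a constant such as 300 the check fails and the narrow form must not be used.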

Add asserts. · 3c39ec29
Devang Patel authored Apr 22, 2011
```
llvm-svn: 129995
```
3c39ec29

X86: Try to use a smaller encoding by transforming (X << C1) & C2 into (X &... · 4c816247

Benjamin Kramer authored Apr 22, 2011

X86: Try to use a smaller encoding by transforming (X << C1) & C2 into (X & (C2 >> C1)) << C1. (Part of PR5039)

This tends to happen a lot with bitfield code generated by clang. A simple example for x86_64 is
uint64_t foo(uint64_t x) { return (x&1) << 42; }
which used to compile into bloated code:
shlq $42, %rdi ## encoding: [0x48,0xc1,0xe7,0x2a]
movabsq $4398046511104, %rax ## encoding: [0x48,0xb8,0x00,0x00,0x00,0x00,0x00,0x04,0x00,0x00]
andq %rdi, %rax ## encoding: [0x48,0x21,0xf8]
ret ## encoding: [0xc3]

with this patch we can fold the immediate into the and:
andq $1, %rdi ## encoding: [0x48,0x83,0xe7,0x01]
movq %rdi, %rax ## encoding: [0x48,0x89,0xf8]
shlq $42, %rax ## encoding: [0x48,0xc1,0xe0,0x2a]
ret ## encoding: [0xc3]

It's possible to save another byte by using 'andl' instead of 'andq' but I currently see no way of doing
that without making this code even more complicated. See the TODOs in the code.

llvm-svn: 129990

4c816247
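
The identity behind 4c816247 can be checked with a small sketch on unsigned 64-bit values. The function names are made up for this example; C1 = 42 and C2 = 4398046511104 (1 << 42) are the constants from the foo example above.

```cpp
#include <cassert>
#include <cstdint>

// Original form: (X << C1) & C2. C2 can be a wide immediate (movabsq above).
uint64_t shiftThenMask(uint64_t X, unsigned C1, uint64_t C2) {
  assert(C1 < 64 && "shift amount must be smaller than the bit width");
  return (X << C1) & C2;
}

// Rewritten form: (X & (C2 >> C1)) << C1. The shifted-down mask is often small
// enough for a short immediate encoding (andq $1 above).
uint64_t maskThenShift(uint64_t X, unsigned C1, uint64_t C2) {
  assert(C1 < 64 && "shift amount must be smaller than the bit width");
  return (X & (C2 >> C1)) << C1;
}
```

The two forms agree because the low C1 bits of X << C1 are always zero, so the low C1 bits of C2 never affect the result.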

In Thumb2 mode, lower frame index references to: · c0d2004e

Evan Cheng authored Apr 22, 2011

add <rd>, sp, #<imm8>
ldr <rd>, [sp, #<imm8>]
when the offset from sp is a multiple of 4 in the range 0-1020.
This saves code size by utilizing 16-bit instructions.

rdar://9321541

llvm-svn: 129971

c0d2004e
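
The offset condition in c0d2004e amounts to a check that the offset fits an 8-bit immediate scaled by 4. A minimal sketch follows, with hypothetical helper names that do not come from the ARM backend:

```cpp
#include <cassert>
#include <cstdint>

// True when Offset can be encoded as #<imm8> * 4 in the 16-bit sp-relative
// add/ldr forms: non-negative, a multiple of 4, and at most 255 * 4 = 1020.
bool fitsSPImm8Scaled(int64_t Offset) {
  return Offset >= 0 && Offset <= 1020 && Offset % 4 == 0;
}

// The value stored in the 8-bit immediate field for a valid offset.
uint8_t encodeSPImm8Scaled(int64_t Offset) {
  assert(fitsSPImm8Scaled(Offset) && "offset needs the 32-bit encoding");
  return static_cast<uint8_t>(Offset / 4);
}
```

Offsets that fail the check (for example 6, 1024, or any negative value) would still need a 32-bit encoding.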

Compute the size of the FDE encoding instead of hard coding it. Update · 5395f44f
Rafael Espindola authored Apr 22, 2011
```
X8664_ELFTargetObjectFile::getFDEEncoding to match reality.

llvm-svn: 129959
```
5395f44f
Remove unused argument. · 6aea5926
Rafael Espindola authored Apr 21, 2011
```
llvm-svn: 129955
```
6aea5926
Fix DWARF description of Q registers. · 94ad6ac1
Devang Patel authored Apr 21, 2011
```
llvm-svn: 129952
```
94ad6ac1
Fix DWARF description of S registers. · 3712c14b
Devang Patel authored Apr 21, 2011
```
llvm-svn: 129947
```
3712c14b

Apr 21, 2011
- As per ARM docs, register Dx is described as DW_OP_regx(256+x) in DWARF. · 46bda61a
  Devang Patel authored Apr 21, 2011
```
llvm-svn: 129922
```
  46bda61a
- PTX: Expand useable register space · d74d88a8
  Justin Holewinski authored Apr 21, 2011
```
llvm-svn: 129913
```
  d74d88a8
- ptx: fix parameter ordering · 14c48e5d
  Che-Liang Chiou authored Apr 21, 2011
```
This patch depends on the prior fix r129908, which switches to std::find,
rather than std::binary_search, on an unordered array.

Patch by Dan Bailey

llvm-svn: 129909
```
  14c48e5d
- ptx: PTXMachineFunctionInfo no longer sorts registers and so should not use std::binary_search · cdc51569
  Che-Liang Chiou authored Apr 21, 2011
```
llvm-svn: 129908
```
  cdc51569
- Remove -use-divmod-libcall. Let targets opt in when they are available. · 5f1ba4cd
  Evan Cheng authored Apr 20, 2011
```
llvm-svn: 129884
```
  5f1ba4cd
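
The reasoning behind r129908 (cdc51569) is that std::binary_search is only specified for ranges ordered with respect to the searched value, while std::find works on any range. A minimal sketch, using a plain vector of register numbers as a stand-in for the PTX register lists (an assumption made for illustration):

```cpp
#include <algorithm>
#include <cassert>
#include <vector>

// Linear lookup: correct whether or not Regs is sorted.
bool containsUnsorted(const std::vector<unsigned> &Regs, unsigned Reg) {
  return std::find(Regs.begin(), Regs.end(), Reg) != Regs.end();
}

// Logarithmic lookup: only meaningful when Regs is sorted.
bool containsSorted(const std::vector<unsigned> &Regs, unsigned Reg) {
  assert(std::is_sorted(Regs.begin(), Regs.end()) &&
         "std::binary_search requires an ordered range");
  return std::binary_search(Regs.begin(), Regs.end(), Reg);
}
```

Once the container stops being sorted, the linear lookup is the safe choice. It trades logarithmic for linear search time, which is usually acceptable for short register lists.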
Apr 20, 2011

Revert r129846; it's breaking a buildbot. See · c93d399e

Eli Friedman authored Apr 20, 2011

http://google1.osuosl.org:8011/builders/llvm-x86_64-linux-checks/builds/825/steps/test.llvm.stage2/logs/st.ll

llvm-svn: 129869

c93d399e

Prefer cheap registers for busy live ranges. · 0e34c1df

Jakob Stoklund Olesen authored Apr 20, 2011

On the x86-64 and thumb2 targets, some registers are more expensive to encode
than others in the same register class.

Add a CostPerUse field to the TableGen register description, and make it
available from TRI->getCostPerUse. This represents the cost of a REX prefix or a
32-bit instruction encoding required by choosing a high register.

Teach the greedy register allocator to prefer cheap registers for busy live
ranges (as indicated by spill weight).

llvm-svn: 129864

0e34c1df
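
The idea of preferring cheap registers can be illustrated with a toy selection routine. This is not the greedy allocator's logic: the RegInfo struct and pickCheapest function are hypothetical, and only the CostPerUse name comes from the commit message.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Toy model of a physical register with its per-use encoding cost,
// e.g. 1 for a register that needs a REX prefix and 0 otherwise.
struct RegInfo {
  unsigned Id;
  unsigned CostPerUse;
};

// Returns the index of the candidate with the lowest CostPerUse.
// Ties keep the earliest candidate, preserving the allocation order.
std::size_t pickCheapest(const std::vector<RegInfo> &Candidates) {
  assert(!Candidates.empty() && "need at least one candidate register");
  std::size_t Best = 0;
  for (std::size_t I = 1; I < Candidates.size(); ++I)
    if (Candidates[I].CostPerUse < Candidates[Best].CostPerUse)
      Best = I;
  return Best;
}
```

In the real allocator this preference applies to busy live ranges as indicated by spill weight, because a register that is used often pays the extra encoding cost on every use.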

Excise unintended hunk in 129858. <rdar://problem/7662569> · 7850af6e
Stuart Hastings authored Apr 20, 2011
```
llvm-svn: 129862
```
7850af6e
ARM byval support. Will be enabled by another patch to the FE. <rdar://problem/7662569> · 45fe3c38
Stuart Hastings authored Apr 20, 2011
```
llvm-svn: 129858
```
45fe3c38

PTX: Add intrinsics to list of built-in intrinsics, which allows them to be · 7d8895e7

Justin Holewinski authored Apr 20, 2011

used by Clang.  To help Clang integration, the PTX target has been split
into two targets: ptx32 and ptx64, depending on the desired pointer size.

- Add GCCBuiltin class to all intrinsics
- Split PTX target into ptx32 and ptx64

llvm-svn: 129851

7d8895e7

ptx: add integer div and rem instruction · 6586f846
Che-Liang Chiou authored Apr 20, 2011
```
Patched by Dan Bailey

llvm-svn: 129848
```
6586f846