Commits · f04cbeb3570a068dfd15399325d6c59dbdcbbd36 · Roger Ferrer / llvm-epi-0.8

Jan 31, 2013
- [PEI] Pass the frame index operand number to the eliminateFrameIndex function. · df782d22
  Chad Rosier authored Jan 31, 2013
```
Each target implementation was needlessly recomputing the index.
Part of rdar://13076458

llvm-svn: 174083
```
  df782d22
- Whitespace. · 258c867c
  Eric Christopher authored Jan 31, 2013
```
llvm-svn: 174009
```
  258c867c
- Check and allow floating point registers to select the size of the · 4e3e94c1
  Eric Christopher authored Jan 31, 2013
```
register for inline asm. This conforms to how gcc allows for effective
casting of inputs into gprs (fprs is already handled).

llvm-svn: 174008
```
  4e3e94c1
Jan 30, 2013
- Restrict sin/cos optimization to 64-bit only for now. 32-bit is a bit messy and less critical. · d2ca4e2e
  Evan Cheng authored Jan 30, 2013
```
llvm-svn: 173987
```
  d2ca4e2e
Jan 29, 2013

Remove dead code. · 27e41c9f
Evan Cheng authored Jan 29, 2013
```
llvm-svn: 173812
```
27e41c9f
Fix typo in X86BaseInfo.h that I introduced in r157818. · 5deecd90
Hans Wennborg authored Jan 29, 2013
```
llvm-svn: 173798
```
5deecd90
Merge SSE and AVX shuffle instructions in the comment printer. · c048154b
Craig Topper authored Jan 29, 2013
```
llvm-svn: 173777
```
c048154b

Teach SDISel to combine fsin / fcos into a fsincos node if the following · 0e88c7d8

Evan Cheng authored Jan 29, 2013

conditions are met:
1. They share the same operand and are in the same BB.
2. Both outputs are used.
3. The target has a native instruction that maps to ISD::FSINCOS node or
   the target provides a sincos library call.

Implemented the generic optimization in sdisel and enabled it for
Mac OSX. Also added an additional optimization for x86_64 Mac OSX by
using an alternative entry point __sincos_stret which returns the two
results in xmm0 / xmm1.

rdar://13087969
PR13204

llvm-svn: 173755

0e88c7d8

Jan 28, 2013
- Fix 256-bit PALIGNR comment decoding to understand that it works on independent 256-bit lanes. · 5c683972
  Craig Topper authored Jan 28, 2013
```
llvm-svn: 173674
```
  5c683972
- Add missing break in 256-bit palignr comment printing. No test case yet... · 71d99ffe
  Craig Topper authored Jan 28, 2013
```
Add missing break in 256-bit palignr comment printing. No test case yet because the comment itself is still wrong.

llvm-svn: 173669
```
  71d99ffe
- Fix inconsistent usage of PALIGN and PALIGNR when referring to the same instruction. · 8fb09f0a
  Craig Topper authored Jan 28, 2013
```
llvm-svn: 173667
```
  8fb09f0a
Jan 26, 2013

X86: Decode PALIGN operands so I don't have to do it in my head. · 6a935965
Benjamin Kramer authored Jan 26, 2013
```
llvm-svn: 173572
```
6a935965

X86: Do splat promotion later, so the optimizer can chew on it first. · 99c68dd9

Benjamin Kramer authored Jan 26, 2013

This catches many cases where we can emit a more efficient shuffle for a
specific mask or when the mask contains undefs. Once the splat is lowered to
unpacks we can't do that anymore.

There is a possibility of moving the promotion after pshufb matching, but I'm
not sure if pshufb with a mask loaded from memory is faster than 3 shuffles, so
I avoided that for now.

llvm-svn: 173569

99c68dd9

Jan 25, 2013

In this patch, we teach X86_64TargetMachine that it has a ILP32 · 597fc123

Eli Bendersky authored Jan 25, 2013

(defined by the x32 ABI) mode, in which case its pointers are 32-bits
in size. This knowledge is also added to X86RegisterInfo that now
returns the appropriate registers in getPointerRegClass.

There are many outcomes to this change. In order to keep the patches
separate and manageable, we start by focusing on some simple testable
cases. The patch adds a test with passing a pointer to a function -
focusing on the difference between the two data models for x86-64.
Another test is added for handling of 'sret' arguments (and
functionality is added in X86ISelLowering to make it work).

A note on naming: the "x32 ABI" document refers to the AMD64
architecture (in LLVM it's distinguished by being is64Bits() in the
x86 subtarget) with two variations: the LP64 (default) data model, and
the ILP32 data model. This patch adds predicates to the subtarget
which are consistent with this naming scheme.

llvm-svn: 173503

597fc123

Moving Cost Tables up to share with other targets · d4c392e6
Renato Golin authored Jan 24, 2013
```
llvm-svn: 173382
```
d4c392e6

Jan 22, 2013

Fix an issue of pseudo atomic instruction DAG schedule · 3dffc5e2

Michael Liao authored Jan 22, 2013

- Add list of physical registers clobbered in pseudo atomic insts
  Physical registers are clobbered when pseudo atomic instructions are
  expanded. Add them in clobber list to prevent DAG scheduler to
  mis-schedule them after these insns are declared side-effect free.
- Add test case from Michael Kuperstein <michael.m.kuperstein@intel.com>

llvm-svn: 173200

3dffc5e2

X86: Make sure we account for the FMA4 register immediate value, otherwise... · fee7d21a

Benjamin Kramer authored Jan 22, 2013

X86: Make sure we account for the FMA4 register immediate value, otherwise rip-rel relocations will be off by one byte.

PR15040.

llvm-svn: 173176

fee7d21a

Initial patch for x32 ABI support. · 0893e107

Eli Bendersky authored Jan 22, 2013

Add the x32 environment kind to the triple, and separate the concept of
pointer size and callee save stack slot size, since they're not equal
on x32.

llvm-svn: 173175

0893e107

Make APFloat constructor require explicit semantics. · 29178a34

Tim Northover authored Jan 22, 2013

Previously we tried to infer it from the bit width size, with an added
IsIEEE argument for the PPC/IEEE 128-bit case, which had a default
value. This default value allowed bugs to creep in, where it was
inappropriate.

llvm-svn: 173138

29178a34

Jan 21, 2013
- Use <0 checks in place of ==-1 because it results in simpler code. · 66163a35
  Craig Topper authored Jan 21, 2013
```
llvm-svn: 173010
```
  66163a35
- Use MVT instead of EVT in LowerVECTOR_SHUFFLEtoBlend. · 9b29486f
  Craig Topper authored Jan 21, 2013
```
llvm-svn: 173009
```
  9b29486f
- Remove trailing whitespace. · 32c5406d
  Craig Topper authored Jan 21, 2013
```
llvm-svn: 173008
```
  32c5406d
- Fix some 80 column violations. · 5c84c25b
  Craig Topper authored Jan 21, 2013
```
llvm-svn: 173006
```
  5c84c25b
- Make helper method static. · 2cd37589
  Craig Topper authored Jan 21, 2013
```
llvm-svn: 173005
```
  2cd37589
Jan 20, 2013
- Convert more EVT's to MVT's in the lowering methods. · cf939779
  Craig Topper authored Jan 20, 2013
```
llvm-svn: 172995
```
  cf939779
- Capitalize lowerTRUNCATE so that it matches the other lower functions in this... · e65a08be
  Craig Topper authored Jan 20, 2013
```
Capitalize lowerTRUNCATE so that it matches the other lower functions in this file despite it not matching coding standards.

llvm-svn: 172994
```
  e65a08be
- Revert CostTable algorithm, will re-write · e1fb0593
  Renato Golin authored Jan 20, 2013
```
llvm-svn: 172992
```
  e1fb0593
- Make LowerVSETCC a static function and use MVT instead of EVT. · ce61fdf0
  Craig Topper authored Jan 20, 2013
```
llvm-svn: 172969
```
  ce61fdf0
- · 9450fcff
  Nadav Rotem authored Jan 20, 2013
```
Revert 172708.

The optimization handles esoteric cases but adds a lot of complexity both to the X86 backend and to other backends.
This optimization disables an important canonicalization of chains of SEXT nodes and makes SEXT and ZEXT asymmetrical.
Disabling the canonicalization of consecutive SEXT nodes into a single node disables other DAG optimizations that assume
that there is only one SEXT node. The AVX mask optimizations is one example. Additionally this optimization does not update the cost model.

llvm-svn: 172968
```
  9450fcff
- Make some helper methods static. · 9976974c
  Craig Topper authored Jan 20, 2013
```
llvm-svn: 172936
```
  9976974c
- Remove DebugLoc argument from static function. It can easily be obtained from the SVOp passed in. · 4ac87da5
  Craig Topper authored Jan 20, 2013
```
llvm-svn: 172935
```
  4ac87da5
- Use MVT instead of EVT in more instruction lowering code. · 3da6507c
  Craig Topper authored Jan 20, 2013
```
llvm-svn: 172933
```
  3da6507c
- Use MVT instead of EVT in more of the shuffle lowering code. · 53c7fbab
  Craig Topper authored Jan 19, 2013
```
llvm-svn: 172930
```
  53c7fbab
- Capitalize LowerVectorIntExtend to be consistent with all the other lower functions in this file. · bb772d27
  Craig Topper authored Jan 19, 2013
```
llvm-svn: 172927
```
  bb772d27
Jan 19, 2013
- On Sandybridge split unaligned 256bit stores into two xmm-sized stores. · 7b3120b9
  Nadav Rotem authored Jan 19, 2013
```
llvm-svn: 172894
```
  7b3120b9
- Use MVT instead of EVT when computing shuffle immediates since they can only... · 84b01120
  Craig Topper authored Jan 19, 2013
```
Use MVT instead of EVT when computing shuffle immediates since they can only be for legal types. Keeps compiler from generating unneeded checks and handling for extended types.

llvm-svn: 172893
```
  84b01120
- On Sandybridge loading unaligned 256bits using two XMM loads (vmovups and... · 74312112
  Nadav Rotem authored Jan 18, 2013
```
On Sandybridge loading unaligned 256bits using two XMM loads (vmovups and vinsertf128) is faster than using a single vmovups instruction.

llvm-svn: 172868
```
  74312112
Jan 18, 2013
- Calculate vector element size more directly for VINSERTF128/VEXTRACTF128... · 1cb8aa58
  Craig Topper authored Jan 18, 2013
```
Calculate vector element size more directly for VINSERTF128/VEXTRACTF128 immediate handling. Also use MVT since this only called on legal types during pattern matching.

llvm-svn: 172797
```
  1cb8aa58
- Minor formatting fix. No functional change. · e938138d
  Craig Topper authored Jan 18, 2013
```
llvm-svn: 172795
```
  e938138d
- Spelling fix: extened->extended. Trailing whitespace in same function. · 908f7d14
  Craig Topper authored Jan 18, 2013
```
llvm-svn: 172793
```
  908f7d14