Commits · cf10f08825ff735c445286f47fcf19c98ca1e33e · Roger Ferrer / llvm-epi-0.8

Dec 20, 2011

64-bit data directive. · cf10f088
Akira Hatanaka authored Dec 20, 2011
```
llvm-svn: 147005
```
cf10f088
32-to-64-bit sext_inreg pattern. · 494fdf14
Akira Hatanaka authored Dec 20, 2011
```
llvm-svn: 147004
```
494fdf14
Add code in MipsDAGToDAGISel for selecting constant +0.0. · dac1d48d
Akira Hatanaka authored Dec 20, 2011
```
MIPS64 can generate constant +0.0 with a single DMTC1 instruction. 

llvm-svn: 146999
```
dac1d48d

Heed spill slot alignment on ARM. · b95c102c

Jakob Stoklund Olesen authored Dec 20, 2011

Use the spill slot alignment as well as the local variable alignment to
determine when the stack needs to be realigned. This works now that the
ARM target can always realign the stack by using a base pointer.

Still respect the ARMBaseRegisterInfo::canRealignStack() function
vetoing a realigned stack.  Don't use aligned spill code in that case.

llvm-svn: 146997

b95c102c

ARM target code clean up. Check for iOS, not Darwin where it makes sense. · 68132d80
Evan Cheng authored Dec 20, 2011
```
llvm-svn: 146981
```
68132d80

This is the second fix related to VZEXT_MOVL node. · ec7e6e09

Elena Demikhovsky authored Dec 20, 2011

The failure that I see in the current version is:

LLVM ERROR: Cannot select: 0x18b8f70: v4i64 = X86ISD::VZEXT_MOVL 0x18beee0 [ID=14]
  0x18beee0: v4i64 = insert_subvector 0x18b8c70, 0x18b9170, 0x18b9570 [ID=13]
    0x18b8c70: v4i64 = insert_subvector 0x18b9870, 0x18bf4e0, 0x18b9970 [ID=12]
      0x18b9870: v4i64 = undef [ID=4]
      0x18bf4e0: v2i64 = bitcast 0x18bf3e0 [ID=10]
        0x18bf3e0: v4i32 = BUILD_VECTOR 0x18b9770, 0x18b9770, 0x18b9770, 0x18b9770 [ID=8]
          0x18b9770: i32 = TargetConstant<0> [ID=6]
          0x18b9770: i32 = TargetConstant<0> [ID=6]
          0x18b9770: i32 = TargetConstant<0> [ID=6]
          0x18b9770: i32 = TargetConstant<0> [ID=6]
      0x18b9970: i32 = Constant<0> [ID=3]
    0x18b9170: v2i64 = undef [ORD=1] [ID=1]
    0x18b9570: i32 = Constant<2> [ID=5]

llvm-svn: 146975

ec7e6e09

Begin teaching the X86 target how to efficiently codegen patterns that · 24680c24

Chandler Carruth authored Dec 20, 2011

use the zero-undefined variants of CTTZ and CTLZ. These are just simple
patterns for now, there is more to be done to make real world code using
these constructs be optimized and codegen'ed properly on X86.

The existing tests are spiffed up to check that we no longer generate
unnecessary cmov instructions, and that we generate the very important
'xor' to transform bsr which counts the index of the most significant
one bit to the number of leading (most significant) zero bits. Also they
now check that when the variant with defined zero result is used, the
cmov is still produced.

llvm-svn: 146974

24680c24

Mark ARM eh_sjlj_dispatchsetup as clobbering all registers. Radar 10567930. · 75f12cc3

Bob Wilson authored Dec 20, 2011

We used to rely on the *eh_sjlj_setjmp instructions to mark that a function
with setjmp/longjmp exception handling clobbers all the registers. But with
the recent reorganization of ARM EH, those eh_sjlj_setjmp instructions are
expanded away earlier, before PEI can see them to determine what registers to
save and restore. Mark the dispatchsetup instruction in the same way, since
that instruction cannot be expanded early. This also more accurately reflects
when the registers are clobbered.

llvm-svn: 146949

75f12cc3

Move tests to FileCheck. · 3bfaefe9
Evan Cheng authored Dec 19, 2011
```
llvm-svn: 146923
```
3bfaefe9

Dec 19, 2011

Add a test case for r146900. · 37c45db1
Akira Hatanaka authored Dec 19, 2011
```
llvm-svn: 146901
```
37c45db1
Add patterns for matching immediates whose lower 16-bit is cleared. These · db47e0c4
Akira Hatanaka authored Dec 19, 2011
```
patterns emit a single LUi instruction instead of a pair of LUi and ORi.

llvm-svn: 146900
```
db47e0c4
Remove definitions of double word shift plus 32 instructions. Assembler or · 2a232d81
Akira Hatanaka authored Dec 19, 2011
```
direct-object emitter should emit the appropriate shift instruction depending
on the shift amount.

llvm-svn: 146893
```
2a232d81

Remove the restriction on the first operand of the add node in SelectAddr. · 3c9f3363

Akira Hatanaka authored Dec 19, 2011

This change reduces the number of instructions generated.

For example, 
(load (add (sub $n0, $n1), (MipsLo got(s))))

results in the following sequence of instructions:
1. sub $n2, $n0, $n1
2. lw got(s)($n2)

Previously, three instructions were needed.
1. sub $n2, $n0, $n1
2. addiu $n3, $n2, got(s)
3. lw 0($n3)

llvm-svn: 146888

3c9f3363

Dec 17, 2011
- Fix a CPSR liveness tracking bug introduced when I converted IT block to bundle. · 903231bc
  Evan Cheng authored Dec 17, 2011
```
llvm-svn: 146805
```
  903231bc
- Make sure that the lower bits on the VSELECT condition are properly set. · da07b3ad
  Lang Hames authored Dec 17, 2011
```
llvm-svn: 146800
```
  da07b3ad
- Fix off-by-one error in bucket sort. · 9790187b
  Jakob Stoklund Olesen authored Dec 16, 2011
```
The bad sorting caused a misaligned basic block when building 176.vpr in
ARM mode.

<rdar://problem/10594653>

llvm-svn: 146767
```
  9790187b
Dec 16, 2011
- Hexagon: Fix a nasty order-of-initialization bug. · 9ca2e729
  Benjamin Kramer authored Dec 16, 2011
```
Reenable the tests.

llvm-svn: 146750
```
  9ca2e729
- Don't try to match 'unpackl/h v, v' for 32xi8 and 16xi16 when only AVX1 is... · a4d411cb
  Craig Topper authored Dec 16, 2011
```
Don't try to match 'unpackl/h v, v' for 32xi8 and 16xi16 when only AVX1 is supported. Fix 'unpackh v, v' for 256-bit types to understand 128-bit lanes.

llvm-svn: 146726
```
  a4d411cb
Dec 15, 2011
- Add missing zmovl AVX patterns which were causing crashes. · 41dbf59e
  Chad Rosier authored Dec 15, 2011
```
Patch by Elena Demikhovsky <elena.demikhovsky@intel.com>!

llvm-svn: 146689
```
  41dbf59e
- Fix assert in LowerBUILD_VECTOR for v16i16 type on AVX. · 75ed9dcb
  Chad Rosier authored Dec 15, 2011
```
Patch by Elena Demikhovsky <elena.demikhovsky@intel.com>!

llvm-svn: 146684
```
  75ed9dcb
- Set specific target cpu for testcase. · 918f976e
  Lang Hames authored Dec 15, 2011
```
llvm-svn: 146678
```
  918f976e
- Added test case for r146671. · 2d6d3a2f
  Lang Hames authored Dec 15, 2011
```
llvm-svn: 146675
```
  2d6d3a2f
- Add a test case to make sure that the nop really does follow the bl on ppc64 elf · 750366f0
  Hal Finkel authored Dec 15, 2011
```
llvm-svn: 146666
```
  750366f0
- Don't try to form FGETSIGN after legalization; it is possible in some cases,... · 2ec82496
  Eli Friedman authored Dec 15, 2011
```
Don't try to form FGETSIGN after legalization; it is possible in some cases, but the existing code can't do it correctly. PR11570.

llvm-svn: 146630
```
  2ec82496
- Add support for lowering fneg when AVX is enabled. · 1940baa7
  Chad Rosier authored Dec 15, 2011
```
rdar://10566486

llvm-svn: 146625
```
  1940baa7
- Do not sink instruction, if it is not profitable. · c2686886
  Devang Patel authored Dec 14, 2011
```
On ARM, peephole optimization for ABS creates a trivial cfg triangle which tempts machine sink to sink instructions in code which is really straight line code. Sometimes this sinking may alter register allocator input such that use and def of a reg is divided by a branch in between, which may result in extra spills. Now mahine sink avoids sinking if final sink destination is post dominator.

Radar 10266272.

llvm-svn: 146604
```
  c2686886
Dec 14, 2011

Add support for local dynamic TLS model in LowerGlobalTLSAddress. Direct object · bff84e19
Akira Hatanaka authored Dec 14, 2011
```
emission is not supported yet, but a patch that adds the support should follow
soon.

llvm-svn: 146572
```
bff84e19

- Add MachineInstrBundle.h and MachineInstrBundle.cpp. This includes a function · 7fae11b2

Evan Cheng authored Dec 14, 2011

  to finalize MI bundles (i.e. add BUNDLE instruction and computing register def
  and use lists of the BUNDLE instruction) and a pass to unpack bundles.
- Teach more of MachineBasic and MachineInstr methods to be bundle aware.
- Switch Thumb2 IT block to MI bundles and delete the hazard recognizer hack to
  prevent IT blocks from being broken apart.

llvm-svn: 146542

7fae11b2

Add newline at EOF. · 4020ae75
Chad Rosier authored Dec 14, 2011
```
llvm-svn: 146538
```
4020ae75

Dec 13, 2011
- [fast-isel] Unaligned loads of floats are not supported. Therefore, convert to a regular · 563de603
  Chad Rosier authored Dec 13, 2011
```
load and then move the result from a GPR to a FPR.

llvm-svn: 146502
```
  563de603
- Move direct object emitter test to directory test/MC/Mips. Rename it to · 341850fd
  Akira Hatanaka authored Dec 13, 2011
```
elf-relsym.ll.

llvm-svn: 146470
```
  341850fd
- Relocation against a symbol, instead of against section. We had some extreme · e41963ce
  Akira Hatanaka authored Dec 13, 2011
```
test cases where there were a lot of relocations applied relative to a large
rodata section. Gas would create a symbol for each of these whereas we would
be relative to the beginning of the rodata section. This change mimics what
gas does.

Patch by Jack Carter.

llvm-svn: 146468
```
  e41963ce
- Temporarily disable Hexagon tests. They are failing on OS X · 525ca5fc
  Tony Linthicum authored Dec 13, 2011
```
llvm-svn: 146455
```
  525ca5fc
Dec 12, 2011

Test case for r146432 by Jack Carter. · 9e5908ae
Akira Hatanaka authored Dec 12, 2011
```
llvm-svn: 146433
```
9e5908ae

Implement 'e' and 'f' modifiers for Neon inline asm. <rdar://problem/10551006 > · fadc2c83

Bob Wilson authored Dec 12, 2011

These modifiers simply select either the low or high D subregister of a Neon
Q register. I've also removed the unimplemented 'p' modifier, which turns out
to be a bit different than the comment here suggests and as far as I can tell
was only intended for internal use in Apple's version of gcc.

llvm-svn: 146417

fadc2c83

Hexagon backend support · 1213a7a5
Tony Linthicum authored Dec 12, 2011
```
llvm-svn: 146412
```
1213a7a5

Manually upgrade the test suite to specify the flag to cttz and ctlz. · 6b0e34c4

Chandler Carruth authored Dec 12, 2011

I followed three heuristics for deciding whether to set 'true' or
'false':

- Everything target independent got 'true' as that is the expected
  common output of the GCC builtins.
- If the target arch only has one way of implementing this operation,
  set the flag in the way that exercises the most of codegen. For most
  architectures this is also the likely path from a GCC builtin, with
  'true' being set. It will (eventually) require lowering away that
  difference, and then lowering to the architecture's operation.
- Otherwise, set the flag differently dependending on which target
  operation should be tested.

Let me know if anyone has any issue with this pattern or would like
specific tests of another form. This should allow the x86 codegen to
just iteratively improve as I teach the backend how to differentiate
between the two forms, and everything else should remain exactly the
same.

llvm-svn: 146370

6b0e34c4

Dec 11, 2011

Fixed bug 9905: Failure in code selection for llvm intrinsics sqrt/exp (fix... · 46837409

Stepan Dyatkovskiy authored Dec 11, 2011

Fixed bug 9905: Failure in code selection for llvm intrinsics sqrt/exp (fix for FSQRT, FSIN, FCOS, FPOWI, FPOW, FLOG, FLOG2, FLOG10, FEXP, FEXP2). Third attempt: simplified checks in test for armv7-apple-darwin11.

llvm-svn: 146341

46837409

Dec 10, 2011
- Revert associate SelectInsertValue test as well. · 1c468af8
  Chad Rosier authored Dec 10, 2011
```
llvm-svn: 146332
```
  1c468af8
- Revert r146322 to appease buildbots. Original commit message: · 6641294e
  Chad Rosier authored Dec 10, 2011
```
Fixed bug 9905: Failure in code selection for llvm intrinsics sqrt/exp (fix for
FSQRT, FSIN, FCOS, FPOWI, FPOW, FLOG, FLOG2, FLOG10, FEXP, FEXP2). Second
attempt.

llvm-svn: 146328
```
  6641294e