Commits · 7aa71f94234d071e32f9ee819e726ec3370f3c93 · Roger Ferrer / llvm-epi-0.8

Feb 04, 2011
- whitespace · 3f924e4e
  Andrew Trick authored Feb 03, 2011
```
llvm-svn: 124827
```
  3f924e4e
Feb 02, 2011

Given a pair of floating point load and store, if there are no other uses of · d42641c6

Evan Cheng authored Feb 02, 2011

the load, then it may be legal to transform the load and store to integer
load and store of the same width.

This is done if the target specified the transformation as profitable. e.g.
On arm, this can transform:
vldr.32 s0, []
vstr.32 s0, []

to

ldr r12, []
str r12, []

rdar://8944252

llvm-svn: 124708

d42641c6

Feb 01, 2011
- Take Bill Wendling's suggestion for structuring a couple of asserts. · 29c8c8fe
  Matt Beaumont-Gay authored Feb 01, 2011
```
llvm-svn: 124688
```
  29c8c8fe
Jan 31, 2011
- Keep track of incoming argument's location while emitting LiveIns. · 56cc5fdf
  Devang Patel authored Jan 31, 2011
```
llvm-svn: 124611
```
  56cc5fdf
- Fix bug where ReduceLoadWidth was creating illegal ZEXTLOAD instructions. · 272e084b
  Richard Osborne authored Jan 31, 2011
```
llvm-svn: 124587
```
  272e084b
Jan 30, 2011

Teach DAGCombine to fold fold (sra (trunc (sr x, c1)), c2) -> (trunc (sra x,... · 946e1522

Benjamin Kramer authored Jan 30, 2011

Teach DAGCombine to fold fold (sra (trunc (sr x, c1)), c2) -> (trunc (sra x, c1+c2) when c1 equals the amount of bits that are truncated off.

This happens all the time when a smul is promoted to a larger type.

On x86-64 we now compile "int test(int x) { return x/10; }" into
  movslq  %edi, %rax
  imulq $1717986919, %rax, %rax
  movq  %rax, %rcx
  shrq  $63, %rcx
  sarq  $34, %rax <- used to be "shrq $32, %rax; sarl $2, %eax"
  addl  %ecx, %eax

This fires 96 times in gcc.c on x86-64.

llvm-svn: 124559

946e1522

Jan 29, 2011

Add the missing sub identity "A-(A-B) -> B" to DAGCombine. · 65bb14d3

Benjamin Kramer authored Jan 29, 2011

This happens e.g. for code like "X - X%10" where we lower the modulo operation
to a series of multiplies and shifts that are then subtracted from X, leading to
this missed optimization.

llvm-svn: 124532

65bb14d3

Jan 28, 2011
- Fix build with stdcxx by using llvm::next. Patch by Joerg Sonnenberger! · 0af77fd4
  Nick Lewycky authored Jan 28, 2011
```
llvm-svn: 124472
```
  0af77fd4
Jan 27, 2011
- Remove a temporary workaround for a lencod miscompile. Depends on the fix in r124442. · c0ca6760
  Andrew Trick authored Jan 27, 2011
```
llvm-svn: 124443
```
  c0ca6760
- Speculatively revert r124380. · 1cec7554
  Devang Patel authored Jan 27, 2011
```
llvm-svn: 124397
```
  1cec7554
- While legalizing SDValues do not drop SDDbgValues, trasfer them to new legal nodes. · 3b266a27
  Devang Patel authored Jan 27, 2011
```
Take 2. This includes fix for dragonegg crash.

llvm-svn: 124380
```
  3b266a27
- Try harder to not have unused variables. · a148c592
  Matt Beaumont-Gay authored Jan 27, 2011
```
llvm-svn: 124350
```
  a148c592
- Opt-mode -Wunused-variable cleanup · 0cddbf2b
  Matt Beaumont-Gay authored Jan 27, 2011
```
llvm-svn: 124346
```
  0cddbf2b
- Reapply 124301 · 92b7077f
  Devang Patel authored Jan 27, 2011
```
llvm-svn: 124339
```
  92b7077f
Jan 26, 2011
- Initialize variable to get rid of clang warning. · fb4ee9bb
  Bill Wendling authored Jan 26, 2011
```
llvm-svn: 124331
```
  fb4ee9bb
- Revert 124301. · b370bf32
  Devang Patel authored Jan 26, 2011
```
llvm-svn: 124327
```
  b370bf32
- Revert r124302 · 084e0628
  Devang Patel authored Jan 26, 2011
```
llvm-svn: 124320
```
  084e0628
- · bab5e6ed
  David Greene authored Jan 26, 2011
```
[AVX] Add INSERT_SUBVECTOR and support it on x86.  This provides a
default implementation for x86, going through the stack in a similr
fashion to how the codegen implements BUILD_VECTOR.  Eventually this
will get matched to VINSERTF128 if AVX is available.

llvm-svn: 124307
```
  bab5e6ed
- While legalizing SDValues do not drop SDDbgValues, trasfer them to new legal nodes. · a11210b1
  Devang Patel authored Jan 26, 2011
```
llvm-svn: 124302
```
  a11210b1
- Process valid SDDbgValues even if the node does not have any order assigned. · 9d4eb2f4
  Devang Patel authored Jan 26, 2011
```
llvm-svn: 124301
```
  9d4eb2f4
- Refactor. · 1448e7c8
  Devang Patel authored Jan 26, 2011
```
llvm-svn: 124300
```
  1448e7c8
- · b6f16119
  David Greene authored Jan 26, 2011
```
[AVX] Support EXTRACT_SUBVECTOR on x86.  This provides a default
implementation of EXTRACT_SUBVECTOR for x86, going through the stack
in a similr fashion to how the codegen implements BUILD_VECTOR.
Eventually this will get matched to VEXTRACTF128 if AVX is available.

llvm-svn: 124292
```
  b6f16119
- Provide an interface to transfer SDDbgValue from one SDNode to another. · efc6b16e
  Devang Patel authored Jan 25, 2011
```
llvm-svn: 124245
```
  efc6b16e
Jan 25, 2011

· 70f8e596

Devang Patel authored Jan 25, 2011

Resolve DanglingDbgValue of PHI nodes where the use follows dbg.value intrinisic.

llvm-svn: 124203

70f8e596

This assertion is too restrictive, it does not apply for dangling dbg value... · 04b649d4

Devang Patel authored Jan 25, 2011

This assertion is too restrictive, it does not apply for dangling dbg value nodes (nodes where dbg.value intrinsic preceds use of the value). 

llvm-svn: 124202

04b649d4

Jan 24, 2011
- Speculatively revert r124138. · 53347954
  Devang Patel authored Jan 24, 2011
```
llvm-svn: 124142
```
  53347954
- Resolve DanglingDbgValue of PHI nodes where the use follows dbg.value intrinisic. · 8cc5355c
  Devang Patel authored Jan 24, 2011
```
llvm-svn: 124138
```
  8cc5355c
- Temporarily workaround JM/lencod miscompile (SIGSEGV). · a293c49f
  Andrew Trick authored Jan 24, 2011
```
rdar://problem/8893967

llvm-svn: 124137
```
  a293c49f
Jan 23, 2011

Null initialize a few variables flagged by · 3c4408ce

Ted Kremenek authored Jan 23, 2011

clang's -Wuninitialized-experimental warning.
While these don't look like real bugs, clang's
-Wuninitialized-experimental analysis is stricter
than GCC's, and these fixes have the benefit
of being general nice cleanups.

llvm-svn: 124073

3c4408ce

Jan 21, 2011

Enable support for precise scheduling of the instruction selection · bd428ec5

Andrew Trick authored Jan 21, 2011

DAG. Disable using "-disable-sched-cycles".

For ARM, this enables a framework for modeling the cpu pipeline and
counting stalls. It also activates several heuristics to drive
scheduling based on the model. Scheduling is inherently imprecise at
this stage, and until spilling is improved it may defeat attempts to
schedule. However, this framework provides greater control over
tuning codegen.

Although the flag is not target-specific, it should have very little
affect on the default scheduler used by x86. The only two changes that
affect x86 are:
- scheduling a high-latency operation bumps the current cycle so independent
  operations can have their latency covered. i.e. two independent 4
  cycle operations can produce results in 4 cycles, not 8 cycles.
- Two operations with equal register pressure impact and no
  latency-based stalls on their uses will be prioritized by depth before height
  (height is irrelevant if no stalls occur in the schedule below this point).

llvm-svn: 123971

bd428ec5

Convert -enable-sched-cycles and -enable-sched-hazard to -disable · 47ff14b0

Andrew Trick authored Jan 21, 2011

flags. They are still not enable in this revision.

Added TargetInstrInfo::isZeroCost() to fix a fundamental problem with
the scheduler's model of operand latency in the selection DAG.

Generalized unit tests to work with sched-cycles.

llvm-svn: 123969

47ff14b0

Jan 20, 2011

My editor's indent went crazy. Fix. · 37c4a8be
Eric Christopher authored Jan 20, 2011
```
llvm-svn: 123909
```
37c4a8be

Expand invalid return values for umulo and smulo. Handle these similarly · 785db078

Eric Christopher authored Jan 20, 2011

to add/sub by doing the normal operation and then checking for overflow
afterwards. This generally relies on the DAG handling the later invalid
operations as well.

Fixes the 64-bit part of rdar://8622122 and rdar://8774702.

llvm-svn: 123908

785db078

Selection DAG scheduler register pressure heuristic fixes. · 2cd1f0be

Andrew Trick authored Jan 20, 2011

Added a check for already live regs before claiming HighRegPressure.
Fixed a few cases of checking the wrong number of successors.
Added some tracing until these heuristics are better understood.

llvm-svn: 123892

2cd1f0be

Use only one API at a time. · b2139f65
Eric Christopher authored Jan 20, 2011
```
llvm-svn: 123866
```
b2139f65

If we can, lower the multiply part of a umulo/smulo call to a libcall · bb14f656

Eric Christopher authored Jan 20, 2011

with an invalid type then split the result and perform the overflow check
normally.

Fixes the 32-bit parts of rdar://8622122 and rdar://8774702.

llvm-svn: 123864

bb14f656

Jan 18, 2011
- Remove unused variables found by gcc-4.6's -Wunused-but-set-variable. · 249fcd44
  Jeffrey Yasskin authored Jan 18, 2011
```
llvm-svn: 123707
```
  249fcd44
- Remove checking that prevented overlapping CALLSEQ_START/CALLSEQ_END · 4fa832aa
  Stuart Hastings authored Jan 18, 2011
```
ranges, add legalizer support for nested calls.  Necessary for ARM
byval support.  Radar 7662569.

llvm-svn: 123704
```
  4fa832aa
Jan 17, 2011

Fix an off-by-one error in ctpop combining. · 45d183cc
Benjamin Kramer authored Jan 17, 2011
```
llvm-svn: 123664
```
45d183cc

Add a DAGCombine to turn (ctpop x) u< 2 into (x & x-1) == 0. · 24c5184d

Benjamin Kramer authored Jan 17, 2011

This shaves off 4 popcounts from the hacked 186.crafty source.

This is enabled even when a native popcount instruction is available. The
combined code is one operation longer but it should be faster nevertheless.

llvm-svn: 123621

24c5184d