Commits · 9af7afcb7f834ff7a4b78a284051c53db984007c · Roger Ferrer / llvm-epi-0.8

Jan 30, 2011

Respect the -tail-dup-size command line option even when optimizing for size. · 9af7afcb

Jakob Stoklund Olesen authored Jan 30, 2011

This is similar to the -unroll-threshold option. There should be no change in
behavior when -tail-dup-size is not explicit on the llc command line.

llvm-svn: 124564

9af7afcb

Teach DAGCombine to fold fold (sra (trunc (sr x, c1)), c2) -> (trunc (sra x,... · 946e1522

Benjamin Kramer authored Jan 30, 2011

Teach DAGCombine to fold fold (sra (trunc (sr x, c1)), c2) -> (trunc (sra x, c1+c2) when c1 equals the amount of bits that are truncated off.

This happens all the time when a smul is promoted to a larger type.

On x86-64 we now compile "int test(int x) { return x/10; }" into
  movslq  %edi, %rax
  imulq $1717986919, %rax, %rax
  movq  %rax, %rcx
  shrq  $63, %rcx
  sarq  $34, %rax <- used to be "shrq $32, %rax; sarl $2, %eax"
  addl  %ecx, %eax

This fires 96 times in gcc.c on x86-64.

llvm-svn: 124559

946e1522

Jan 29, 2011
- Add the missing sub identity "A-(A-B) -> B" to DAGCombine. · 65bb14d3
  Benjamin Kramer authored Jan 29, 2011
```
This happens e.g. for code like "X - X%10" where we lower the modulo operation
to a series of multiplies and shifts that are then subtracted from X, leading to
this missed optimization.

llvm-svn: 124532
```
  65bb14d3
- Re-apply r124518 with fix. Watch out for invalidated iterator. · d983eba7
  Evan Cheng authored Jan 29, 2011
```
llvm-svn: 124526
```
  d983eba7
- Revert r124518. It broke Linux self-host. · 65b8ccf6
  Evan Cheng authored Jan 29, 2011
```
llvm-svn: 124522
```
  65b8ccf6
- Re-commit r124462 with fixes. Tail recursion elim will now dup ret into... · d4eff314
  Evan Cheng authored Jan 29, 2011
```
Re-commit r124462 with fixes. Tail recursion elim will now dup ret into unconditional predecessor to enable TCE on demand.

llvm-svn: 124518
```
  d4eff314
Jan 28, 2011
- Revert r124462. There are a few big regressions that I need to fix first. · aaa9606b
  Evan Cheng authored Jan 28, 2011
```
llvm-svn: 124478
```
  aaa9606b
- Fix build with stdcxx by using llvm::next. Patch by Joerg Sonnenberger! · 0af77fd4
  Nick Lewycky authored Jan 28, 2011
```
llvm-svn: 124472
```
  0af77fd4
- Print the visibility of declarations. · 6c17d548
  Rafael Espindola authored Jan 28, 2011
```
llvm-svn: 124468
```
  6c17d548
- - Stop simplifycfg from duplicating "ret" instructions into unconditional · 417fca86
  Evan Cheng authored Jan 28, 2011
```
  branches. PR8575, rdar://5134905, rdar://8911460.
- Allow codegen tail duplication to dup small return blocks after register
  allocation is done.

llvm-svn: 124462
```
  417fca86
Jan 27, 2011

Remove a temporary workaround for a lencod miscompile. Depends on the fix in r124442. · c0ca6760
Andrew Trick authored Jan 27, 2011
```
llvm-svn: 124443
```
c0ca6760

VirtRegRewriter fix: update kill flags, which are used by the scavenger. · 13bb644f

Andrew Trick authored Jan 27, 2011

rdar://problem/8893967: JM/lencod miscompile at -arch armv7 -mthumb -O3

Added ResurrectKill to remove kill flags after we decide to reused a
physical register. And (hopefully) ensure that we call it in all the
right places.

Sorry, I'm not checking in a unit test given that it's a miscompile I
can't reproduce easily with a toy example. Failures in the rewriter
depend on a series of heuristic decisions maked during one of the many
upstream phases in codegen. This case would require coercing regalloc
to generate a couple of rematerialzations in a way that causes the
scavenger to reuse the same register at just the wrong point.

The general way to test this is to implement kill flags
verification. Then we could have a simple, robust compile-only unit
test. That would be worth doing if the whole pass was not about to
disappear. At this point we focus verification work on the next
generation of regalloc.

llvm-svn: 124442

13bb644f

Speculatively revert r124380. · 1cec7554
Devang Patel authored Jan 27, 2011
```
llvm-svn: 124397
```
1cec7554
While legalizing SDValues do not drop SDDbgValues, trasfer them to new legal nodes. · 3b266a27
Devang Patel authored Jan 27, 2011
```
Take 2. This includes fix for dragonegg crash.

llvm-svn: 124380
```
3b266a27

Avoid modifying the OneClassForEachPhysReg map while iterating over it. · 2d69fb41

Bob Wilson authored Jan 27, 2011

Linear scan regalloc is currently assuming that any register aliased with
a member of a regclass must also be in at least one regclass. That is not
always true. For example, for X86, RIP is in a regclass but IP is not.
If you're unlucky, this can cause a crash by invalidating the iterator.

llvm-svn: 124365

2d69fb41

Try harder to not have unused variables. · a148c592
Matt Beaumont-Gay authored Jan 27, 2011
```
llvm-svn: 124350
```
a148c592
Opt-mode -Wunused-variable cleanup · 0cddbf2b
Matt Beaumont-Gay authored Jan 27, 2011
```
llvm-svn: 124346
```
0cddbf2b
Reapply 124301 · 92b7077f
Devang Patel authored Jan 27, 2011
```
llvm-svn: 124339
```
92b7077f

Jan 26, 2011
- Initialize variable to get rid of clang warning. · fb4ee9bb
  Bill Wendling authored Jan 26, 2011
```
llvm-svn: 124331
```
  fb4ee9bb
- Revert 124301. · b370bf32
  Devang Patel authored Jan 26, 2011
```
llvm-svn: 124327
```
  b370bf32
- Revert r124302 · 084e0628
  Devang Patel authored Jan 26, 2011
```
llvm-svn: 124320
```
  084e0628
- · bab5e6ed
  David Greene authored Jan 26, 2011
```
[AVX] Add INSERT_SUBVECTOR and support it on x86.  This provides a
default implementation for x86, going through the stack in a similr
fashion to how the codegen implements BUILD_VECTOR.  Eventually this
will get matched to VINSERTF128 if AVX is available.

llvm-svn: 124307
```
  bab5e6ed
- While legalizing SDValues do not drop SDDbgValues, trasfer them to new legal nodes. · a11210b1
  Devang Patel authored Jan 26, 2011
```
llvm-svn: 124302
```
  a11210b1
- Process valid SDDbgValues even if the node does not have any order assigned. · 9d4eb2f4
  Devang Patel authored Jan 26, 2011
```
llvm-svn: 124301
```
  9d4eb2f4
- Refactor. · 1448e7c8
  Devang Patel authored Jan 26, 2011
```
llvm-svn: 124300
```
  1448e7c8
- · b6f16119
  David Greene authored Jan 26, 2011
```
[AVX] Support EXTRACT_SUBVECTOR on x86.  This provides a default
implementation of EXTRACT_SUBVECTOR for x86, going through the stack
in a similr fashion to how the codegen implements BUILD_VECTOR.
Eventually this will get matched to VEXTRACTF128 if AVX is available.

llvm-svn: 124292
```
  b6f16119
- Rename member variables to follow the rest of LLVM. · b3089020
  Jakob Stoklund Olesen authored Jan 26, 2011
```
No functional change.

llvm-svn: 124257
```
  b3089020
- Provide an interface to transfer SDDbgValue from one SDNode to another. · efc6b16e
  Devang Patel authored Jan 25, 2011
```
llvm-svn: 124245
```
  efc6b16e
Jan 25, 2011

· 70f8e596

Devang Patel authored Jan 25, 2011

Resolve DanglingDbgValue of PHI nodes where the use follows dbg.value intrinisic.

llvm-svn: 124203

70f8e596

This assertion is too restrictive, it does not apply for dangling dbg value... · 04b649d4

Devang Patel authored Jan 25, 2011

This assertion is too restrictive, it does not apply for dangling dbg value nodes (nodes where dbg.value intrinsic preceds use of the value). 

llvm-svn: 124202

04b649d4

Jan 24, 2011
- Support printing exception section into the current one. This is the case when LSDASection is blank · b15beb2a
  Anton Korobeynikov authored Jan 24, 2011
```
llvm-svn: 124150
```
  b15beb2a
- Speculatively revert r124138. · 53347954
  Devang Patel authored Jan 24, 2011
```
llvm-svn: 124142
```
  53347954
- Resolve DanglingDbgValue of PHI nodes where the use follows dbg.value intrinisic. · 8cc5355c
  Devang Patel authored Jan 24, 2011
```
llvm-svn: 124138
```
  8cc5355c
- Temporarily workaround JM/lencod miscompile (SIGSEGV). · a293c49f
  Andrew Trick authored Jan 24, 2011
```
rdar://problem/8893967

llvm-svn: 124137
```
  a293c49f
Jan 23, 2011
- Add support for the --noexecstack option. · b3eca9bb
  Rafael Espindola authored Jan 23, 2011
```
llvm-svn: 124077
```
  b3eca9bb
- Null initialize a few variables flagged by · 3c4408ce
  Ted Kremenek authored Jan 23, 2011
```
clang's -Wuninitialized-experimental warning.
While these don't look like real bugs, clang's
-Wuninitialized-experimental analysis is stricter
than GCC's, and these fixes have the benefit
of being general nice cleanups.

llvm-svn: 124073
```
  3c4408ce
- Delay the creation of eh_frame so that the user can change the defaults. · 4b7b7fba
  Rafael Espindola authored Jan 23, 2011
```
Add support for SHT_X86_64_UNWIND.

llvm-svn: 124059
```
  4b7b7fba
- Remove more duplicated code. · 0e7e34e4
  Rafael Espindola authored Jan 23, 2011
```
llvm-svn: 124056
```
  0e7e34e4
- Remove duplicated code. · aea4958e
  Rafael Espindola authored Jan 23, 2011
```
llvm-svn: 124054
```
  aea4958e
Jan 21, 2011

Enable support for precise scheduling of the instruction selection · bd428ec5

Andrew Trick authored Jan 21, 2011

DAG. Disable using "-disable-sched-cycles".

For ARM, this enables a framework for modeling the cpu pipeline and
counting stalls. It also activates several heuristics to drive
scheduling based on the model. Scheduling is inherently imprecise at
this stage, and until spilling is improved it may defeat attempts to
schedule. However, this framework provides greater control over
tuning codegen.

Although the flag is not target-specific, it should have very little
affect on the default scheduler used by x86. The only two changes that
affect x86 are:
- scheduling a high-latency operation bumps the current cycle so independent
  operations can have their latency covered. i.e. two independent 4
  cycle operations can produce results in 4 cycles, not 8 cycles.
- Two operations with equal register pressure impact and no
  latency-based stalls on their uses will be prioritized by depth before height
  (height is irrelevant if no stalls occur in the schedule below this point).

llvm-svn: 123971

bd428ec5