Commits · 75d9d5159e70b9255a65aed45fd1adefde0be7c8 · Roger Ferrer / llvm-epi-0.8

Aug 07, 2012

Add trace accessor methods, implement primitive if-conversion heuristic. · 75d9d515

Jakob Stoklund Olesen authored Aug 07, 2012

Compare the critical paths of the two traces through an if-conversion
candidate. If the difference is larger than the branch brediction
penalty, reject the if-conversion. If would never pay.

llvm-svn: 161433

75d9d515

Tidy up a bit. · af9aec0c
Jim Grosbach authored Aug 07, 2012
```
llvm-svn: 161430
```
af9aec0c

The dominance computation already has logic for computing if an edge dominates · 59564079

Rafael Espindola authored Aug 07, 2012

a use or a BB, but it is inline in the handling of the invoke instruction.

This patch refactors it so that it can be used in other cases. For example, in

define i32 @f(i32 %x) {
bb0:
  %cmp = icmp eq i32 %x, 0
  br i1 %cmp, label %bb2, label %bb1
bb1:
  br label %bb2
bb2:
  %cond = phi i32 [ %x, %bb0 ], [ 0, %bb1 ]
  %foo = add i32 %cond, %x
  ret i32 %foo
}

GVN should be able to replace %x with 0 in any use that is dominated by the
true edge out of bb0. In the above example the only such use is the one in
the phi.

llvm-svn: 161429

59564079

Add a comment about mftb vs. mfspr on PPC. · 895a5f5d
Hal Finkel authored Aug 07, 2012
```
Thanks to Alex Rosenberg for the suggestion.

llvm-svn: 161428
```
895a5f5d

Fix the representation of debug line table in DebugInfo LLVM library, · 947228c4

Alexey Samsonov authored Aug 07, 2012

and "instruction address -> file/line" lookup.

Instead of plain collection of rows, debug line table for compilation unit is now
treated as the number of row ranges, describing sequences (series of contiguous machine
instructions). The sequences are not always listed in the order of increasing
address, so previously used std::lower_bound() sometimes produced wrong results.
Now the instruction address lookup consists of two stages: finding the correct
sequence, and searching for address in range of rows for this sequence.

llvm-svn: 161414

947228c4

PR13095: Give an inline cost bonus to functions using byval arguments. · c99d0e91

Benjamin Kramer authored Aug 07, 2012

We give a bonus for every argument because the argument setup is not needed
anymore when the function is inlined. With this patch we interpret byval
arguments as a compact representation of many arguments. The byval argument
setup is implemented in the backend as an inline memcpy, so to model the
cost as accurately as possible we take the number of pointer-sized elements
in the byval argument and give a bonus of 2 instructions for every one of
those. The bonus is capped at 8 elements, which is the number of stores
at which the x86 backend switches from an expanded inline memcpy to a real
memcpy. It would be better to use the real memcpy threshold from the backend,
but it's not available via TargetData.

This change brings the performance of c-ray in line with gcc 4.7. The included
test case tries to reproduce the c-ray problem to catch regressions for this
benchmark early, its performance is dominated by the inline decision of a
specific call.

This only has a small impact on most code, more on x86 and arm than on x86_64
due to the way the ABI works. When building LLVM for x86 it gives a small
inline cost boost to virtually any function using StringRef or STL allocators,
but only a 0.01% increase in overall binary size. The size of gcc compiled by
clang actually shrunk by a couple bytes with this patch applied, but not
significantly.

llvm-svn: 161413

c99d0e91

Fix PR13412, a nasty miscompile due to the interleaved · 2f6cf488

Chandler Carruth authored Aug 07, 2012

instsimplify+inline strategy.

The crux of the problem is that instsimplify was reasonably relying on
an invariant that is true within any single function, but is no longer
true mid-inline the way we use it. This invariant is that an argument
pointer != a local (alloca) pointer.

The fix is really light weight though, and allows instsimplify to be
resiliant to these situations: when checking the relation ships to
function arguments, ensure that the argumets come from the same
function. If they come from different functions, then none of these
assumptions hold. All credit to Benjamin Kramer for coming up with this
clever solution to the problem.

llvm-svn: 161410

2f6cf488

Add a much more conservative strategy for aligning branch targets. · 881d0a79

Chandler Carruth authored Aug 07, 2012

Previously, MBP essentially aligned every branch target it could. This
bloats code quite a bit, especially non-looping code which has no real
reason to prefer aligned branch targets so heavily.

As Andy said in review, it's still a bit odd to do this without a real
cost model, but this at least has much more plausible heuristics.

Fixes PR13265.

llvm-svn: 161409

881d0a79

MachineCSE: Update the heuristics for isProfitableToCSE. · cb36b8c2

Manman Ren authored Aug 07, 2012

If the result of a common subexpression is used at all uses of the candidate
expression, CSE should not increase the live range of the common subexpression.

rdar://11393714 and rdar://11819721

llvm-svn: 161396

cb36b8c2

Revert r161371. Removing the 'const' before Type is a "good thing". · 0acd0c0a

Bill Wendling authored Aug 07, 2012

--- Reverse-merging r161371 into '.':
U    include/llvm/Target/TargetData.h
U    lib/Target/TargetData.cpp

llvm-svn: 161394

0acd0c0a

The define for 64 bit sign extension neglected to · f4946cfb

Jack Carter authored Aug 07, 2012

initialize fields of the class that it used.

The result was nonsense code.

Before:
0000000000000000 <foo>:
   0:    00441100     0x441100
   4:    03e00008     jr    ra
   8:    00000000     nop

After:
0000000000000000 <foo>:
   0:    00041000     sll    v0,a0,0x0
   4:    03e00008     jr    ra
   8:    00000000     nop 

llvm-svn: 161377

f4946cfb

Constify the Type parameter to some methods (which are const anyway). · 654cd4aa
Bill Wendling authored Aug 07, 2012
```
llvm-svn: 161371
```
654cd4aa

Allow x86 subtargets to use the GenericModel defined in X86Schedule.td. · e0c83b1f

Andrew Trick authored Aug 07, 2012

This allows codegen passes to query properties like
InstrItins->SchedModel->IssueWidth. It also ensure's that
computeOperandLatency returns the X86 defaults for loads and "high
latency ops". This should have no significant impact on existing
schedulers because X86 defaults happen to be the same as global
defaults.

llvm-svn: 161370

e0c83b1f

Mips relocation R_MIPS_64 relocates a 64 bit double word. · 4c58381c

Jack Carter authored Aug 07, 2012

I hit this in a very large program (spirit.cpp), but 
have not figured out how to make a small make check
test for it.

llvm-svn: 161366

4c58381c

The Mips64InstrInfo.td definitions DynAlloc64 LEA_ADDiu64 · 612c6631

Jack Carter authored Aug 06, 2012

were using a class defined for 32 bit instructions and 
thus the instruction was for addiu instead of daddiu.

This was corrected by adding the instruction opcode as a 
field in the  base class to be filled in by the defs.

llvm-svn: 161359

612c6631

Reduce indentation by early exiting. · 45f74e31
Bill Wendling authored Aug 06, 2012
```
llvm-svn: 161356
```
45f74e31
Fix typo. · 2da09fd7
Jakob Stoklund Olesen authored Aug 06, 2012
```
llvm-svn: 161354
```
2da09fd7

Aug 06, 2012

Add a way to grab the target options from the LTO command line. · b8dcda77

Bill Wendling authored Aug 06, 2012

When the command line target options were removed from the LLVM libraries, LTO
lost its ability to specify things like `-disable-fp-elim'. Add this back by
adding the command line variables to the `lto' project.
<rdar://problem/12038729>

llvm-svn: 161353

b8dcda77

Mips relocations R_MIPS_HIGHER and R_MIPS_HIGHEST. · 84491abb

Jack Carter authored Aug 06, 2012

These 2 relocations gain access to the 
highest and the second highest 16 bits
of a 64 bit object.

R_MIPS_HIGHER %higher(A+S)
The %higher(x) function is [ (((long long) x + 0x80008000LL) >> 32) & 0xffff ]. 

R_MIPS_HIGHEST %highest(A+S)
The %highest(x) function is [ (((long long) x + 0x800080008000LL) >> 48) & 0xffff ]. 

llvm-svn: 161348

84491abb

MFTB on PPC64 should really be encoded using MFSPR. · 33e529d5

Hal Finkel authored Aug 06, 2012

The MFTB instruction itself is being phased out, and its functionality
is provided by MFSPR. According to the ISA docs, using MFSPR works on all known
chips except for the 601 (which did not have a timebase register anyway)
and the POWER3.

Thanks to Adhemerval Zanella for pointing this out!

llvm-svn: 161346

33e529d5

Add support for the OpenBSD for Bitrig. · 22738d00
Eric Christopher authored Aug 06, 2012
```
Patch by David Hill.

llvm-svn: 161344
```
22738d00
Fix MIPS DSP Rev1 intrinsics memory properties. · f679652e
Simon Atanasyan authored Aug 06, 2012
```
The patch reviewed by Akira Hatanaka.

llvm-svn: 161332
```
f679652e
Put up warning signs around MO::getNextOperandForReg(). · 8b7cfe33
Jakob Stoklund Olesen authored Aug 06, 2012
```
llvm-svn: 161329
```
8b7cfe33
Remove empty overrides of processFunctionBeforeFrameFinalized(). · 7d6e0856
Roman Divacky authored Aug 06, 2012
```
llvm-svn: 161328
```
7d6e0856

Implement proper handling for pcmpistri/pcmpestri intrinsics. Requires custom... · ab47fe4e

Craig Topper authored Aug 06, 2012

Implement proper handling for pcmpistri/pcmpestri intrinsics. Requires custom handling in DAGISelToDAG due to limitations in TableGen's implicit def handling. Fixes PR11305.

llvm-svn: 161318

ab47fe4e

Aug 05, 2012
- Update test to check for r161305 · 812005e5
  Craig Topper authored Aug 05, 2012
```
llvm-svn: 161307
```
  812005e5
- Remove custom inserter for MWAIT. It doesn't do anything that couldn't be represented in a pattern. · 6d0408d3
  Craig Topper authored Aug 05, 2012
```
llvm-svn: 161306
```
  6d0408d3
- Use a COPY node instead of an explicit MOVA opcode in the custom insterter for... · 43ee9fae
  Craig Topper authored Aug 05, 2012
```
Use a COPY node instead of an explicit MOVA opcode in the custom insterter for pcmpestrm/pcmpistrm. Allows the register allocator to handle it better and prevent wasted identity moves.

llvm-svn: 161305
```
  43ee9fae
Aug 04, 2012
- Add readcyclecounter lowering on PPC64. · 70381a7b
  Hal Finkel authored Aug 04, 2012
```
On PPC64, this can be done with a simple TableGen pattern.
To enable this, I've added the (otherwise missing) readcyclecounter
SDNode definition to TargetSelectionDAG.td.

llvm-svn: 161302
```
  70381a7b
- Skip impdef regs during eabi save/restore list emission to workaround PR11902 · ef731edf
  Anton Korobeynikov authored Aug 04, 2012
```
llvm-svn: 161301
```
  ef731edf
- Recognize vst1.64 / vld1.64 with 3 and 4 regs as load from / store to stack stuff · 3a4fdfec
  Anton Korobeynikov authored Aug 04, 2012
```
(this corresponds by spilling/reloading regs in DTriple / DQuad reg classes).
No testcase, found by inspection.

llvm-svn: 161300
```
  3a4fdfec
- Add stack spill / reload instructions for DTriple and DQuad register classes, which · 218aaf6d
  Anton Korobeynikov authored Aug 04, 2012
```
were missed for no reason. This fixes PR13377

llvm-svn: 161299
```
  218aaf6d
- Remove extraneous ';'. · 98f0b770
  Bill Wendling authored Aug 04, 2012
```
llvm-svn: 161298
```
  98f0b770
- Update cmake build. · 6d29c296
  Benjamin Kramer authored Aug 04, 2012
```
llvm-svn: 161297
```
  6d29c296
- Postpone the deletion of the old name in StructType::setName to allow using a... · 3849fcbe
  Benjamin Kramer authored Aug 04, 2012
```
Postpone the deletion of the old name in StructType::setName to allow using a slice of the old name.

Fixes PR13522. Add a rudimentary unit test to exercise the behavior.

llvm-svn: 161296
```
  3849fcbe
- [CMake] add_lit_target: Remove comments about add_dependencies. It is not a... · aa63610b
  NAKAMURA Takumi authored Aug 04, 2012
```
[CMake] add_lit_target: Remove comments about add_dependencies. It is not a bug in cmake that add_custom_target(DEPENDS) would not accept targets but file-level dependencies.

llvm-svn: 161295
```
  aa63610b
- llc: Try to suppress failures since r161262 . · 10b90b45
  NAKAMURA Takumi authored Aug 04, 2012
```
FIXME: Fix several tests on i686-win32 due to lacking of many libraries.
llvm-svn: 161292
```
  10b90b45
- Delete a dead variable. · a9d0b850
  Jakob Stoklund Olesen authored Aug 04, 2012
```
TwoAddressInstructionPass doesn't remat any more.

llvm-svn: 161285
```
  a9d0b850
- TwoAddressInstructionPass refactoring: Extract another method. · a0c72ecf
  Jakob Stoklund Olesen authored Aug 03, 2012
```
llvm-svn: 161284
```
  a0c72ecf
- Refactor and check "onlyReadsMemory" before optimizing builtins. · 874886cd
  Bob Wilson authored Aug 03, 2012
```
This patch is mostly just refactoring a bunch of copy-and-pasted code, but
it also adds a check that the call instructions are readnone or readonly.
That check was already present for sin, cos, sqrt, log2, and exp2 calls, but
it was missing for the rest of the builtins being handled in this code.

llvm-svn: 161282
```
  874886cd