Commits · ba848e3bca9aa8a80272bea5a7b35e6c269200f9 · Roger Ferrer / llvm-epi

Apr 12, 2013

Replace coff-/elf-dump with llvm-readobj · ba848e3b
Nico Rieck authored Apr 12, 2013
```
llvm-svn: 179361
```
ba848e3b

Add extensive relocation tests for llvm-readobj · e3517329

Nico Rieck authored Apr 12, 2013

This test ensures that relocation type names returned by libObject match
the raw relocation type value.

llvm-svn: 179360

e3517329

Fix the test on linux by setting the triple and the align format · 25a23bc0
Nadav Rotem authored Apr 12, 2013
```
llvm-svn: 179354
```
25a23bc0

Add a flag to align all basic blocks in the function. · c3b0f50a

Nadav Rotem authored Apr 12, 2013

When debugging performance regressions we often ask ourselves if the regression
that we see is due to poor isel/sched/ra or due to some micro-architetural
problem. When comparing two code sequences one good way to rule out front-end
bottlenecks (and other the issues) is to force code alignment. This pass adds
a flag that forces the alignment of all of the basic blocks in the program.

llvm-svn: 179353

c3b0f50a

Add 179294 back, but don't use bit fields so that it works on big endian hosts. · ecf13205

Rafael Espindola authored Apr 12, 2013

Original message:

Print more information about relocations.

With this patch llvm-readobj now prints if a relocation is pcrel, its length,
if it is extern and if it is scattered.

It also refactors the code a bit to use bit fields instead of shifts and
masks all over the place.

llvm-svn: 179345

ecf13205

Aliasing rules for struct-path aware TBAA. · 06a9d50a

Manman Ren authored Apr 11, 2013

Added PathAliases to check if two struct-path tags can alias.
Added command line option -struct-path-tbaa.

llvm-svn: 179337

06a9d50a

Apr 11, 2013

Use FileCheck instead of grep. · 6bda0db2
Preston Gurd authored Apr 11, 2013
```
llvm-svn: 179322
```
6bda0db2

Optimize icmp involving addition better · b81cd63c

David Majnemer authored Apr 11, 2013

Allows LLVM to optimize sequences like the following:

%add = add nsw i32 %x, 1
%cmp = icmp sgt i32 %add, %y

into:

%cmp = icmp sge i32 %x, %y

as well as:

%add1 = add nsw i32 %x, 20
%add2 = add nsw i32 %y, 57
%cmp = icmp sge i32 %add1, %add2

into:

%add = add nsw i32 %y, 37
%cmp = icmp sle i32 %cmp, %x

llvm-svn: 179316

b81cd63c

Mips specific inline asm memory operand modifier test case · a16fa808
Jack Carter authored Apr 11, 2013
```
These changes are based on commit responses for r179135.

llvm-svn: 179315
```
a16fa808
Revert my last two commits while I debug what is wrong in a big endian host. · e2742a03
Rafael Espindola authored Apr 11, 2013
```
llvm-svn: 179303
```
e2742a03

Print more information about relocations. · 708a44d4

Rafael Espindola authored Apr 11, 2013

With this patch llvm-readobj now prints if a relocation is pcrel, its length,
if it is extern and if it is scattered.

It also refactors the code a bit to use bit fields instead of shifts and
masks all over the place.

llvm-svn: 179294

708a44d4

Fix for wrong instcombine on vector insert/extract · a95f8749

Benjamin Kramer authored Apr 11, 2013

When trying to collapse sequences of insertelement/extractelement
instructions into single shuffle instructions, there is one specific
case where the Instruction Combiner wrongly updates the resulting
Mask of shuffle indexes.

The problem is in function CollectShuffleElments.

If we have a sequence of insert/extract element instructions
like the one below:

  %tmp1 = extractelement <4 x float> %LHS, i32 0
  %tmp2 = insertelement <4 x float> %RHS, float %tmp1, i32 1
  %tmp3 = extractelement <4 x float> %RHS, i32 2
  %tmp4 = insertelement <4 x float> %tmp2, float %tmp3, i32 3

Where:
  . %RHS will have a mask of [4,5,6,7]
  . %LHS will have a mask of [0,1,2,3]

The Mask of shuffle indexes is wrongly computed to [4,1,6,7]
instead of [4,0,6,7].
When analyzing %tmp2 in order to compute the Mask for the
resulting shuffle instruction, the algorithm forgets to update
the mask index at position 1 with the index associated to the
element extracted from %LHS by instruction %tmp1.

Patch by Andrea DiBiagio!

llvm-svn: 179291

a95f8749

Add a CHECK-NOT for a more faithful translation of the original grep | count 2. · 0840082c
Eli Bendersky authored Apr 11, 2013
```
Thanks to Reid Kleckner for catching this.

llvm-svn: 179289
```
0840082c
Add missing colons to check lines. · b50682e1
Benjamin Kramer authored Apr 11, 2013
```
llvm-svn: 179277
```
b50682e1
FileCheckize a bunch of tests. · 3960c1cd
Benjamin Kramer authored Apr 11, 2013
```
llvm-svn: 179276
```
3960c1cd

Optimize vector select from all 0s or all 1s · 55658d42

Michael Liao authored Apr 11, 2013

As packed comparisons in AVX/SSE produce all 0s or all 1s in each SIMD lane,
vector select could be simplified to AND/OR or removed if one or both values
being selected is all 0s or all 1s.

llvm-svn: 179267

55658d42

Add CLAC/STAC instruction encoding/decoding support · 95d94403

Michael Liao authored Apr 11, 2013

As these two instructions in AVX extension are privileged instructions for
special purpose, it's only expected to be used in inlined assembly.

llvm-svn: 179266

95d94403

Enhance bool simplifcation in X86 to handle more cases · f7bf8705

Michael Liao authored Apr 11, 2013

This patch is revised based on patch from Victor Umansky
<victor.umansky@intel.com>. More cases are handled in X86's bool
simplification, i.e.
- SETCC_CARRY
- value is truncated to i1 with AND

As a by-product, PR5443 is also fixed.

llvm-svn: 179265

f7bf8705

Add MachO-x86-64 tests. · 1d532a30

Rafael Espindola authored Apr 11, 2013

The object was already checked in, but was not being tested.

llvm-svn: 179256

1d532a30

Rewrite some of the test/CodeGen/X86 tests to use FileCheck instead of grep · 1dceb3c9
Eli Bendersky authored Apr 10, 2013
```
llvm-svn: 179241
```
1dceb3c9

MC: Support COFF image-relative MCSymbolRefs · 1da4529b

Nico Rieck authored Apr 10, 2013

Add support for the COFF relocation types IMAGE_REL_I386_DIR32NB and
IMAGE_REL_AMD64_ADDR32NB for 32- and 64-bit respectively. These are
similar to normal 4-byte relocations except that they do not include
the base address of the image.

Image-relative relocations are used for debug information (32-bit) and
SEH unwind tables (64-bit).

A new MCSymbolRef variant called 'VK_COFF_IMGREL32' is introduced to
specify such relocations. For AT&T assembly, this variant can be accessed
using the symbol suffix '@imgrel'.

llvm-svn: 179240

1da4529b

Manually remove successors in if conversion when CopyAndPredicateBlock is used · 95081bff

Hal Finkel authored Apr 10, 2013

In the simple and triangle if-conversion cases, when CopyAndPredicateBlock is
used because the to-be-predicated block has other predecessors, we need to
explicitly remove the old copied block from the successors list. Normally if
conversion relies on TII->AnalyzeBranch combined with BB->CorrectExtraCFGEdges
to cleanup the successors list, but if the predicated block contained an
un-analyzable branch (such as a now-predicated return), then this will fail.

These extra successors were causing a problem on PPC because it was causing
later passes (such as PPCEarlyReturm) to leave dead return-only basic blocks in
the code.

llvm-svn: 179227

95081bff

Mips specific inline asm memory operand modifier test case · b6bcdfd2
Jack Carter authored Apr 10, 2013
```
These changes are based on commit responses for r179135.

llvm-svn: 179225
```
b6bcdfd2

Apr 10, 2013

fixed xsave, xsaveopt, xrstor mnemonics with intel syntax; added test cases · 394bf148
Kay Tiong Khoo authored Apr 10, 2013
```
llvm-svn: 179223
```
394bf148

Revert "Update the version of dwarf we say we're emitting to at least 3." · f8d5b644

Eric Christopher authored Apr 10, 2013

temporarily while we work on plumbing through some changes to continue
supporting gdb on darwin.

This reverts commit r179122.

llvm-svn: 179222

f8d5b644

Add object-emission flag for lit tests. This flag is used · 9dea0955

Jyotsna Verma authored Apr 10, 2013

to disable following tests for Hexagon that require direct object
generation support.

DebugInfo/dwarf-public-names.ll
DebugInfo/dwarf-version.ll
DebugInfo/member-pointers.ll
DebugInfo/namespace.ll
DebugInfo/two-cus-from-same-file.ll

Fixes bug 15616 - http://llvm.org/bugs/show_bug.cgi?id=15616

llvm-svn: 179209

9dea0955

Make the SLP store-merger less paranoid about function calls. We check for... · 73dffa41

Nadav Rotem authored Apr 10, 2013

Make the SLP store-merger less paranoid about function calls. We check for function calls when we check if it is safe to sink instructions.

llvm-svn: 179207

73dffa41

R600/SI: Add pattern for AMDGPUurecip · 8caa904b

Michel Danzer authored Apr 10, 2013



21 more little piglits with radeonsi.

Reviewed-by: Tom Stellard <thomas.stellard@amd.com>
llvm-svn: 179186

8caa904b

This is for an experimental option -mips-os16. The idea is to compile all · fe94cc3e

Reed Kotler authored Apr 10, 2013

Mips32 code as Mips16 unless it can't be compiled as Mips 16. For now this
would happen as long as floating point instructions are not needed.
Probably it would also make sense to compile as mips32 if atomic operations
are needed too. There may be other cases too.

A module pass prescans the IR and adds the mips16 or nomips16 attribute
to functions depending on the functions needs.

Mips 16 mode can result in a 40% code compression by utililizing 16 bit
encoding of many instructions.

The hope is for this to replace the traditional gcc way of dealing with
Mips16 code using floating point which involves essentially using soft float
but with a library implemented using mips32 floating point. This gcc 
method also requires creating stubs so that Mips32 code can interact with
these Mips 16 functions that have floating point needs. My conjecture is
that in reality this traditional gcc method would never win over this
new method.

I will be implementing the traditional gcc method also. Some of it is already
done but I needed to do the stubs to finish the work and those required
this mips16/32 mixed mode capability.

I have more ideas for to make this new method much better and I think the old
method will just live in llvm for anyone that needs the backward compatibility
but I don't for what reason that would be needed.

llvm-svn: 179185

fe94cc3e

Use a scheme closer to that of GNU as when deciding the type of a · adac407e

Peter Collingbourne authored Apr 10, 2013

symbol with multiple .type declarations.

Differential Revision: http://llvm-reviews.chandlerc.com/D607

llvm-svn: 179184

adac407e

R600: Add VTX_READ_* and RAT_WRITE_CACHELESS_* when computing cf addr · 04d9aa48
Vincent Lejeune authored Apr 10, 2013
```
llvm-svn: 179174
```
04d9aa48

[test] Use lit's shell test runner on Windows · d16abb77

Reid Kleckner authored Apr 10, 2013

Summary:
I did a local comparison between using bash and using lit's runner, and
more of the suite passes with lit than passes with bash.  Most of the
bash failures have to do with /dev/null, which is nonsensical on
Windows, but the lit runner handles it.

The lit shell runner is also much faster than bash, so I would expect
most Windows devs would want it by default.

The behavior can be overridden on any OS by setting
LIT_USE_INTERNAL_SHELL to 0 or 1 in the environment.

Reviewers: chapuni, ddunbar

CC: llvm-commits, timurrrr

Differential Revision: http://llvm-reviews.chandlerc.com/D559

llvm-svn: 179173

d16abb77

ARM: Make "SMC" instructions conditional on new TrustZone architecture feature. · c6047655

Tim Northover authored Apr 10, 2013

These instructions aren't universally available, but depend on a specific
extension to the normal ARM architecture (rather than, say, v6/v7/...) so a new
feature is appropriate.

This also enables the feature by default on A-class cores which usually have
these extensions, to avoid breaking existing code and act as a sensible
default.

llvm-svn: 179171

c6047655

R600/SI: dynamical figure out the reg class of MIMG · 8b1ed28e

Christian Konig authored Apr 10, 2013



Depending on the number of bits set in the writemask.

Signed-off-by: Christian König <christian.koenig@amd.com>
Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>
llvm-svn: 179166

8b1ed28e

R600/SI: adjust writemask to only the used components · 8e06e2a8

Christian Konig authored Apr 10, 2013



Signed-off-by: Christian König <christian.koenig@amd.com>
Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>
llvm-svn: 179165

8e06e2a8

R600/SI: remove image sample writemask · 4ace6632

Christian Konig authored Apr 10, 2013



Signed-off-by: Christian König <christian.koenig@amd.com>
Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>
llvm-svn: 179164

4ace6632

__sincosf_stret returns sinf / cosf in bits 0:31 and 32:63 of xmm0, not in · ac0469c5
Evan Cheng authored Apr 10, 2013
```
xmm0 / xmm1.

rdar://13599493

llvm-svn: 179141
```
ac0469c5

Mips specific inline asm operand modifier 'D' · b04e357d

Jack Carter authored Apr 09, 2013

Modifier 'D' is to use the second word of a double integer.

We had previously implemented the pure register varient of 
the modifier and this patch implements the memory reference.



#include "stdio.h"

int b[8] = {0,1,2,3,4,5,6,7};
void main()
{
    int i;
    
    // The first word. Notice, no 'D'
    {asm (
    "lw    %0,%1;"
    : "=r" (i)
    : "m" (*(b+4))
    );}
    
    printf("%d\n",i);

    // The second word
    {asm (
    "lw    %0,%D1;"
    : "=r" (i)
    : "m" (*(b+4))
    );}
    
    printf("%d\n",i);
}

llvm-svn: 179135

b04e357d

Allow PPC B and BLR to be if-converted into some predicated forms · 5711eca1

Hal Finkel authored Apr 09, 2013

This enables us to form predicated branches (which are the same conditional
branches we had before) and also a larger set of predicated returns (including
instructions like bdnzlr which is a conditional return and loop-counter
decrement all in one).

At the moment, if conversion does not capture all possible opportunities. A
simple example is provided in early-ret2.ll, where if conversion forms one
predicated return, and then the PPCEarlyReturn pass picks up the other one. So,
at least for now, we'll keep both mechanisms.

llvm-svn: 179134

5711eca1

Apr 09, 2013
- Update the version of dwarf we say we're emitting to at least 3. · 06c89d65
  Eric Christopher authored Apr 09, 2013
```
Deals with a dwarf2 -> dwarf3 DW_FORM_ref_addr change.

llvm-svn: 179122
```
  06c89d65