Commits · b35a211f61c360b4758d79e8c156ac7c3f63df19 · Roger Ferrer / llvm-epi-0.8

Apr 03, 2013

Measure time that IR parsing took as part of the -time-passes measurement. · b35a211f
Eli Bendersky authored Apr 03, 2013
```
llvm-svn: 178662
```
b35a211f

PPC: Enable FRES and FRSQRTE on the default PPC64 description · 7ac4592e

Hal Finkel authored Apr 03, 2013

I discussed this with Bill Schmidt on IRC, and it was decided that this is a
safe and reasonable default.

llvm-svn: 178659

7ac4592e

PPC: Add a FIXME regarding the non-working fma+fneg Altivec pattern · 0c6d2193
Hal Finkel authored Apr 03, 2013
```
llvm-svn: 178658
```
0c6d2193
Remove some obsolete PowerPC/README entries · 2ed21a8c
Hal Finkel authored Apr 03, 2013
```
llvm-svn: 178657
```
2ed21a8c

More direct types in PowerPC AltiVec intrinsics. · 084ff8e8

Ulrich Weigand authored Apr 03, 2013

This patch follows up on work done by Bill Schmidt in r178277,
and replaces most of the remaining uses of VRRC in ISEL DAG patterns.

The resulting .inc files are identical except for comments, so
no change in code generation is expected.

llvm-svn: 178656

084ff8e8

Fix PR15632: No support for ppcf128 floating-point remainder on PowerPC. · 92e26646

Bill Schmidt authored Apr 03, 2013

For this we need to use a libcall.  Previously LLVM didn't implement
libcall support for frem, so I've added it in the usual
straightforward manner.  A test case from the bug report is included.

llvm-svn: 178639

92e26646

AArch64: implement ETMv4 trace system registers. · 5816ca11
Tim Northover authored Apr 03, 2013
```
llvm-svn: 178637
```
5816ca11

Second pass at addressing PR15351 by explicitly checking for AVX support · 5f7c680f

Aaron Ballman authored Apr 03, 2013

when getting the host processor information.  It emits a .byte sequence on GNUC compilers to work around lack of xgetbv support with older assemblers, and resolves a comment typo found in the previous patch.
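
As a rough illustration of the kind of check described above (not the code from this patch), the sketch below takes the raw register values as integers, since Python cannot execute `cpuid` or `xgetbv` itself. The function name `has_usable_avx` and its parameters are hypothetical. The bit positions are the documented ones: CPUID leaf 1 ECX bit 27 (OSXSAVE) and bit 28 (AVX), and XCR0 bits 1 and 2 (SSE and AVX state).

```python
# Hypothetical sketch of an AVX availability check; not the patch's code.
OSXSAVE_BIT = 1 << 27     # CPUID.1:ECX, OS has enabled XSAVE/XGETBV
AVX_BIT = 1 << 28         # CPUID.1:ECX, CPU implements AVX
XCR0_SSE_STATE = 1 << 1   # XCR0, OS saves/restores XMM state
XCR0_AVX_STATE = 1 << 2   # XCR0, OS saves/restores YMM state

# Byte encoding of `xgetbv`, which can be emitted with .byte when an
# older assembler does not know the mnemonic.
XGETBV_BYTES = bytes([0x0F, 0x01, 0xD0])


def has_usable_avx(cpuid1_ecx: int, xcr0: int) -> bool:
    """Return True only if both the CPU and the OS support AVX."""
    cpu_bits = OSXSAVE_BIT | AVX_BIT
    if (cpuid1_ecx & cpu_bits) != cpu_bits:
        # Without OSXSAVE, xgetbv must not be executed at all.
        return False
    state_bits = XCR0_SSE_STATE | XCR0_AVX_STATE
    return (xcr0 & state_bits) == state_bits
```

Checking the CPUID bits first matters because `xgetbv` itself faults when the OS has not enabled XSAVE.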

llvm-svn: 178636

5f7c680f

Temporarily relax the WIN32 checks in the SRet test to fix the Atom D2700 bot · 7205c72d
Timur Iskhodzhanov authored Apr 03, 2013
```
llvm-svn: 178635
```
7205c72d
Fix SRet for thiscall in i686-pc-win32 · f4e0665e
Timur Iskhodzhanov authored Apr 03, 2013
```
llvm-svn: 178634
```
f4e0665e

AArch64: switch patterns to be type-based rather than RegClass-based · 5b097a73

Tim Northover authored Apr 03, 2013

It's a bit of churn in the blame log, but I think there are real benefits to
the newer system so I'm making the change in one go.

llvm-svn: 178633

5b097a73

Fix grammar. · 14c2067c
Eric Christopher authored Apr 03, 2013
```
llvm-svn: 178624
```
14c2067c
Remove ZeroOrMore from the option description. We don't need it here. · 5590949f
Eric Christopher authored Apr 03, 2013
```
llvm-svn: 178623
```
5590949f

Add 64-bit compare + branch for SPARC v9. · d9bbdfd3

Jakob Stoklund Olesen authored Apr 03, 2013

The same compare instruction is used for 32-bit and 64-bit compares. It
sets two different sets of flags: icc and xcc.

This patch adds a conditional branch instruction using the xcc flags for
64-bit compares.
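
To illustrate why a separate xcc branch is needed, here is a small model of a subtract-and-set-flags operation that produces both flag sets from one subtraction. It is a sketch only: the helper names `_flags` and `subcc` are hypothetical, and it models just the N, Z, V and C bits at 32-bit and 64-bit width.

```python
def _flags(a: int, b: int, bits: int) -> dict:
    """N, Z, V, C for a - b evaluated at the given width."""
    mask = (1 << bits) - 1
    sign = 1 << (bits - 1)
    a &= mask
    b &= mask
    r = (a - b) & mask
    return {
        "N": bool(r & sign),
        "Z": r == 0,
        # Signed overflow: operands differ in sign and the result's
        # sign differs from the first operand's.
        "V": bool((a ^ b) & (a ^ r) & sign),
        # Borrow: unsigned a < b at this width.
        "C": a < b,
    }


def subcc(a: int, b: int) -> dict:
    """One subtraction yields both the 32-bit (icc) and 64-bit (xcc) flags."""
    return {"icc": _flags(a, b, 32), "xcc": _flags(a, b, 64)}


# 2**32 and 0 are equal in their low 32 bits but not as 64-bit values.
example = subcc(1 << 32, 0)
```

In `example`, the icc Z flag is set while the xcc Z flag is clear, so a branch-if-equal on icc would give the wrong answer for a 64-bit comparison.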

llvm-svn: 178621

d9bbdfd3

Remove some unsupported-feature comments from PPC.td · b00fc876
Hal Finkel authored Apr 03, 2013
```
These refer to the reciprocal estimate support recently committed.

llvm-svn: 178618
```
b00fc876

Use PPC reciprocal estimates with Newton iteration in fast-math mode · 2e103310

Hal Finkel authored Apr 03, 2013

When unsafe FP math operations are enabled, we can use the fre[s] and
frsqrte[s] instructions, which generate reciprocal (sqrt) estimates, together
with some Newton iteration, in order to quickly generate floating-point
division and sqrt results. All of these instructions are separately optional,
and so each has its own feature flag (except for the Altivec instructions,
which are covered under the existing Altivec flag). Doing this is not only
faster than using the IEEE-compliant fdiv/fsqrt instructions, but allows these
computations to be pipelined with other computations in order to hide their
overall latency.

I've also added a couple of fnmsub patterns which turned out to be missing
(but are necessary for good code generation of the Newton iterations).
Altivec needs a similar fix, but that will probably be more complicated because
fneg is expanded for Altivec's v4f32.
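
For readers unfamiliar with the technique, the following is a minimal numerical sketch of refining a crude reciprocal or reciprocal-sqrt estimate with Newton iteration. It is not the code generated by this patch: the function names are hypothetical, and the hardware estimate is simulated by perturbing the exact value with an assumed relative error (`rel_err`), which is not a claim about the accuracy of any real instruction.

```python
import math


def newton_reciprocal(a: float, estimate: float, steps: int = 2) -> float:
    """Refine x ~= 1/a. Each step roughly squares the relative error."""
    x = estimate
    for _ in range(steps):
        # (2 - a*x) has the shape b - a*c, i.e. a negated multiply-subtract.
        x = x * (2.0 - a * x)
    return x


def newton_rsqrt(a: float, estimate: float, steps: int = 2) -> float:
    """Refine y ~= 1/sqrt(a)."""
    y = estimate
    for _ in range(steps):
        y = y * (1.5 - 0.5 * a * y * y)
    return y


def fast_div(n: float, d: float, rel_err: float = 1.0 / 256) -> float:
    # Stand-in for a hardware reciprocal estimate (assumed error).
    est = (1.0 / d) * (1.0 + rel_err)
    return n * newton_reciprocal(d, est)


def fast_sqrt(a: float, rel_err: float = 1.0 / 256) -> float:
    # Stand-in for a hardware reciprocal-sqrt estimate (assumed error).
    est = (1.0 / math.sqrt(a)) * (1.0 + rel_err)
    # sqrt(a) = a * (1/sqrt(a))
    return a * newton_rsqrt(a, est)
```

The results are close to, but not bit-identical with, correctly rounded division and square root, which is why this kind of lowering is restricted to unsafe FP math.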

llvm-svn: 178617

2e103310

Fix the fde encoding used by mips to match gas. · b9b7ae0c

Rafael Espindola authored Apr 03, 2013

This finally fixes the encoding. The patch also
* Removes eh-frame.ll. It was an unnecessary .ll to .o test that was checking
  the wrong value.
* Merges fde-reloc.s and eh-frame.s into a single test, since the only difference
  was the run lines.
* Stops blindly testing the content of the entire .eh_frame section, which makes
  life hard for anyone actually fixing a bug and hitting a difference in a binary
  blob. Instead, it uses a CHECK for each field and documents what is being checked.

llvm-svn: 178615

b9b7ae0c

Rolling back the AVX support patch due to breaking a gcc 4.6 build bot that... · 9c0f0af5

Aaron Ballman authored Apr 03, 2013

Rolling back the AVX support patch due to breaking a gcc 4.6 build bot that doesn't understand the xgetbv instruction for some reason.  Will revisit when time permits.

llvm-svn: 178614

9c0f0af5

Remove an optimization where we were changing an objc_autorelease into an... · b8c88365

Michael Gottesman authored Apr 03, 2013

Remove an optimization where we were changing an objc_autorelease into an objc_autoreleaseReturnValue.

The semantics of ARC implies that a pointer passed into an objc_autorelease
must live until some point (potentially down the stack) where an
autorelease pool is popped. On the other hand, an
objc_autoreleaseReturnValue just signifies that the object must live
until the end of the given function at least.

Thus objc_autorelease is stronger than objc_autoreleaseReturnValue in
terms of the semantics of ARC*, implying that performing the given
strength reduction without any knowledge of how this relates to
the autorelease pool pop that is further up the stack violates the
semantics of ARC.

*Even though objc_autoreleaseReturnValue is more computationally expensive
if you know that no RV optimization will occur.

llvm-svn: 178612

b8c88365

Improved comment. No functionality change. · 62424391
Michael Gottesman authored Apr 03, 2013
```
llvm-svn: 178605
```
62424391
Attempting to fix the build on older GCC versions. · 56be6ba5
Aaron Ballman authored Apr 03, 2013
```
llvm-svn: 178604
```
56be6ba5

Remove anonymous namespace. · aff60814

Rafael Espindola authored Apr 03, 2013

Looks like the gcc in http://lab.llvm.org:8011/builders/clang-x86_64-darwin11-self-mingw32/ doesn't like "not external linkage":

/Volumes/Macintosh_HD2/buildbots/clang-x86_64-darwin11-self-mingw32/llvm.src/include/llvm/Support/YAMLTraits.h: In instantiation of 'const bool llvm::yaml::has_SequenceMethodTraits<std::vector<<unnamed>::COFFYAML::Relocation, std::allocator<<unnamed>::COFFYAML::Relocation> > >::value':
/Volumes/Macintosh_HD2/buildbots/clang-x86_64-darwin11-self-mingw32/llvm.src/include/llvm/Support/YAMLTraits.h:281:   instantiated from 'llvm::yaml::has_SequenceTraits<std::vector<<unnamed>::COFFYAML::Relocation, std::allocator<<unnamed>::COFFYAML::Relocation> > >'
/Volumes/Macintosh_HD2/buildbots/clang-x86_64-darwin11-self-mingw32/llvm.src/utils/yaml2obj/yaml2obj.cpp:627:   instantiated from here
/Volumes/Macintosh_HD2/buildbots/clang-x86_64-darwin11-self-mingw32/llvm.src/include/llvm/Support/YAMLTraits.h:243: error: 'llvm::yaml::SequenceTraits<std::vector<<unnamed>::COFFYAML::Relocation, std::allocator<<unnamed>::COFFYAML::Relocation> > >::size' is not a valid template argument for type 'size_t (*)(llvm::yaml::IO&, std::vector<<unnamed>::COFFYAML::Relocation, std::allocator<<unnamed>::COFFYAML::Relocation> >&)' because function 'static size_t llvm::yaml::SequenceTraits<std::vector<<unnamed>::COFFYAML::Relocation, std::allocator<<unnamed>::COFFYAML::Relocation> > >::size(llvm::yaml::IO&, std::vector<<unnamed>::COFFYAML::Relocation, std::allocator<<unnamed>::COFFYAML::Relocation> >&)' has not external linkage

llvm-svn: 178600

aff60814

This patch addresses PR15351 by explicitly checking for AVX support · 6bc0dfc7
Aaron Ballman authored Apr 03, 2013
```
when getting the host processor information.

llvm-svn: 178598
```
6bc0dfc7

Use yaml::IO in yaml2obj.cpp. · e1d9afa8

Rafael Espindola authored Apr 02, 2013

The generic structs and specializations will be refactored when obj2yaml is
changed to use yaml::IO.

llvm-svn: 178593

e1d9afa8

Formatting. · e2fbc67e
Eric Christopher authored Apr 02, 2013
```
llvm-svn: 178589
```
e2fbc67e

[mips] Small update to the implementation of eh.return for Mips. · 023c678a

Akira Hatanaka authored Apr 02, 2013

This patch initializes t9 to the handler address, but only if the relocation
model is pic. This handles the case where the handler to which eh.return jumps
points to the start of the function.

Patch by Sasa Stankovic.

llvm-svn: 178588

023c678a

Support and test template arguments for unions. · 6476f908
Eric Christopher authored Apr 02, 2013
```
llvm-svn: 178586
```
6476f908
Reformat arguments. · 17dd8f07
Eric Christopher authored Apr 02, 2013
```
llvm-svn: 178585
```
17dd8f07

[mips] Expand pseudo multiply/divide instructions in MipsCodeEmitter.cpp. · 2ffc5734

Akira Hatanaka authored Apr 02, 2013

This patch fixes the following two tests which have been failing on
llvm-mips-linux builder since r178403:

LLVM :: Analysis/Profiling/load-branch-weights-ifs.ll
LLVM :: Analysis/Profiling/load-branch-weights-loops.ll

llvm-svn: 178584

2ffc5734

llvm/test/CodeGen/X86: Unmark them out of XFAIL:cygming, in atomic{32|64}.ll... · fc613f4d

NAKAMURA Takumi authored Apr 02, 2013

llvm/test/CodeGen/X86: Unmark them out of XFAIL:cygming, in atomic{32|64}.ll and handle-move.ll, corresponding to r178549.

This reverts r176808, r176798, and r177914.

llvm-svn: 178583

fc613f4d

Allow MachineTraceMetrics to be used when the model has no resources. · aeb69a54
Jakob Stoklund Olesen authored Apr 02, 2013
```
It is still possible to extract information from itineraries, for
example.

llvm-svn: 178582
```
aeb69a54

Apr 02, 2013

Fix a typo. · 09da37d1
Jakub Staszak authored Apr 02, 2013
```
llvm-svn: 178567
```
09da37d1

[ms-inline asm] Add support for parsing variables with namespace alias · 8a24466f

Chad Rosier authored Apr 02, 2013

qualifiers.

This patch only adds support for parsing these identifiers in the
X86AsmParser.  The front-end interface isn't capable of looking up
these identifiers at this point in time.  The end result is the
compiler now errors during object file emission, rather than at
parse time.  Test case coming shortly.
Part of rdar://13499009 and PR13340

llvm-svn: 178566

8a24466f

Add MDBuilder utilities for path-aware TBAA. · c018eea6

Manman Ren authored Apr 02, 2013

Add utilities to create struct nodes in TBAA type DAG and to create path-aware
tags. The format of struct nodes in TBAA type DAG: a unique name, a list of
fields with field offsets and field types. The format of path-aware tags:
a base type in TBAA type DAG, an access type and an offset relative to the base
type.
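
The two formats described above can be modelled with plain data classes. This is only a conceptual sketch: it does not use or reproduce the MDBuilder API or the actual metadata encoding, and every name in it (`ScalarType`, `StructType`, `PathTag`, `resolve`, and the example types with their offsets) is hypothetical.

```python
from dataclasses import dataclass
from typing import Optional, Tuple, Union


@dataclass(frozen=True)
class ScalarType:
    name: str


@dataclass(frozen=True)
class StructType:
    name: str                                    # unique name
    fields: Tuple[Tuple[int, "TypeNode"], ...]   # (field offset, field type)


TypeNode = Union[ScalarType, StructType]


@dataclass(frozen=True)
class PathTag:
    base: TypeNode     # base type in the type DAG
    access: TypeNode   # type actually being accessed
    offset: int        # offset relative to the base type


def resolve(node: TypeNode, offset: int) -> Optional[ScalarType]:
    """Walk the type DAG to the scalar found at `offset`, if any."""
    if isinstance(node, ScalarType):
        return node if offset == 0 else None
    # Pick the field with the largest starting offset not past `offset`.
    candidates = [f for f in node.fields if f[0] <= offset]
    if not candidates:
        return None
    field_offset, field_type = max(candidates, key=lambda f: f[0])
    return resolve(field_type, offset - field_offset)


# Assumed layout: struct Inner { int x; float y; };
#                 struct Outer { int a; Inner in; };
int_ty = ScalarType("int")
float_ty = ScalarType("float")
inner = StructType("Inner", ((0, int_ty), (4, float_ty)))
outer = StructType("Outer", ((0, int_ty), (4, inner)))

# Tag for an access to `outer.in.y`: base Outer, access float, offset 8.
tag = PathTag(base=outer, access=float_ty, offset=8)
```

Here `resolve(tag.base, tag.offset)` returns `float_ty`, matching the tag's access type, which is the kind of consistency a path-aware tag is meant to capture.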

llvm-svn: 178564

c018eea6

Fix PR15630: Replace faulty stdcx. with stwcx. · 3581cd4b

Bill Schmidt authored Apr 02, 2013

When doing a partword atomic operation, a lwarx was being paired with
a stdcx. instead of a stwcx. when compiling for a 64-bit target.  The
target has nothing to do with it in this case; we always need a stwcx.
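
The reason the word-sized pair is always needed can be seen in a simplified model of a partword atomic: the byte is updated by reading, modifying and writing back the 32-bit word that contains it. The sketch below is an assumption-laden illustration, not the generated code: memory is a Python list of 32-bit words, byte order is taken to be big-endian, and the store-conditional is assumed to always succeed (no retry loop). The name `partword_add_byte` is hypothetical.

```python
def partword_add_byte(words: list, byte_addr: int, value: int) -> int:
    """Atomically-in-spirit add `value` to one byte; returns the old byte."""
    index = byte_addr // 4                   # containing aligned 32-bit word
    shift = (3 - (byte_addr % 4)) * 8        # big-endian byte position
    mask = 0xFF << shift

    old_word = words[index]                  # models lwarx (32-bit load-reserve)
    old_byte = (old_word >> shift) & 0xFF
    new_byte = (old_byte + value) & 0xFF
    new_word = (old_word & ~mask) | (new_byte << shift)
    words[index] = new_word & 0xFFFFFFFF     # models stwcx. (32-bit store-cond.)
    return old_byte


memory = [0x11223344]
previous = partword_add_byte(memory, 1, 0xFF)
```

The width of the reservation is fixed by the 32-bit containing word, independent of whether pointers are 32 or 64 bits wide.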

Thanks to Kai Nacke for reporting the problem.

llvm-svn: 178559

3581cd4b

Don't attempt MTM heuristics without a scheduling model present. · 8fbfc591
Jakob Stoklund Olesen authored Apr 02, 2013
```
This should fix the PPC buildbots.

llvm-svn: 178558
```
8fbfc591

Count processor resources individually in MachineTraceMetrics. · 3ca14772

Jakob Stoklund Olesen authored Apr 02, 2013

The new instruction scheduling models provide information about the
number of cycles consumed on each processor resource. This makes it
possible to estimate ILP more accurately than simply counting
instructions / issue width.

The functions getResourceDepth() and getResourceLength() now identify
the limiting processor resource, and return a cycle count based on that.

This gives more precise resource information, particularly in traces
that use one resource a lot more than others.
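
The difference between the two estimates can be shown with a toy model. This is a sketch of the idea only, not the MachineTraceMetrics implementation: the function names, the resource names (`LSU`, `ALU`) and the per-instruction cycle counts are all made up for illustration.

```python
from collections import Counter
from math import ceil


def issue_bound(instr_count: int, issue_width: int) -> int:
    """Old-style estimate: instructions divided by issue width."""
    return ceil(instr_count / issue_width)


def resource_bound(trace, units):
    """Find the limiting resource and the cycles it needs.

    `trace` is a list of {resource: cycles} dicts, one per instruction;
    `units` maps each resource to how many copies of it exist.
    """
    totals = Counter()
    for instr in trace:
        totals.update(instr)   # adds the per-resource cycle counts
    if not totals:
        return None, 0
    limiting = max(totals, key=lambda r: totals[r] / units[r])
    return limiting, ceil(totals[limiting] / units[limiting])


# Four loads and two adds on a 4-wide machine with one load/store unit.
trace = [{"LSU": 1}] * 4 + [{"ALU": 1}] * 2
units = {"LSU": 1, "ALU": 2}

by_issue = issue_bound(len(trace), issue_width=4)
limiting, by_resource = resource_bound(trace, units)
```

For this trace the issue-width estimate is 2 cycles, while the single load/store unit needs 4, so counting resources individually gives the tighter bound.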

llvm-svn: 178553

3ca14772

[fast-isel] Use the correct API to disable FastLowerArguments for Win64. · 7925d280
Chad Rosier authored Apr 02, 2013
```
llvm-svn: 178549
```
7925d280

DAGCombiner: Merge store/loads when we have extload/truncstores · d6c6e868

Arnold Schwaighofer authored Apr 02, 2013

This helps on architectures where i8 and i16 are not legal but we have byte and
short loads/stores, allowing us to merge copies like the one below on ARM.

copy(char *a, char *b, int n) {
 do {
   int t0 = a[0];
   int t1 = a[1];
   b[0] = t0;
   b[1] = t1;

radar://13536387

llvm-svn: 178546

d6c6e868

Simplify test cases for Atom preferring call register indirect over · 95cbee6c
Preston Gurd authored Apr 02, 2013
```
call memory indirect (32 and 64 bit).

llvm-svn: 178541
```
95cbee6c