Commits · 49da758cbf1fffbf487e91cfbf87b5767ae98632 · Roger Ferrer / llvm-epi-0.8

Dec 29, 2013
- [SparcV9] For codegen generated library calls that return float, set inreg... · 5ac9c8fa
  Venkatraman Govindaraju authored Dec 29, 2013
```
[SparcV9] For codegen generated library calls that return float, set inreg flag manually in LowerCall().
This makes the Sparc backend generate Sparc64 ABI-compliant code.

llvm-svn: 198149
```
- Make more of the x86 lowering helper functions static. · a448bd86
  Craig Topper authored Dec 29, 2013
```
llvm-svn: 198146
```
- [SparcV9]: Implement lowering of long double (fp128) arguments in Sparc64 ABI. · 0776cc0a
  Venkatraman Govindaraju authored Dec 29, 2013
```
Also, pass fp128 arguments to varargs through integer registers if necessary.

llvm-svn: 198145
```
- Switch from EVT to MVT in more of the x86 instruction lowering code. · 059e8e0d
  Craig Topper authored Dec 29, 2013
```
llvm-svn: 198144
```
Dec 28, 2013

ARM IAS: handle errors more appropriately · 0c4b1026

Saleem Abdulrasool authored Dec 28, 2013

Directive parsers must return false if the target assembler is interested in
handling the directive. The Error member function returns true always. Using
the 'return Error()' pattern would incorrectly indicate to the general parser
that the target was not interested in the directive, when in reality it simply
encountered a badly formed directive or some other error. This corrects the
behaviour to ensure that the parser behaves appropriately.

llvm-svn: 198132

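The return convention described in this commit can be illustrated with a minimal sketch. `Error`, `parseDirectiveBuggy`, and `parseDirectiveFixed` are hypothetical stand-ins, not the actual ARMAsmParser code. The sketch only assumes, as the message states, that `Error` always returns true and that a directive parser returns false when the target has handled the directive.

```cpp
#include <iostream>
#include <string>

// Hypothetical stand-in for the parser's Error(): it reports the problem
// and always returns true.
inline bool Error(const std::string &Msg) {
  std::cerr << "error: " << Msg << '\n';
  return true;
}

// Problematic pattern: 'return Error(...)' yields true, which tells the
// general parser that the target was not interested in the directive.
inline bool parseDirectiveBuggy(const std::string &Operand) {
  if (Operand.empty())
    return Error("expected operand");
  return false;
}

// Corrected pattern: report the error, then return false so the general
// parser knows the target did claim the directive.
inline bool parseDirectiveFixed(const std::string &Operand) {
  if (Operand.empty()) {
    Error("expected operand");
    return false;
  }
  return false;
}
```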

New machine model for cortex-a9. Schedule for resources and latency. · 3ca67d64

Andrew Trick authored Dec 28, 2013

Schedule more conservatively to account for stalls on floating point
resources and latency. Use the AGU resource to model latency stalls
since it's shared between FP and LD/ST instructions. This might not be
completely accurate but should work well in practice.

llvm-svn: 198125

The Cortex-A9 machine model is incomplete. Mark it as such. · 03b22e39

Andrew Trick authored Dec 28, 2013

Many vector operations never had itineraries. Since the new machine
model was a mapping from existing itinerary classes, we don't have a
model for these. We still want to migrate A9 even though no one has
invested in a complete model, so mark it incomplete to avoid the
scheduler asserting.

llvm-svn: 198123

Factor MI-Sched in preparation for post-ra scheduling support. · d7f890ed

Andrew Trick authored Dec 28, 2013

Factor the MachineFunctionPass into MachineSchedulerBase.

Split the DAG class into ScheduleDAGMI and ScheduleDAGMILive.

llvm-svn: 198119

Use getSimpleValueType in a few spots where the type should be simple. · bf096926
Craig Topper authored Dec 28, 2013
```
llvm-svn: 198117
```

Minor indentation fix to match other switch statements. Change... · e829fe42

Craig Topper authored Dec 28, 2013

Minor indentation fix to match other switch statements. Change llvm_unreachable text to match similar places.

llvm-svn: 198116

[X86] Teach the backend how to fold target specific dag node for packed · eaceba0e

Andrea Di Biagio authored Dec 28, 2013

vector shift by immediate count (VSHLI/VSRLI/VSRAI) into a build_vector when
the input vector of the shift is a build_vector of all constants or UNDEFs.

Target specific nodes for packed shifts by immediate count are in
general introduced by function 'getTargetVShiftByConstNode' (in
X86ISelLowering.cpp) when lowering shift operations, SSE/AVX immediate
shift intrinsics and (only in very few cases) SIGN_EXTEND_INREG dag
nodes.

This patch adds extra rules for simplifying vector shifts inside
function 'getTargetVShiftByConstNode'.

Added file test/CodeGen/X86/vec_shift5.ll to verify that packed
shifts by immediate are correctly folded into a build_vector when the
input vector to the shift dag node is a vector of constants or undefs.

llvm-svn: 198113

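As a rough illustration of the fold this commit describes, the sketch below applies a shift by an immediate count to every constant lane of a vector and produces a new constant vector. It is not the code in `getTargetVShiftByConstNode`: the `Lane` struct, `ShiftKind`, and both helper functions are hypothetical, lanes are fixed at 32 bits, the shift amount is assumed to be less than 32, and keeping UNDEF lanes as UNDEF is an assumption of the sketch.

```cpp
#include <cstdint>
#include <vector>

// One lane of a constant vector: either a 32-bit value or UNDEF.
struct Lane {
  bool IsUndef;
  uint32_t Value;
};

enum class ShiftKind { Shl, Srl, Sra };

// Shifts a single 32-bit constant. Assumes Amount < 32.
inline uint32_t shiftLane(uint32_t V, ShiftKind Kind, unsigned Amount) {
  switch (Kind) {
  case ShiftKind::Shl:
    return V << Amount;
  case ShiftKind::Srl:
    return V >> Amount;
  case ShiftKind::Sra: {
    // Arithmetic shift: replicate the sign bit into the vacated high bits.
    uint32_t Logical = V >> Amount;
    uint32_t SignFill = (V & 0x80000000u) ? ~(0xFFFFFFFFu >> Amount) : 0u;
    return Logical | SignFill;
  }
  }
  return V; // Not reached for valid ShiftKind values.
}

// Folds the shift into a new constant vector, keeping UNDEF lanes UNDEF.
inline std::vector<Lane> foldShiftByImm(const std::vector<Lane> &In,
                                        ShiftKind Kind, unsigned Amount) {
  std::vector<Lane> Out;
  Out.reserve(In.size());
  for (const Lane &L : In)
    Out.push_back(L.IsUndef ? L
                            : Lane{false, shiftLane(L.Value, Kind, Amount)});
  return Out;
}
```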

ARMAsmParser: fix typo in comment · 83e3770a
Saleem Abdulrasool authored Dec 28, 2013
```
llvm-svn: 198095
```

Disable transforms that introduce calls to exp10*() on Linux due to · f5689f83

Chandler Carruth authored Dec 28, 2013

widespread glibc bugs.

The glibc implementation of exp10 has a very serious precision bug in
version 2.15 (and older versions). This is still very widely used (the
current Ubuntu LTS for example uses it) and so it isn't reasonable to
make transforms that produce these functions. This fixes many
miscompiles introduced when we started transforming pow(10.0, ...) into
exp10, and it may have fixed other latent miscompiles where exp10
provided sufficient precision but exp10f did not.

This is all really horrible. The primary bug has been fixed for over
a year and glibc 2.18 works correctly for the test cases I have, but it
will be 2017 before the LTS using 2.15 is no longer supported by Ubuntu
(and thus reasonable for folks to be relying on). =[ We're either going
to need to live without these optimizations, or find a way to switch
behavior more dynamically than using simply the fact that the OS is
"Linux".

To make matters worse, there appears to be significant testing and
fixing of numerous other bugs in the exp10 family of functions right now
in glibc. While those haven't been causing problems I've seen in the
wild, it gives me concerns that we may need to wait until an even later
release of glibc before we can reliably transform code into exp10.

llvm-svn: 198093

Dec 26, 2013
- TLI: Make exp10* available on Linux/Mac/iOS and unavailable elsewhere · f4355eef
  Reid Kleckner authored Dec 26, 2013
```
This makes it unavailable on NetBSD, Android, etc.

Patch by Brad Smith!

llvm-svn: 198056
```
- Recognize armv7a and friends as aliases for armv7-a etc. for the purpose · a13f8b4f
  Joerg Sonnenberger authored Dec 26, 2013
```
of architecture naming.

llvm-svn: 198043
```
- ARM IAS: support .even directive · a554968d
  Saleem Abdulrasool authored Dec 26, 2013
```
The .even directive aligns content to an even-numbered address. This is an ARM
specific directive applicable to any section.

llvm-svn: 198031
```
- [Sparc] Lower MachineInstr to MC and print assembly using MCInstPrinter. · bf683fd1
  Venkatraman Govindaraju authored Dec 26, 2013
```
llvm-svn: 198030
```
- [Sparc] Add target specific MCExpr class to handle sparc specific modifiers like %hi, %lo, etc. · 08bcf290
  Venkatraman Govindaraju authored Dec 26, 2013
```
llvm-svn: 198029
```
- [Sparc] Add MCInstPrinter implementation for SPARC. · 0b938652
  Venkatraman Govindaraju authored Dec 25, 2013
```
llvm-svn: 198028
```
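For the `.even` directive commit above (a554968d), aligning to an even-numbered address amounts to rounding up to a multiple of two. The helper below is a hypothetical illustration of that arithmetic only. It does not show how the assembler emits padding.

```cpp
#include <cstdint>

// Rounds an address up to the next even value (2-byte alignment).
// Already-even addresses are returned unchanged.
constexpr uint64_t alignToEven(uint64_t Address) {
  return (Address + 1) & ~uint64_t{1};
}
```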
Dec 25, 2013
- [Mips] Does not take into account 'use-soft-float' attribute's value when · fde102cb
  Simon Atanasyan authored Dec 25, 2013
```
considering whether to generate stubs for mips16 hard-float mode.

Patch reviewed by Reed Kotler.

llvm-svn: 198019
```
- AVX-512: decoder for AVX-512, made by Alexey Bader. · 371e3638
  Elena Demikhovsky authored Dec 25, 2013
```
llvm-svn: 198013
```
- Support for microMIPS load effective address. · bd28c373
  Zoran Jovanovic authored Dec 25, 2013
```
llvm-svn: 198010
```
- Support for microMIPS FPU instructions 2. · 8876be39
  Zoran Jovanovic authored Dec 25, 2013
```
llvm-svn: 198009
```
- AVX-512: Result type of scalar SETCC is MVT::i1 for AVX-512. · b64d7e85
  Elena Demikhovsky authored Dec 25, 2013
```
llvm-svn: 198008
```
- [AArch64] Fix the incorrect register order of fmls/fmla by element. · 83799741
  Hao Liu authored Dec 25, 2013
```
E.g. the codegen result is 
     fmls v1.2s, v0.2s, v2.s[3]
which is expected to be
     fmls v0.2s, v1.2s, v2.s[3]

llvm-svn: 198001
```
Dec 24, 2013

Fix typo. · 002019a2
Richard Sandiford authored Dec 24, 2013
```
llvm-svn: 197986
```

[SystemZ] Use interlocked-access 1 instructions for CodeGen · 41350a52

Richard Sandiford authored Dec 24, 2013

...namely LOAD AND ADD, LOAD AND AND, LOAD AND OR and LOAD AND EXCLUSIVE OR.
LOAD AND ADD LOGICAL isn't really separately useful for LLVM.

I'll look at reusing the CC results in the new year.

llvm-svn: 197985

[SystemZ] Add MC support for interlocked-access 1 instructions · 45645a2c
Richard Sandiford authored Dec 24, 2013
```
llvm-svn: 197984
```
AVX-512: fixed some patterns for MVT::i1 · 64c9548d
Elena Demikhovsky authored Dec 24, 2013
```
llvm-svn: 197981
```
[AArch64] Add patterns to match normal shift nodes: shl, sra and srl. · ce7a12be
Hao Liu authored Dec 24, 2013
```
llvm-svn: 197969
```

[AArch64 NEON] Fix a bug when lowering BUILD_VECTOR. · 82bd84aa

Kevin Qin authored Dec 24, 2013

DAG.getVectorShuffle() doesn't always return a vector_shuffle node.
If the mask is exactly the in-order sequence of its operand's lanes (for
example, operand_0 is v8i8 and the mask is 0, 1, 2, 3, 4, 5, 6, 7), it
returns that operand directly. So a check is added here.

llvm-svn: 197967

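The condition described in this commit, a mask that simply lists the operand's lanes in order, can be sketched as a standalone check. `isIdentityMask` is a hypothetical helper, not part of the SelectionDAG API, and it ignores undefined mask entries for simplicity.

```cpp
#include <cstddef>
#include <vector>

// Returns true when the mask selects lanes 0, 1, ..., N-1 in order, which
// is the case where a shuffle would just reproduce its first operand.
inline bool isIdentityMask(const std::vector<int> &Mask) {
  for (std::size_t I = 0; I < Mask.size(); ++I)
    if (Mask[I] != static_cast<int>(I))
      return false;
  return true;
}
```

A caller that expects a genuine shuffle node would need to test for this case first, which is the kind of check the commit message refers to.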

[AArch64 NEON] Fix a pattern match failure with NEON_VDUP. · cd5f3153

Kevin Qin authored Dec 24, 2013

This failure was caused by an improper condition when lowering shuffle_vector
to scalar_to_vector. After this patch NEON_VDUP with v1i64 will not
be generated.

llvm-svn: 197966

[AArch64] Check fmul node single use in fused multiply patterns · bc2996b3

Ana Pazos authored Dec 24, 2013

Check for single use of fmul node in fused multiply patterns
to allow generation of fused multiply add/sub instructions.
Otherwise the fmul operation ends up being repeated more than
once, which does not help performance on targets with
only one MAC unit, such as cortex-a53.

llvm-svn: 197929

[AArch64 NEON] Fixed fused multiply negate add/sub patterns · 3ca23915

Ana Pazos authored Dec 24, 2013

The correct pattern matching should be:

- fnmadd is (-Ra) + (-Rn)*Rm  which should be matched as:

  fma (fneg node:$Rn),  node:$Rm, (fneg node:$Ra) and as

  (f32 (fsub (f32 (fneg FPR32:$Ra)), (f32 (fmul FPR32:$Rn, FPR32:$Rm))))

- fnmsub is (-Ra) + Rn*Rm which should be matched as

  fma node:$Rn,  node:$Rm, (fneg node:$Ra) and as

  (f32 (fsub (f32 (fmul FPR32:$Rn, FPR32:$Rm)), FPR32:$Ra))

llvm-svn: 197928

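The two forms listed for each instruction compute the same value, which can be checked with a small scalar sketch. The function names are hypothetical, and `std::fma` stands in for the `fma` node. The fused and unfused forms round differently in general, so they are only guaranteed to agree when the product is exactly representable.

```cpp
#include <cmath>

// fnmadd: (-Ra) + (-Rn) * Rm, as a fused operation and as fsub of fneg/fmul.
inline float fnmaddAsFma(float Rn, float Rm, float Ra) {
  return std::fma(-Rn, Rm, -Ra);
}
inline float fnmaddAsSub(float Rn, float Rm, float Ra) {
  return (-Ra) - (Rn * Rm);
}

// fnmsub: (-Ra) + Rn * Rm, as a fused operation and as fsub of fmul.
inline float fnmsubAsFma(float Rn, float Rm, float Ra) {
  return std::fma(Rn, Rm, -Ra);
}
inline float fnmsubAsSub(float Rn, float Rm, float Ra) {
  return (Rn * Rm) - Ra;
}
```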

Dec 23, 2013

Debug info: On ARM ensure that the data sections come before the · edb61f02

Adrian Prantl authored Dec 23, 2013

(optional) DWARF sections, so compiling with -g does not result in
different code being generated.

rdar://problem/15623193

llvm-svn: 197922

ARM: bkpt has an implicit immediate constant 0 · 70187554

Saleem Abdulrasool authored Dec 23, 2013



The bkpt mnemonic has an implicit immediate constant of 0 unless otherwise
specified.  Add an instruction alias for the unvalued breakpoint mnemonic to
treat it as a 0.  This improves compatibility with GNU AS.

Signed-off-by: Saleem Abdulrasool <compnerd@compnerd.org>
llvm-svn: 197913

Dec 22, 2013
- Use r2 when encoding tls on ppc32. Fixes PR18305. · bc1655b4
  Roman Divacky authored Dec 22, 2013
```
llvm-svn: 197878
```
- AVX512: SETCC returns i1 for AVX-512 and i8 for all others · fe24a30e
  Elena Demikhovsky authored Dec 22, 2013
```
llvm-svn: 197876
```
- Add some comments. · 8854e769
  Roman Divacky authored Dec 22, 2013
```
llvm-svn: 197875
```
Dec 20, 2013
- ARM AnalyzeBranch should ignore DEBUG_VALUES while analyzing terminators. · 18c98a58
  Lang Hames authored Dec 20, 2013
```
Found by inspection by Julien Lerouge. Thanks Julien!

llvm-svn: 197833
```