Commits · 3eb78ec974c395dd92d63f6fe912a38c133188ea · Roger Ferrer / llvm-epi-0.8

Apr 05, 2013

Add obj2yaml to test dependencies · 3eb78ec9
Alexey Samsonov authored Apr 05, 2013
```
llvm-svn: 178852
```
3eb78ec9

Fix for PR14824: "Optimization arm_ldst_opt inserts newly generated... · b309b3b3

Stepan Dyatkovskiy authored Apr 05, 2013

Fix for PR14824: "Optimization arm_ldst_opt inserts newly generated instruction vldmia at incorrect position".
Patch introduces memory operands tracking in ARMLoadStoreOpt::LoadStoreMultipleOpti. For each register it keeps the order of load operations as it was before optimization pass.
It is a deeper version of the fix proposed by Hao: http://llvm.org/bugs/show_bug.cgi?id=14824#c4
But it also tracks conflicts between different register classes (e.g. D2 and S5).
For more details see:
Bug description: http://llvm.org/bugs/show_bug.cgi?id=14824
LLVM Commits discussion:
http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20130311/167936.html
http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20130318/168688.html
http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20130325/169376.html
http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20130401/170238.html
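
The D2/S5 conflict mentioned above comes from register aliasing: on ARM VFP, D2 occupies the same storage as S4 and S5. The following is a minimal sketch of such an overlap check, not the code from the patch. The helper names `byte_range` and `overlaps` are hypothetical, and the model assumes the usual S/D/Q layout in which register n of each bank starts at n times its width.

```python
# Byte width of each view of the VFP/NEON register file
# (S = 32-bit, D = 64-bit, Q = 128-bit). Hypothetical model for illustration.
WIDTH = {"S": 4, "D": 8, "Q": 16}


def byte_range(reg):
    """Return the [start, end) byte span a register name such as 'D2' occupies."""
    bank, index = reg[0].upper(), int(reg[1:])
    start = index * WIDTH[bank]
    return start, start + WIDTH[bank]


def overlaps(a, b):
    """True when the two registers share at least one byte of storage."""
    a_start, a_end = byte_range(a)
    b_start, b_end = byte_range(b)
    return a_start < b_end and b_start < a_end


print(overlaps("D2", "S5"))  # D2 covers bytes 16..23, S5 covers bytes 20..23
print(overlaps("D2", "S3"))  # S3 covers bytes 12..15
```

A pass that reorders loads has to treat two such overlapping registers as the same resource even though they belong to different register classes.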

llvm-svn: 178851

b309b3b3

The ppc bots say this is the last broken line, so let's try one more :-( · 4e1e3e75
Rafael Espindola authored Apr 05, 2013
```
llvm-svn: 178849
```
4e1e3e75
One more try before I just delete the macho bits until tomorrow. · 1218a40c
Rafael Espindola authored Apr 05, 2013
```
llvm-svn: 178847
```
1218a40c

More test loosening. · 531efab6

Rafael Espindola authored Apr 05, 2013

Sorry for so many commits, but llvm is still building on my ppc vm.

llvm-svn: 178843

531efab6

Loosen this test too. · 61ad7493
Rafael Espindola authored Apr 05, 2013
```
llvm-svn: 178841
```
61ad7493

Loosen this test. · b080267b

Rafael Espindola authored Apr 05, 2013

Looks like there is a big endian/little endian problem here. Loosen the
test to try to get the bots green while llvm builds on a ppc qemu vm.

The failure was in http://lab.llvm.org:8011/builders/clang-ppc64-elf-linux2/

llvm-svn: 178839

b080267b

Add a test for obj2yaml in preparation for refactoring it. · 599e810a
Rafael Espindola authored Apr 05, 2013
```
llvm-svn: 178829
```
599e810a
RegisterPressure heuristics currently require signed comparisons. · 80e66ce0
Andrew Trick authored Apr 05, 2013
```
llvm-svn: 178823
```
80e66ce0

LoopVectorizer: Pass OperandValueKind information to the cost model · df6f67ed

Arnold Schwaighofer authored Apr 04, 2013

Pass down the fact that an operand is going to be a vector of constants.

This should bring the performance of MultiSource/Benchmarks/PAQ8p/paq8p on x86
back. It had degraded to scalar performance due to my previous shift cost change
that made all shifts expensive on x86.

radar://13576547

llvm-svn: 178809

df6f67ed

X86 cost model: Differentiate cost for vector shifts of constants · 44f902ed

Arnold Schwaighofer authored Apr 04, 2013

SSE2 has efficient support for shifts by a scalar. My previous change of making
shifts expensive did not take this into account marking all shifts as expensive.
This would prevent vectorization from happening where it is actually beneficial.

With this change we differentiate between shifts of constants and other shifts.

radar://13576547

llvm-svn: 178808

44f902ed

PPC: Improve code generation for mixed-precision reciprocal sqrt · f96c18e3

Hal Finkel authored Apr 04, 2013

The DAGCombine logic that recognized a/sqrt(b) and transformed it into
a multiplication by the reciprocal sqrt did not handle cases where the
sqrt and the division were separated by an fpext or fptrunc.

llvm-svn: 178801

f96c18e3

Apr 04, 2013

Disable 2010-10-01-crash.ll for Hexagon as the Hexagon frontend will · bc03a979
Jyotsna Verma authored Apr 04, 2013
```
never produce a byval parameter with size < 8 bytes.

llvm-svn: 178792
```
bc03a979

Add back parsing of header characteristics. · 7733466c

Rafael Espindola authored Apr 04, 2013

It had been dropped during the switch to yaml::IO. Also add a test going
from yaml2obj to llvm-readobj. It can be extended as we add more
fields/formats to yaml2obj.

llvm-svn: 178786

7733466c

[XCore] Add bru instruction. · 0c12d185
Richard Osborne authored Apr 04, 2013
```
llvm-svn: 178783
```
0c12d185

[XCore] The RRegs register class is a superset of GRRegs. · f18d95f7

Richard Osborne authored Apr 04, 2013

At the time when the XCore backend was added there were some issues
with overlapping register classes but these all seem to be fixed now.
Describing the register classes correctly allows us to get rid of a
codegen only instruction (LDAWSP_lru6_RRegs) and it means we can
disassemble ru6 instructions that use registers above r11.

llvm-svn: 178782

f18d95f7

Avoid high-latency false CPSR dependencies even for tMOVSi. · 299475e0

Jakob Stoklund Olesen authored Apr 04, 2013

The Thumb2SizeReduction pass avoids false CPSR dependencies, except it
still aggressively creates tMOVi8 instructions because they are so
common.

Avoid creating false CPSR dependencies even for tMOVi8 instructions when
the CPSR flags are known to have high latency. This allows integer
computation to overlap floating point computations.

Also process blocks in a reverse post-order and propagate high-latency
flags to successors.
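
The traversal described above can be sketched roughly as follows. This is an illustration only, not the pass's code: the function names, the CFG encoding as a dictionary of successor lists, and the three-valued `exit_effect` map are all assumptions made for the example.

```python
def reverse_post_order(successors, entry):
    """Order blocks so that each one comes before its forward-edge successors."""
    seen, post_order = set(), []

    def visit(block):
        seen.add(block)
        for succ in successors.get(block, ()):
            if succ not in seen:
                visit(succ)
        post_order.append(block)

    visit(entry)
    return post_order[::-1]


def high_latency_on_entry(successors, entry, exit_effect):
    """Propagate a 'flags have high latency' bit from blocks to their successors.

    exit_effect[b] is True or False when block b redefines the flags with high
    or low latency, and missing (None) when b leaves the flags untouched.
    """
    on_entry, on_exit = {}, {}
    for block in reverse_post_order(successors, entry):
        preds = [p for p, succs in successors.items() if block in succs]
        # Predecessors that have not been visited yet are back-edges; skip them.
        on_entry[block] = any(on_exit.get(p, False) for p in preds)
        effect = exit_effect.get(block)
        on_exit[block] = on_entry[block] if effect is None else effect
    return on_entry


cfg = {"entry": ["fp", "int"], "fp": ["join"], "int": ["join"], "join": []}
print(reverse_post_order(cfg, "entry"))
print(high_latency_on_entry(cfg, "entry", {"fp": True}))
```

Visiting in reverse post-order means every forward predecessor has been processed before a block is reached, so a single sweep is enough for this kind of forward propagation.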

<rdar://problem/13468102>

llvm-svn: 178773

299475e0

New-password-test commit. · e58df62e
Stepan Dyatkovskiy authored Apr 04, 2013
```
llvm-svn: 178765
```
e58df62e
R600: Take export into account when computing cf address · c44fa997
Vincent Lejeune authored Apr 04, 2013
```
llvm-svn: 178761
```
c44fa997
Propagate path to ASan/MSan symbolizer into test environment to produce useful reports on errors. · e2c772a1
Alexey Samsonov authored Apr 04, 2013
```
llvm-svn: 178749
```
e2c772a1

Add SPARC v9 support for select on 64-bit compares. · 8cfaffaa

Jakob Stoklund Olesen authored Apr 04, 2013

This requires v9 cmov instructions using the %xcc flags instead of the
%icc flags.

Still missing:
- Select floats on %xcc flags.
- Select i64 on %fcc flags.

llvm-svn: 178737

8cfaffaa

Apr 03, 2013

X86 cost model: Vector shifts are expensive in most cases · e9b50164

Arnold Schwaighofer authored Apr 03, 2013

The default logic does not correctly identify costs of casts because they are
marked as custom on x86.

For some cases, where the shift amount is a scalar we would be able to generate
better code. Unfortunately, when this is the case the value (the splat) will get
hoisted out of the loop, thereby making it invisible to ISel.

radar://13130673
radar://13537826

llvm-svn: 178703

e9b50164

Implement the "mips endian" for r_info. · 2025e8b8

Rafael Espindola authored Apr 03, 2013

Normally r_info is just a 32 or 64 bit number matching the endian of the rest
of the file. Unfortunately, mips 64 bit little endian is special: The top 32
bits are a little endian number and the following 32 are a big endian one.
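
As a rough illustration of the layout described above, the sketch below decodes the eight on-disk bytes of a mips64el r_info into the conventional `(symbol << 32) | type` value. The function names are hypothetical, and the assumption that the symbol half comes first in the file is a reading of this message, not something it states explicitly.

```python
def decode_mips64el_r_info(raw):
    """Decode 8 on-disk bytes into a conventional 64-bit r_info value.

    Assumption: the first four bytes hold the symbol index as a little-endian
    number and the last four hold the type word as a big-endian number.
    """
    if len(raw) != 8:
        raise ValueError("r_info must be exactly 8 bytes")
    symbol = int.from_bytes(raw[0:4], "little")
    type_word = int.from_bytes(raw[4:8], "big")
    return (symbol << 32) | type_word


def decode_plain_r_info(raw, byteorder):
    """The ordinary case: one number in the file's byte order."""
    return int.from_bytes(raw, byteorder)


raw = bytes([0x05, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x12])
print(hex(decode_mips64el_r_info(raw)))
print(hex(decode_plain_r_info(raw, "little")))
```

The two results differ for the same bytes, which is why a reader that treats r_info as a plain little-endian number sees the wrong symbol and type on this target.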

llvm-svn: 178694

2025e8b8

[XCore] Check disassembly of the st8 instruction. · 122acb21
Richard Osborne authored Apr 03, 2013
```
llvm-svn: 178689
```
122acb21
[XCore] Update disassembler test to improve coverage of the instructions. · fb0b4ea3
Richard Osborne authored Apr 03, 2013
```
Previously some instructions were unintentionally covered twice and
others were not covered at all.

llvm-svn: 178688
```
fb0b4ea3

Implements low-level object file format specific output for COFF and · 9cad53cf

Eric Christopher authored Apr 03, 2013

ELF with support for:

- File headers
- Section headers + data
- Relocations
- Symbols
- Unwind data (only COFF/Win64)

The output format follows a few rules:
- Values are almost always output one per line (as elf-dump/coff-dump already do).
- Many values are translated to something readable (like enum names), with the raw value in parentheses.
- Hex numbers are output in uppercase, prefixed with "0x".
- Flags are sorted alphabetically.
- Lists and groups are always delimited.

Example output:
---------- snip ----------
Sections [
  Section {
    Index: 1
    Name: .text (5)
    Type: SHT_PROGBITS (0x1)
    Flags [ (0x6)
      SHF_ALLOC (0x2)
      SHF_EXECINSTR (0x4)
    ]
    Address: 0x0
    Offset: 0x40
    Size: 33
    Link: 0
    Info: 0
    AddressAlignment: 16
    EntrySize: 0
    Relocations [
      0x6 R_386_32 .rodata.str1.1 0x0
      0xB R_386_PC32 puts 0x0
      0x12 R_386_32 .rodata.str1.1 0x0
      0x17 R_386_PC32 puts 0x0
    ]
    SectionData (
      0000: 83EC04C7 04240000 0000E8FC FFFFFFC7  |.....$..........|
      0010: 04240600 0000E8FC FFFFFF31 C083C404  |.$.........1....|
      0020: C3                                   |.|
    )
  }
]
---------- snip ----------
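
The formatting rules listed above (uppercase hex with a "0x" prefix, flags sorted alphabetically, readable names followed by the raw value, delimited groups) can be mimicked with a small sketch. It is not the tool's implementation; `hex_upper` and `format_flags` are hypothetical helpers, and only three section flags are included.

```python
def hex_upper(value):
    """Render a number as uppercase hex with a '0x' prefix."""
    return "0x" + format(value, "X")


def format_flags(raw, names):
    """Render a flags field as a delimited, alphabetically sorted group."""
    set_names = sorted(name for name, bit in names.items() if raw & bit)
    lines = ["Flags [ (" + hex_upper(raw) + ")"]
    lines += ["  " + name + " (" + hex_upper(names[name]) + ")" for name in set_names]
    lines.append("]")
    return "\n".join(lines)


# A few ELF section flags, enough to reproduce the Flags group in the example.
SHF = {"SHF_WRITE": 0x1, "SHF_ALLOC": 0x2, "SHF_EXECINSTR": 0x4}
print(format_flags(0x6, SHF))
```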

Relocations and symbols can be output standalone or together with the section header as displayed in the example.
This feature set supports all tests in test/MC/COFF and test/MC/ELF (and I suspect all additional tests using elf-dump), making elf-dump and coff-dump deprecated.

Patch by Nico Rieck!

llvm-svn: 178679

9cad53cf

Implement sectionContainsSymbol for ELF. · 8d67ab4f
Eric Christopher authored Apr 03, 2013
```
Patch by Nico Rieck!

llvm-svn: 178677
```
8d67ab4f
When dumping clear the arm/thumb flag for now. · d5972ea8
Eric Christopher authored Apr 03, 2013
```
Patch by Nico Rieck!

llvm-svn: 178676
```
d5972ea8
R600: Fix last ALU of a clause being emitted in a separate clause · c3d3f9b6
Vincent Lejeune authored Apr 03, 2013
```
llvm-svn: 178675
```
c3d3f9b6

Fix PR15632: No support for ppcf128 floating-point remainder on PowerPC. · 92e26646

Bill Schmidt authored Apr 03, 2013

For this we need to use a libcall.  Previously LLVM didn't implement
libcall support for frem, so I've added it in the usual
straightforward manner.  A test case from the bug report is included.

llvm-svn: 178639

92e26646

AArch64: implement ETMv4 trace system registers. · 5816ca11
Tim Northover authored Apr 03, 2013
```
llvm-svn: 178637
```
5816ca11
Temporarily relax the WIN32 checks in the SRet test to fix the Atom D2700 bot · 7205c72d
Timur Iskhodzhanov authored Apr 03, 2013
```
llvm-svn: 178635
```
7205c72d
Fix SRet for thiscall in i686-pc-win32 · f4e0665e
Timur Iskhodzhanov authored Apr 03, 2013
```
llvm-svn: 178634
```
f4e0665e

Add 64-bit compare + branch for SPARC v9. · d9bbdfd3

Jakob Stoklund Olesen authored Apr 03, 2013

The same compare instruction is used for 32-bit and 64-bit compares. It
sets two different sets of flags: icc and xcc.

This patch adds a conditional branch instruction using the xcc flags for
64-bit compares.
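
To see why a separate xcc branch is needed, the sketch below models one subtract-and-set-flags compare producing both flag sets: icc from the low 32 bits of the operands and xcc from all 64. It is a simplified model under stated assumptions (N, Z, V, C computed as for a subtraction, with C meaning borrow), and the function names are hypothetical.

```python
def subcc_flags(a, b, bits):
    """Condition codes for a - b evaluated at the given width."""
    mask = (1 << bits) - 1
    sign = 1 << (bits - 1)
    a &= mask
    b &= mask
    result = (a - b) & mask
    return {
        "n": bool(result & sign),                   # result is negative
        "z": result == 0,                           # result is zero
        "v": bool((a ^ b) & (a ^ result) & sign),   # signed overflow
        "c": a < b,                                 # unsigned borrow
    }


def compare(a, b):
    """One compare, two flag sets: icc (32-bit view) and xcc (64-bit view)."""
    return {"icc": subcc_flags(a, b, 32), "xcc": subcc_flags(a, b, 64)}


flags = compare(1 << 32, 0)
print(flags["icc"]["z"], flags["xcc"]["z"])
```

For operands that differ only above bit 31, icc reports equality while xcc does not, so a 64-bit branch has to test xcc.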

llvm-svn: 178621

d9bbdfd3

Use PPC reciprocal estimates with Newton iteration in fast-math mode · 2e103310

Hal Finkel authored Apr 03, 2013

When unsafe FP math operations are enabled, we can use the fre[s] and
frsqrte[s] instructions, which generate reciprocal (sqrt) estimates, together
with some Newton iteration, in order to quickly generate floating-point
division and sqrt results. All of these instructions are separately optional,
and so each has its own feature flag (except for the Altivec instructions,
which are covered under the existing Altivec flag). Doing this is not only
faster than using the IEEE-compliant fdiv/fsqrt instructions, but allows these
computations to be pipelined with other computations in order to hide their
overall latency.
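
The estimate-plus-Newton approach described above can be sketched numerically as follows. This is an illustration, not the generated code: the hardware estimate instructions are simulated by `rough`, a hypothetical helper that injects a relative error of 2^-8, and the step count of 3 is an assumption chosen so the example reaches double precision.

```python
def rough(value, rel_error=2.0 ** -8):
    """Stand-in for a hardware estimate: the exact value with a small relative error."""
    return value * (1.0 + rel_error)


def reciprocal(b, steps=3):
    """Refine an estimate of 1/b with Newton steps x <- x * (2 - b * x)."""
    x = rough(1.0 / b)
    for _ in range(steps):
        x = x * (2.0 - b * x)
    return x


def reciprocal_sqrt(b, steps=3):
    """Refine an estimate of 1/sqrt(b) with steps x <- x * (1.5 - 0.5 * b * x * x)."""
    x = rough(b ** -0.5)
    for _ in range(steps):
        x = x * (1.5 - 0.5 * b * x * x)
    return x


def fast_divide(a, b):
    """a / b computed as a multiplication by the refined reciprocal."""
    return a * reciprocal(b)


def fast_sqrt(b):
    """sqrt(b) computed as b * (1 / sqrt(b)); b must be positive here."""
    return b * reciprocal_sqrt(b)


print(fast_divide(1.0, 3.0))
print(fast_sqrt(2.0))
```

Each step uses only multiplies and subtractions, and the relative error is roughly squared per step, which is why a few iterations suffice and why the work pipelines well with other computations.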

I've also added a couple of fnmsub patterns which turned out to be
missing (but are necessary for good code generation of the Newton iterations).
Altivec needs a similar fix, but that will probably be more complicated because
fneg is expanded for Altivec's v4f32.

llvm-svn: 178617

2e103310

Fix the fde encoding used by mips to match gas. · b9b7ae0c

Rafael Espindola authored Apr 03, 2013

This finally fixes the encoding. The patch also
* Removes eh-frame.ll. It was an unnecessary .ll to .o test that was checking
  the wrong value.
* Merge fde-reloc.s and eh-frame.s into a single test, since the only difference
  was the run lines.
* Don't blindly test the content of the entire .eh_frame section. It makes it
  hard for anyone actually fixing a bug and hitting a difference in a binary
  blob. Instead, use a CHECK for each field and document what is being checked.

llvm-svn: 178615

b9b7ae0c

Remove an optimization where we were changing an objc_autorelease into an... · b8c88365

Michael Gottesman authored Apr 03, 2013

Remove an optimization where we were changing an objc_autorelease into an objc_autoreleaseReturnValue.

The semantics of ARC implies that a pointer passed into an objc_autorelease
must live until some point (potentially down the stack) where an
autorelease pool is popped. On the other hand, an
objc_autoreleaseReturnValue just signifies that the object must live
until the end of the given function at least.

Thus objc_autorelease is stronger than objc_autoreleaseReturnValue in
terms of the semantics of ARC* implying that performing the given
strength reduction without any knowledge of how this relates to
the autorelease pool pop that is further up the stack violates the
semantics of ARC.

*Even though objc_autoreleaseReturnValue is more computationally expensive
when you know that no RV optimization will occur.

llvm-svn: 178612

b8c88365

[mips] Small update to the implementation of eh.return for Mips. · 023c678a

Akira Hatanaka authored Apr 02, 2013

This patch initializes t9 to the handler address, but only if the relocation
model is pic. This handles the case where the handler to which eh.return jumps
points to the start of the function.

Patch by Sasa Stankovic.

llvm-svn: 178588

023c678a

Support and test template arguments for unions. · 6476f908
Eric Christopher authored Apr 02, 2013
```
llvm-svn: 178586
```
6476f908

llvm/test/CodeGen/X86: Unmark them out of XFAIL:cygming, in atomic{32|64}.ll... · fc613f4d

NAKAMURA Takumi authored Apr 02, 2013

llvm/test/CodeGen/X86: Unmark them out of XFAIL:cygming, in atomic{32|64}.ll and handle-move.ll, corresponding to r178549.

This reverts r176808, r176798, and r177914.

llvm-svn: 178583

fc613f4d