Commits · e2c772a1b06b9f202e56e466696fdc25463210ed · Roger Ferrer / llvm-epi

Apr 04, 2013
- Propagate path to ASan/MSan symbolizer into test environment to produce useful reports on errors. · e2c772a1
  Alexey Samsonov authored Apr 04, 2013
  
  llvm-svn: 178749
  e2c772a1
- Add SPARC v9 support for select on 64-bit compares. · 8cfaffaa
  Jakob Stoklund Olesen authored Apr 04, 2013
  
  This requires v9 cmov instructions using the %xcc flags instead of the %icc flags. Still missing: - Select floats on %xcc flags. - Select i64 on %fcc flags. llvm-svn: 178737
  8cfaffaa
Apr 03, 2013

X86 cost model: Vector shifts are expensive in most cases · e9b50164

Arnold Schwaighofer authored Apr 03, 2013

The default logic does not correctly identify costs of casts because they are
marked as custom on x86.

For some cases, where the shift amount is a scalar we would be able to generate
better code. Unfortunately, when this is the case the value (the splat) will get
hoisted out of the loop, thereby making it invisible to ISel.

radar://13130673
radar://13537826

llvm-svn: 178703

e9b50164

Implement the "mips endian" for r_info. · 2025e8b8

Rafael Espindola authored Apr 03, 2013

Normally r_info is just a 32 of 64 bit number matching the endian of the rest
of the file. Unfortunately, mips 64 bit little endian is special: The top 32
bits are a little endian number and the following 32 are a big endian one.

llvm-svn: 178694

2025e8b8

[XCore] Check disassembly of the st8 instruction. · 122acb21
Richard Osborne authored Apr 03, 2013
```
llvm-svn: 178689
```
122acb21
[XCore] Update disassembler test to improve coverage of the instructions. · fb0b4ea3
Richard Osborne authored Apr 03, 2013
```
Previously some instructions were unintentionally covered twice and
others were not covered at all.

llvm-svn: 178688
```
fb0b4ea3

Implements low-level object file format specific output for COFF and · 9cad53cf

Eric Christopher authored Apr 03, 2013

ELF with support for:

- File headers
- Section headers + data
- Relocations
- Symbols
- Unwind data (only COFF/Win64)

The output format follows a few rules:
- Values are almost always output one per line (as elf-dump/coff-dump already do). - Many values are translated to something readable (like enum names), with the raw value in parentheses.
- Hex numbers are output in uppercase, prefixed with "0x".
- Flags are sorted alphabetically.
- Lists and groups are always delimited.

Example output:
---------- snip ----------
Sections [
  Section {
    Index: 1
    Name: .text (5)
    Type: SHT_PROGBITS (0x1)
    Flags [ (0x6)
      SHF_ALLOC (0x2)
      SHF_EXECINSTR (0x4)
    ]
    Address: 0x0
    Offset: 0x40
    Size: 33
    Link: 0
    Info: 0
    AddressAlignment: 16
    EntrySize: 0
    Relocations [
      0x6 R_386_32 .rodata.str1.1 0x0
      0xB R_386_PC32 puts 0x0
      0x12 R_386_32 .rodata.str1.1 0x0
      0x17 R_386_PC32 puts 0x0
    ]
    SectionData (
      0000: 83EC04C7 04240000 0000E8FC FFFFFFC7  |.....$..........|
      0010: 04240600 0000E8FC FFFFFF31 C083C404  |.$.........1....|
      0020: C3                                   |.|
    )
  }
]
---------- snip ----------

Relocations and symbols can be output standalone or together with the section header as displayed in the example.
This feature set supports all tests in test/MC/COFF and test/MC/ELF (and I suspect all additional tests using elf-dump), making elf-dump and coff-dump deprecated.

Patch by Nico Rieck!

llvm-svn: 178679

9cad53cf

Implement sectionContainsSymbol for ELF. · 8d67ab4f
Eric Christopher authored Apr 03, 2013
```
Patch by Nico Rieck!

llvm-svn: 178677
```
8d67ab4f
When dumping clear the arm/thumb flag for now. · d5972ea8
Eric Christopher authored Apr 03, 2013
```
Patch by Nico Rieck!

llvm-svn: 178676
```
d5972ea8
R600: Fix last ALU of a clause being emitted in a separate clause · c3d3f9b6
Vincent Lejeune authored Apr 03, 2013
```
llvm-svn: 178675
```
c3d3f9b6

Fix PR15632: No support for ppcf128 floating-point remainder on PowerPC. · 92e26646

Bill Schmidt authored Apr 03, 2013

For this we need to use a libcall.  Previously LLVM didn't implement
libcall support for frem, so I've added it in the usual
straightforward manner.  A test case from the bug report is included.

llvm-svn: 178639

92e26646

AArch64: implement ETMv4 trace system registers. · 5816ca11
Tim Northover authored Apr 03, 2013
```
llvm-svn: 178637
```
5816ca11
Temporarily relax the WIN32 checks in the SRet test to fix the Atom D2700 bot · 7205c72d
Timur Iskhodzhanov authored Apr 03, 2013
```
llvm-svn: 178635
```
7205c72d
Fix SRet for thiscall in i686-pc-win32 · f4e0665e
Timur Iskhodzhanov authored Apr 03, 2013
```
llvm-svn: 178634
```
f4e0665e

Add 64-bit compare + branch for SPARC v9. · d9bbdfd3

Jakob Stoklund Olesen authored Apr 03, 2013

The same compare instruction is used for 32-bit and 64-bit compares. It
sets two different sets of flags: icc and xcc.

This patch adds a conditional branch instruction using the xcc flags for
64-bit compares.

llvm-svn: 178621

d9bbdfd3

Use PPC reciprocal estimates with Newton iteration in fast-math mode · 2e103310

Hal Finkel authored Apr 03, 2013

When unsafe FP math operations are enabled, we can use the fre[s] and
frsqrte[s] instructions, which generate reciprocal (sqrt) estimates, together
with some Newton iteration, in order to quickly generate floating-point
division and sqrt results. All of these instructions are separately optional,
and so each has its own feature flag (except for the Altivec instructions,
which are covered under the existing Altivec flag). Doing this is not only
faster than using the IEEE-compliant fdiv/fsqrt instructions, but allows these
computations to be pipelined with other computations in order to hide their
overall latency.

I've also added a couple of missing fnmsub patterns which turned out to be
missing (but are necessary for good code generation of the Newton iterations).
Altivec needs a similar fix, but that will probably be more complicated because
fneg is expanded for Altivec's v4f32.

llvm-svn: 178617

2e103310

Fix the fde encoding used by mips to match gas. · b9b7ae0c

Rafael Espindola authored Apr 03, 2013

This finally fixes the encoding. The patch also
* Removes eh-frame.ll. It was an unnecessary .ll to .o test that was checking
  the wrong value.
* Merge fde-reloc.s and eh-frame.s into a single test, since the only difference
  was the run lines.
* Don't blindly test the content of the entire .eh_frame section. It makes it
  hard to anyone actually fixing a bug and hitting a difference in a binary
  blob. Instead, use a CHECK for each field and document what is being checked.

llvm-svn: 178615

b9b7ae0c

Remove an optimization where we were changing an objc_autorelease into an... · b8c88365

Michael Gottesman authored Apr 03, 2013

Remove an optimization where we were changing an objc_autorelease into an objc_autoreleaseReturnValue.

The semantics of ARC implies that a pointer passed into an objc_autorelease
must live until some point (potentially down the stack) where an
autorelease pool is popped. On the other hand, an
objc_autoreleaseReturnValue just signifies that the object must live
until the end of the given function at least.

Thus objc_autorelease is stronger than objc_autoreleaseReturnValue in
terms of the semantics of ARC* implying that performing the given
strength reduction without any knowledge of how this relates to
the autorelease pool pop that is further up the stack violates the
semantics of ARC.

*Even though objc_autoreleaseReturnValue if you know that no RV
optimization will occur is more computationally expensive.

llvm-svn: 178612

b8c88365

[mips] Small update to the implementation of eh.return for Mips. · 023c678a

Akira Hatanaka authored Apr 02, 2013

This patch initializes t9 to the handler address, but only if the relocation
model is pic. This handles the case where handler to which eh.return jumps 
points to the start of the function.

Patch by Sasa Stankovic.

llvm-svn: 178588

023c678a

Support and test template arguments for unions. · 6476f908
Eric Christopher authored Apr 02, 2013
```
llvm-svn: 178586
```
6476f908

llvm/test/CodeGen/X86: Unmark them out of XFAIL:cygming, in atomic{32|64}.ll... · fc613f4d

NAKAMURA Takumi authored Apr 02, 2013

llvm/test/CodeGen/X86: Unmark them out of XFAIL:cygming, in atomic{32|64}.ll and handle-move.ll, corresponding to r178549.

This reverts r176808, r176798, and r177914.

llvm-svn: 178583

fc613f4d

Apr 02, 2013

Fix PR15630: Replace faulty stdcx. with stwcx. · 3581cd4b

Bill Schmidt authored Apr 02, 2013

When doing a partword atomic operation, a lwarx was being paired with
a stdcx. instead of a stwcx. when compiling for a 64-bit target.  The
target has nothing to do with it in this case; we always need a stwcx.

Thanks to Kai Nacke for reporting the problem.

llvm-svn: 178559

3581cd4b

Don't attempt MTM heuristics without a scheduling model present. · 8fbfc591
Jakob Stoklund Olesen authored Apr 02, 2013
```
This should fix the PPC buildbots.

llvm-svn: 178558
```
8fbfc591
[fast-isel] Use the correct API to disable FastLowerArguments for Win64. · 7925d280
Chad Rosier authored Apr 02, 2013
```
llvm-svn: 178549
```
7925d280

DAGCombiner: Merge store/loads when we have extload/truncstores · d6c6e868

Arnold Schwaighofer authored Apr 02, 2013

This is helps on architectures where i8,i16 are not legal but we have byte, and
short loads/stores. Allowing us to merge copies like the one below on ARM.

copy(char *a, char *b, int n) {
 do {
   int t0 = a[0];
   int t1 = a[1];
   b[0] = t0;
   b[1] = t1;

radar://13536387

llvm-svn: 178546

d6c6e868

Simplify test cases for Atom preferring call register indirect over · 95cbee6c
Preston Gurd authored Apr 02, 2013
```
call memory indirect (32 and 64 bit).

llvm-svn: 178541
```
95cbee6c

Use a worklist to avoid a sneaky iterator invalidation. · 88d06c3b

Bill Wendling authored Apr 02, 2013

The iterator could be invalidated when it's recursively deleting a whole bunch
of constant expressions in a constant initializer.

Note: This was only reproducible if `opt' was run on a `.bc' file. If `opt' was
run on a `.ll' file, it wouldn't crash. This is why the test first pushes the
`.ll' file through `llvm-as' before feeding it to `opt'.

PR15440

llvm-svn: 178531

88d06c3b

Add 64-bit load and store instructions. · 8eabc3ff
Jakob Stoklund Olesen authored Apr 02, 2013
```
There is only a few new instructions, the rest is handled with patterns.

llvm-svn: 178528
```
8eabc3ff

Basic 64-bit ALU operations. · 917e07f0

Jakob Stoklund Olesen authored Apr 02, 2013

SPARC v9 extends all ALU instructions to 64 bits, so we simply need to
add patterns to use them for both i32 and i64 values.

llvm-svn: 178527

917e07f0

Materialize 64-bit immediates. · bddb20ee

Jakob Stoklund Olesen authored Apr 02, 2013

The last resort pattern produces 6 instructions, and there are still
opportunities for materializing some immediates in fewer instructions.

llvm-svn: 178526

bddb20ee

Add 64-bit shift instructions. · c1d1a481

Jakob Stoklund Olesen authored Apr 02, 2013

SPARC v9 defines new 64-bit shift instructions. The 32-bit shift right
instructions are still usable as zero and sign extensions.

This adds new F3_Sr and F3_Si instruction formats that probably should
be used for the 32-bit shifts as well. They don't really encode an
simm13 field.

llvm-svn: 178525

c1d1a481

Add support for 64-bit calling convention. · 0b21f35a

Jakob Stoklund Olesen authored Apr 02, 2013

This is far from complete, but it is enough to make it possible to write
test cases using i64 arguments.

Missing features:
- Floating point arguments.
- Receiving arguments on the stack.
- Calls.

llvm-svn: 178523

0b21f35a

Apr 01, 2013

Mips direct object exception handling regression · 9423f507

Jack Carter authored Apr 01, 2013

Revision 177141 caused a regression in all but
mips64 little endian. That is because none of the
other Mips targets had test cases checking the 
contents of the .eh_frame section. This patch fixes
both the llvm code and adds an assembler test case 
to include the current 4 flavors.

The test cases unfortunately rely on llvm-objdump. A
preferable method would be to use a pretty printer output
such as what readelf -wf <elf_file> would give.

I also changed the name of the test case to correct a typo.

llvm-svn: 178506

9423f507

R600: Add support for native control flow · bfaa63a6
Vincent Lejeune authored Apr 01, 2013
```
llvm-svn: 178505
```
bfaa63a6
R600: Emit CF_ALU and use true kcache register. · f43bc57b
Vincent Lejeune authored Apr 01, 2013
```
llvm-svn: 178503
```
f43bc57b
Fix a bad assert in PPCTargetLowering · 3f88d089
Hal Finkel authored Apr 01, 2013
```
llvm-svn: 178489
```
3f88d089
Add triple to test/CodeGen/PowerPC/stfiwx-2 · c2eddb0d
Hal Finkel authored Apr 01, 2013
```
llvm-svn: 178486
```
c2eddb0d
Correct assertion condition · 6662fd0f
Shuxin Yang authored Apr 01, 2013
```
llvm-svn: 178484
```
6662fd0f

Merge load/store sequences with adresses: base + index + offset · 6752366e

Arnold Schwaighofer authored Apr 01, 2013

We would also like to merge sequences that involve a variable index like in the
example below.

    int index = *idx++
    int i0 = c[index+0];
    int i1 = c[index+1];
    b[0] = i0;
    b[1] = i1;

By extending the parsing of the base pointer to handle dags that contain a
base, index, and offset we can handle examples like the one above.

The dag for the code above will look something like:

 (load (i64 add (i64 copyfromreg %c)
                (i64 signextend (i8 load %index))))

 (load (i64 add (i64 copyfromreg %c)
                (i64 signextend (i32 add (i32 signextend (i8 load %index))
                                         (i32 1)))))

The code that parses the tree ignores the intermediate sign extensions. However,
if there is a sign extension it needs to be on all indexes.

 (load (i64 add (i64 copyfromreg %c)
                (i64 signextend (add (i8 load %index)
                                     (i8 1))))
 vs

 (load (i64 add (i64 copyfromreg %c)
                (i64 signextend (i32 add (i32 signextend (i8 load %index))
                                         (i32 1)))))
radar://13536387

llvm-svn: 178483

6752366e

Add more PPC floating-point conversion instructions · f6d45f23

Hal Finkel authored Apr 01, 2013

The P7 and A2 have additional floating-point conversion instructions which
allow a direct two-instruction sequence (plus load/store) to convert from all
combinations (signed/unsigned i32/i64) <--> (float/double) (on previous cores,
only some combinations were directly available).

llvm-svn: 178480

f6d45f23