Commits · 21aad9a8e8653947bd5bd13378992348ce8fc4b9 · Roger Ferrer / llvm-epi-0.8

Apr 09, 2013

Hal Finkel authored Apr 09, 2013

Some general cleanup and only scan the end of a BB for branches (once we're
done with the terminators and debug values, then there should not be any other
branches). These address post-commit review suggestions by Bill Schmidt.

No functionality change intended.

llvm-svn: 179112

21aad9a8

Revert r176408 and r176407 to address PR15540. · abcc64fd
Nadav Rotem authored Apr 09, 2013
```
llvm-svn: 179111
```
abcc64fd

[ms-inline asm] Maintain a StringRef to reference a symbol in a parsed operand, · e81309b3

Chad Rosier authored Apr 09, 2013

rather than deriving the StringRef from the Start and End SMLocs.

Using the Start and End SMLocs works fine for operands such as [Symbol], but
not for operands such as [Symbol + ImmDisp].  All existing test cases that
reference a variable exercise this patch.
rdar://13602265

llvm-svn: 179109

e81309b3

DAGCombiner: Fold a shuffle on CONCAT_VECTORS into a new CONCAT_VECTORS if possible. · bbae991d

Benjamin Kramer authored Apr 09, 2013

This pattern occurs in SROA output due to the way vector arguments are lowered
on ARM.

The testcase from PR15525 now compiles into this, which is better than the code
we got with the old scalarrepl:
_Store:
	ldr.w	r9, [sp]
	vmov	d17, r3, r9
	vmov	d16, r1, r2
	vst1.8	{d16, d17}, [r0]
	bx	lr

Differential Revision: http://llvm-reviews.chandlerc.com/D647

llvm-svn: 179106

bbae991d

Use virtual base registers on PPC · b5899d57

Hal Finkel authored Apr 09, 2013

On PowerPC, non-vector loads and stores have r+i forms; however, in functions
with large stack frames these were not being used to access slots far from the
stack pointer because such slots were out of range for the signed 16-bit
immediate offset field. This increases register pressure because we need a
separate register for each offset (when the r+r form is used). By enabling
virtual base registers, we can deal with large stack frames without unduly
increasing register pressure.

llvm-svn: 179105

b5899d57

Convert MachOObjectFile to a template. · c2413f59

Rafael Espindola authored Apr 09, 2013

For now it is templated only on being 64 or 32 bits. I will add little/big
endian next.

llvm-svn: 179097

c2413f59

DWARF parser: Fix DWARF-2/3 incompatibility: size of DW_FORM_ref_addr is the... · d60859b2

Alexey Samsonov authored Apr 09, 2013

DWARF parser: Fix DWARF-2/3 incompatibility: size of DW_FORM_ref_addr is the same as DW_FORM_addr in DWARF2, and is 4/8 bytes on 32/64-bit DWARF starting from DWARF3. Adding a test for this is a huge pain - generating and uploading pre-built binary with DWARF3 debug info is way too ugly, and writing fine-grained unittests for DebugInfo is impossible, as it doesn't expose any headers in include/llvm. That said, I'm going to choose the second approach and submit the patch exposing DebugInfo headers for review soon enough.

llvm-svn: 179095

d60859b2

Extract a function. · c910feb4
Jakob Stoklund Olesen authored Apr 09, 2013
```
llvm-svn: 179086
```
c910feb4
Revert 179071 because it is not the right way to support non standard new/new[] operators. · 7b7585d1
Nadav Rotem authored Apr 09, 2013
```
llvm-svn: 179084
```
7b7585d1

Compute correct frame sizes for SPARC v9 64-bit frames. · 2cfe46fd

Jakob Stoklund Olesen authored Apr 09, 2013

The save area is twice as big and there is no struct return slot. The
stack pointer is always 16-byte aligned (after adding the bias).

Also eliminate the stack adjustment instructions around calls when the
function has a reserved stack frame.

llvm-svn: 179083

2cfe46fd

More uses for SymbolTableEntryBase. · eb8b211e
Rafael Espindola authored Apr 09, 2013
```
llvm-svn: 179076
```
eb8b211e

Add a SymbolTableEntryBase. · 5d6cec9b

Rafael Espindola authored Apr 09, 2013

Use it when we don't need to know if we have a 32 or 64 bit SymbolTableEntry.

llvm-svn: 179074

5d6cec9b

Add a SectionBase struct. · 65d601f9

Rafael Espindola authored Apr 08, 2013

Use it to share code and when we don't need to know if we have a 32 or 64
bit Section.

llvm-svn: 179072

65d601f9

c++ new operators are not malloc-like functions because they do not return uninitialized memory. · 9dd90ac5
Nadav Rotem authored Apr 08, 2013
```
Users may overide new-operators and implement any function that they like.

llvm-svn: 179071
```
9dd90ac5
InstructionSimplify.cpp: Fix a ligature, "fi", to get rid of utf8 in comment. · 065fd352
NAKAMURA Takumi authored Apr 08, 2013
```
llvm-svn: 179066
```
065fd352

Redo the fix Benjamin Kramer committed in r178793 about iterator invalidation in Reassociate. · 331f01dc

Shuxin Yang authored Apr 08, 2013

I brazenly think this change is slightly simpler than r178793 because: 
  - no "state" in functor
  - "OpndPtrs[i]" looks simpler than "&Opnds[OpndIndices[i]]" 

  While I can reproduce the probelm in Valgrind, it is rather difficult to come up
a standalone testing case. The reason is that when an iterator is invalidated,
the stale invalidated elements are not yet clobbered by nonsense data, so the
optimizer can still proceed successfully. 

  Thank Benjamin for fixing this bug and generously providing the test case.

llvm-svn: 179062

331f01dc

Apr 08, 2013

Template the MachO types over the word size. · c0406e16
Rafael Espindola authored Apr 08, 2013
```
llvm-svn: 179051
```
c0406e16
Remove is64BitLoadCommand. · 29d45017
Rafael Espindola authored Apr 08, 2013
```
llvm-svn: 179048
```
29d45017

X86 cost model: Model cost for uitofp and sitofp on SSE2 · f47d2d7f

Arnold Schwaighofer authored Apr 08, 2013

The costs are overfitted so that I can still use the legalization factor.

For example the following kernel has about half the throughput vectorized than
unvectorized when compiled with SSE2. Before this patch we would vectorize it.

unsigned short A[1024];
double B[1024];
void f() {
  int i;
  for (i = 0; i < 1024; ++i) {
    B[i] = (double) A[i];
  }
}

radar://13599001

llvm-svn: 179033

f47d2d7f

[ms-inline asm] Add support for ImmDisp [ Symbol ] memory operands. · fce4fab1
Chad Rosier authored Apr 08, 2013
```
rdar://13521249

llvm-svn: 179030
```
fce4fab1

Generate PPC early conditional returns · b5aa7e54

Hal Finkel authored Apr 08, 2013

PowerPC has a conditional branch to the link register (return) instruction: BCLR.
This should be used any time when we'd otherwise have a conditional branch to a
return. This adds a small pass, PPCEarlyReturn, which runs just prior to the
branch selection pass (and, importantly, after block placement) to generate
these conditional returns when possible. It will also eliminate unconditional
branches to returns (these happen rarely; most of the time these have already
been tail duplicated by the time PPCEarlyReturn is invoked). This is a nice
optimization for small functions that do not maintain a stack frame.

llvm-svn: 179026

b5aa7e54

DWARF parser: remove duplicated code and fix code style in DIE extractors. · c03f2ee0
Alexey Samsonov authored Apr 08, 2013
```
llvm-svn: 179023
```
c03f2ee0
Add all 4 MachO object types. Use the stored type to implement is64Bits(). · d66c4146
Rafael Espindola authored Apr 08, 2013
```
llvm-svn: 179021
```
d66c4146
R600: Control Flow support for pre EG gen · 5f11dd39
Vincent Lejeune authored Apr 08, 2013
```
llvm-svn: 179020
```
5f11dd39

AArch64: remove barriers from AArch64 atomic operations. · 15410e98

Tim Northover authored Apr 08, 2013

I've managed to convince myself that AArch64's acquire/release
instructions are sufficient to guarantee C++11's required semantics,
even in the sequentially-consistent case.

llvm-svn: 179005

15410e98

ARM: Remove unused variable. · d56a324e
Benjamin Kramer authored Apr 08, 2013
```
llvm-svn: 179001
```
d56a324e

Cleanup and improve PPC fsel generation · 81f8799f

Hal Finkel authored Apr 07, 2013

First, we should not cheat: fsel-based lowering of select_cc is a
finite-math-only optimization (the ISA manual, section F.3 of v2.06, makes
this clear, as does a note in our own README).

This also adds fsel-based lowering of EQ and NE condition codes. As it turned
out, fsel generation was covered by a grand total of zero regression test
cases. I've added some test cases to cover the existing behavior (which is now
finite-math only), as well as the new EQ cases.

llvm-svn: 179000

81f8799f

Apr 07, 2013
- Make MachOObjectFile independent from MachOObject. · 421305af
  Rafael Espindola authored Apr 07, 2013
```
llvm-svn: 178998
```
  421305af
- Implement MachOObjectFile::getData directly. · c1f28b6a
  Rafael Espindola authored Apr 07, 2013
```
llvm-svn: 178997
```
  c1f28b6a
- Implement MachOObjectFile::is64Bit directly. · 28814d79
  Rafael Espindola authored Apr 07, 2013
```
llvm-svn: 178996
```
  28814d79
- Implement MachOObjectFile::getHeaderSize directly. · 774a8cec
  Rafael Espindola authored Apr 07, 2013
```
llvm-svn: 178995
```
  774a8cec
- Implement MachOObjectFile::getHeader directly. · d6652591
  Rafael Espindola authored Apr 07, 2013
```
llvm-svn: 178994
```
  d6652591
- Implement LowerCall_64 for the SPARC v9 64-bit ABI. · a30f4832
  Jakob Stoklund Olesen authored Apr 07, 2013
```
There is still no support for byval arguments (which I don't think are
needed) and varargs.

llvm-svn: 178993
```
  a30f4832
- Implement MachOObjectFile::getHeaderSize and MachOObjectFile::getData. · 60689987
  Rafael Espindola authored Apr 07, 2013
```
These were the last missing forwarding functions. Also consistently use
the forwarding functions instead of using MachOObj directly.

llvm-svn: 178992
```
  60689987
- Remove LoadCommandInfo now that we always have a pointer to the command. · 3c50f062
  Rafael Espindola authored Apr 07, 2013
```
LoadCommandInfo was needed to keep a command and its offset in the file. Now
that we always have a pointer to the command, we don't need the offset.

llvm-svn: 178991
```
  3c50f062
- Add MachOObjectFile::LoadCommandInfo. · 224208b8
  Rafael Espindola authored Apr 07, 2013
```
This avoids using MachOObject::getLoadCommandInfo.

llvm-svn: 178990
```
  224208b8
- Use getLoadCommandInfo instead of MachOObj->getLoadCommandInfo. · 1309a448
  Rafael Espindola authored Apr 07, 2013
```
llvm-svn: 178989
```
  1309a448
- Construct MachOObject in MachOObjectFile's constructor. · 17bece31
  Rafael Espindola authored Apr 07, 2013
```
llvm-svn: 178988
```
  17bece31
- Remove unused argument. · 717c4d44
  Rafael Espindola authored Apr 07, 2013
```
llvm-svn: 178987
```
  717c4d44
- Remove MachOObjectFile::getObject. · 5ffc079c
  Rafael Espindola authored Apr 07, 2013
```
llvm-svn: 178986
```
  5ffc079c