Commits · b1f68f96966f758a06e379c4ddeb185b032747f6 · Roger Ferrer / llvm-epi-0.8

Apr 02, 2012

Initial 64 bit direct object support. · b1f68f96

Akira Hatanaka authored Apr 02, 2012

This patch allows llvm to recognize that a 64 bit object file is being produced
and that the subsequently generated ELF header has the correct information.

The test case checks for both big and little endian flavors.

Patch by Jack Carter.

llvm-svn: 153889

b1f68f96

The binutils for the IBM BG/P are too old to support CFI. · 7591afa2
Hal Finkel authored Apr 02, 2012
```
llvm-svn: 153886
```
7591afa2
Add triple support for the IBM BG/P and BG/Q supercomputers. · f208af02
Hal Finkel authored Apr 02, 2012
```
llvm-svn: 153882
```
f208af02
Turn on the accelerator tables for Darwin. · ad9fe895
Eric Christopher authored Apr 02, 2012
```
llvm-svn: 153880
```
ad9fe895

Fast fix for PR12343: · f62ffeca

Stepan Dyatkovskiy authored Apr 02, 2012

http://llvm.org/bugs/show_bug.cgi?id=12343

There is no trivial way to split edges that come from an indirect branch. It can be done with some tricks, but that needs further discussion, and it is still dangerous because indirect branches are hard to control.

The fix forbids unswitching in this case.

llvm-svn: 153879

f62ffeca

Implement the SVR4 byval alignment for aggregates. Fixing a FIXME. · b9663ccd
Roman Divacky authored Apr 02, 2012
```
llvm-svn: 153876
```
b9663ccd
Second part for the 153874 one · 98144e9e
Silviu Baranga authored Apr 02, 2012
```
llvm-svn: 153875
```
98144e9e
Added fix in TableGen instruction decoder generation. The decoder now breaks for every leaf node. · ac37acd3
Silviu Baranga authored Apr 02, 2012
```
llvm-svn: 153874
```
ac37acd3
Add missing 'd'. · ebe09ec1
Rafael Espindola authored Apr 02, 2012
```
llvm-svn: 153872
```
ebe09ec1

Hack the hack. If we have a situation where an ASM object is defined but isn't · 71b19bbd

Bill Wendling authored Apr 02, 2012

reflected in the LLVM IR (as a declare or something), then treat it like a data
object.

N.B. This isn't 100% correct. The ASM parser should supply more information so
that we know what type of object it is, and what attributes it should have.

llvm-svn: 153870

71b19bbd

Emit the asm writer's mnemonic table with SequenceToOffsetTable. · 22d093e4
Benjamin Kramer authored Apr 02, 2012
```
This way we can get AVX v-prefixed instructions tail merged with the normal insns.

llvm-svn: 153869
```
22d093e4
Move getOpcodeName from the various target InstPrinters into the superclass MCInstPrinter. · 1c0541b0
Benjamin Kramer authored Apr 02, 2012
```
All implementations used the same code.

llvm-svn: 153866
```
1c0541b0

Reorder fields in MatchEntry and OperandMatchEntry to reduce padding. A bit... · 4de73738

Craig Topper authored Apr 02, 2012

Reorder fields in MatchEntry and OperandMatchEntry to reduce padding. A bit tricky due to the target specific sizes for some of the fields so the ordering is only optimal for the targets in the tree.

llvm-svn: 153865

4de73738
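
To illustrate the padding effect this commit message describes, here is a minimal sketch. The struct and field names are hypothetical and are not the real MatchEntry or OperandMatchEntry definitions; the sizes in the comments assume a common ABI where `uint32_t` is 4-byte aligned and `uint16_t` is 2-byte aligned.

```cpp
#include <cstdint>

// Hypothetical layout: small fields interleaved with larger ones,
// so the compiler usually has to insert padding between them.
struct PaddedEntry {
  uint8_t Kind;       // 1 byte, usually followed by 3 bytes of padding
  uint32_t Opcode;    // 4 bytes
  uint8_t NumOps;     // 1 byte, usually followed by 1 byte of padding
  uint16_t Mnemonic;  // 2 bytes
};

// Same fields, ordered from largest to smallest alignment requirement.
struct ReorderedEntry {
  uint32_t Opcode;
  uint16_t Mnemonic;
  uint8_t Kind;
  uint8_t NumOps;
};

// Reordering never makes the struct larger on ABIs that lay out
// members in declaration order with natural alignment.
static_assert(sizeof(ReorderedEntry) <= sizeof(PaddedEntry),
              "reordered layout should not be larger");
```

If the width of a field depends on the target, as the message notes, the best ordering can differ per target, which is why a fixed ordering may only be optimal for a known set of targets.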

Optimizing swizzles of complex shuffles may generate additional complex shuffles. · 702f0807

Nadav Rotem authored Apr 02, 2012

Do not try to optimize swizzles of shuffles if the source shuffle has more than
a single user, except when the source shuffle is also a swizzle.

llvm-svn: 153864

702f0807

Remove getInstructionName from MCInstPrinter implementations in favor of using... · dab9e35a

Craig Topper authored Apr 02, 2012

Remove getInstructionName from MCInstPrinter implementations in favor of using the instruction name table from MCInstrInfo. Reduces static data in the InstPrinter implementations.

llvm-svn: 153863

dab9e35a

Fix CXXFLAGS for huge_val.m4. · 8e52bdce
Eric Christopher authored Apr 02, 2012
```
Patch by Jeremy Huddleston!

llvm-svn: 153862
```
8e52bdce

Make MCInstrInfo available to the MCInstPrinter. This will be used to remove... · 54bfde79

Craig Topper authored Apr 02, 2012

Make MCInstrInfo available to the MCInstPrinter. This will be used to remove getInstructionName and the static data it contains since the same tables are already in MCInstrInfo.

llvm-svn: 153860

54bfde79

It could come about that we parse the inline ASM before we get a potential · 3a0bcf06

Bill Wendling authored Apr 02, 2012

definition for it. In that case, we want to wait for the potential definition
before we create a symbol for it.

llvm-svn: 153859

3a0bcf06

Use SequenceToOffsetTable to generate instruction name table for AsmWriter. · 7a2cea18
Craig Topper authored Apr 02, 2012
```
llvm-svn: 153857
```
7a2cea18

Start cleaning up the InlineCost class. This switches to sentinel values · 219173a1

Chandler Carruth authored Apr 01, 2012

rather than a bitfield, a great suggestion by Chris during code review.

There is still quite a bit of cruft in the interface, but that requires
sorting out some awkward uses of the cost inside the actual inliner.

No functionality change intended here.

llvm-svn: 153853

219173a1
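
The switch from a bitfield to sentinel values can be sketched roughly as follows. This is an illustrative class only: the name `CostSketch`, its methods, and the choice of `INT_MIN` and `INT_MAX` as sentinels are assumptions made for the example, not the actual InlineCost interface.

```cpp
#include <climits>

// Hypothetical cost wrapper: two reserved integer values stand in for
// "always" and "never", so no separate kind bits are needed.
class CostSketch {
  static constexpr int AlwaysSentinel = INT_MIN;  // assumed sentinel
  static constexpr int NeverSentinel = INT_MAX;   // assumed sentinel
  int Value;

  explicit CostSketch(int V) : Value(V) {}

public:
  static CostSketch get(int V) { return CostSketch(V); }
  static CostSketch getAlways() { return CostSketch(AlwaysSentinel); }
  static CostSketch getNever() { return CostSketch(NeverSentinel); }

  bool isAlways() const { return Value == AlwaysSentinel; }
  bool isNever() const { return Value == NeverSentinel; }
  bool isVariable() const { return !isAlways() && !isNever(); }

  // Only meaningful when isVariable() is true.
  int getValue() const { return Value; }
};
```

One consequence of this design is that the whole state fits in a single `int`, at the price of reserving two values that an ordinary cost can no longer use.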

Apr 01, 2012

Fix some 80-col. violations I introduced with the A2 PPC64 core. · 3ecfa7b2
Hal Finkel authored Apr 01, 2012
```
llvm-svn: 153852
```
3ecfa7b2
Enable prefetch generation on PPC64. · 322e41a9
Hal Finkel authored Apr 01, 2012
```
llvm-svn: 153851
```
322e41a9
Add LdStSTD* itin. for the PPC64 A2 core. · 9032344c
Hal Finkel authored Apr 01, 2012
```
llvm-svn: 153850
```
9032344c

This commit contains a few changes that had to go in together. · b0783508

Nadav Rotem authored Apr 01, 2012

1. Simplify xor/and/or (bitcast(A), bitcast(B)) -> bitcast(op (A,B))
   (and also scalar_to_vector).

2. Xor/and/or are indifferent to the swizzle operation (shuffle of one src).
   Simplify xor/and/or (shuff(A), shuff(B)) -> shuff(op (A, B))

3. Optimize swizzles of shuffles:  shuff(shuff(x, y), undef) -> shuff(x, y).

4. Fix an X86ISelLowering optimization which was very bitcast-sensitive.

Code which was previously compiled to this:

movd    (%rsi), %xmm0
movdqa  .LCPI0_0(%rip), %xmm2
pshufb  %xmm2, %xmm0
movd    (%rdi), %xmm1
pshufb  %xmm2, %xmm1
pxor    %xmm0, %xmm1
pshufb  .LCPI0_1(%rip), %xmm1
movd    %xmm1, (%rdi)
ret

Now compiles to this:

movl    (%rsi), %eax
xorl    %eax, (%rdi)
ret

llvm-svn: 153848

b0783508
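
Item 2 of the commit message above relies on the fact that a lane-wise operation such as xor commutes with a swizzle (a shuffle of a single source). The following is a small standalone check of that identity on 4-lane vectors; the `Vec4`, `swizzle`, and `vxor` names are made up for the example and have nothing to do with the DAG combiner code itself.

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <cstdint>

using Vec4 = std::array<uint32_t, 4>;
using Mask4 = std::array<int, 4>;

// Swizzle: every result lane picks one lane of the single source vector.
Vec4 swizzle(const Vec4 &V, const Mask4 &Mask) {
  Vec4 R{};
  for (std::size_t I = 0; I < 4; ++I) {
    assert(Mask[I] >= 0 && Mask[I] < 4 && "mask index out of range");
    R[I] = V[static_cast<std::size_t>(Mask[I])];
  }
  return R;
}

// Lane-wise xor of two vectors.
Vec4 vxor(const Vec4 &A, const Vec4 &B) {
  Vec4 R{};
  for (std::size_t I = 0; I < 4; ++I)
    R[I] = A[I] ^ B[I];
  return R;
}

// True when xor(shuff(A), shuff(B)) equals shuff(xor(A, B)) for this mask.
bool xorCommutesWithSwizzle(const Vec4 &A, const Vec4 &B, const Mask4 &Mask) {
  return vxor(swizzle(A, Mask), swizzle(B, Mask)) ==
         swizzle(vxor(A, B), Mask);
}
```

The identity holds because lane `I` of either side is `A[Mask[I]] ^ B[Mask[I]]`, provided both operands use the same mask. The same argument applies to `and` and `or`.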

Fix typo. · 652f2127
Lang Hames authored Apr 01, 2012
```
llvm-svn: 153846
```
652f2127
Set the default PPC node scheduling preference to ILP (for the embedded cores). · 88ed4e3b
Hal Finkel authored Apr 01, 2012
```
The 440 and A2 cores have detailed itineraries, and this allows them to be
fully used to maximize throughput.

llvm-svn: 153845
```
88ed4e3b
Add ppc440 itin. entries for LdStSTD* · b9845f57
Hal Finkel authored Apr 01, 2012
```
llvm-svn: 153844
```
b9845f57

Use full anti-dep. breaking with post-ra sched. on the embedded ppc cores. · ec5a1e36

Hal Finkel authored Apr 01, 2012

Post-RA scheduling gives a significant performance improvement on
the embedded cores, so turn it on. Using full anti-dep. breaking is
important for FP-intensive blocks, so turn it on (just on the
embedded cores for now; this should also be good on the 970s because
post-ra scheduling is all that we have for now, but that should have
more testing first).

llvm-svn: 153843

ec5a1e36

Add instruction itinerary for the PPC64 A2 core. · 9f9f8929

Hal Finkel authored Apr 01, 2012

This adds a full itinerary for IBM's PPC64 A2 embedded core. These
cores form the basis for the CPUs in the new IBM BG/Q supercomputer.

llvm-svn: 153842

9f9f8929

Use SequenceToOffsetTable to create instruction name table. Saves space... · 91773ab2

Craig Topper authored Apr 01, 2012

Use SequenceToOffsetTable to create instruction name table. Saves space particularly on X86 where AVX instructions just add a 'v' to the front of other instructions.

llvm-svn: 153841

91773ab2
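
The space saving described above comes from storing a name once when it is a suffix of another name: `vaddps` followed by a NUL byte already contains `addps` plus its terminator at offset 1. Below is a simplified sketch of that idea. It is not the SequenceToOffsetTable implementation; `NameTable` and `buildNameTable` are hypothetical names, and the linear search is chosen for brevity rather than speed.

```cpp
#include <algorithm>
#include <cstddef>
#include <map>
#include <string>
#include <vector>

struct NameTable {
  std::string Data;                            // NUL-terminated names, back to back
  std::map<std::string, std::size_t> Offsets;  // name -> start offset in Data
};

NameTable buildNameTable(std::vector<std::string> Names) {
  // Process longer names first so that a shorter name which is a suffix
  // of a longer one finds the longer one already stored.
  std::sort(Names.begin(), Names.end(),
            [](const std::string &A, const std::string &B) {
              return A.size() != B.size() ? A.size() > B.size() : A < B;
            });

  NameTable T;
  for (const std::string &N : Names) {
    if (T.Offsets.count(N))
      continue;  // duplicate name
    // Searching for the name together with its terminator guarantees that
    // a hit is the tail of some stored name, not a fragment from the middle.
    std::string Needle = N;
    Needle.push_back('\0');
    std::size_t Pos = T.Data.find(Needle);
    if (Pos == std::string::npos) {
      Pos = T.Data.size();
      T.Data += Needle;
    }
    T.Offsets[N] = Pos;
  }
  return T;
}
```

A name is read back as a C string starting at `T.Data.c_str() + T.Offsets[Name]`, so a lookup costs the same whether or not the storage is shared.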

Emit the LLVM<->DWARF register mapping as a sorted table and use binary search to do the lookup. · 12af4285

Benjamin Kramer authored Apr 01, 2012

This also avoids emitting the information twice, which led to code bloat. On i386-linux-Release+Asserts
with all targets built this change shaves a whopping 1.3 MB off clang. The number is probably exaggerated
by recent inliner changes but the methods were already enormous with the old inline cost computation.

The DWARF reg -> LLVM reg mapping doesn't seem to have holes in it, so it could be a simple lookup table.
I didn't implement that optimization yet to avoid potentially changing functionality.

There is still some duplication both in tablegen and the generated code that should be cleaned up eventually.

llvm-svn: 153837

12af4285
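
The sorted-table-plus-binary-search lookup mentioned above can be sketched as follows. The entry type, the table contents, and the function name are invented for the example and do not reflect the generated tables or any real register numbering.

```cpp
#include <algorithm>
#include <cstddef>

// Hypothetical mapping entry: one register number mapped to another.
struct RegMapEntry {
  unsigned From;
  unsigned To;
};

// Made-up table; it must be sorted by From for the binary search to work.
static const RegMapEntry ExampleMap[] = {
    {1, 0}, {4, 3}, {7, 2}, {9, 1}};
static const std::size_t ExampleMapSize =
    sizeof(ExampleMap) / sizeof(ExampleMap[0]);

// Returns the mapped register, or -1 if Reg has no entry.
int lookupReg(const RegMapEntry *Table, std::size_t Size, unsigned Reg) {
  const RegMapEntry *End = Table + Size;
  const RegMapEntry *I = std::lower_bound(
      Table, End, Reg,
      [](const RegMapEntry &E, unsigned R) { return E.From < R; });
  if (I == End || I->From != Reg)
    return -1;
  return static_cast<int>(I->To);
}
```

A lookup takes O(log n) comparisons, and the data is a single constant array rather than code. If the keys had no holes, as the message suspects for the DWARF-to-LLVM direction, the table could be indexed directly instead of searched.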

Belatedly address some code review from Chris. · 45ae88f5

Chandler Carruth authored Apr 01, 2012

As a side note, I really dislike array_pod_sort... Do we really still
care about any STL implementations that get this so wrong? Does libc++?

llvm-svn: 153834

45ae88f5

Add some more testing to cover the remaining two cases where · cdb1f8cf
Chandler Carruth authored Apr 01, 2012
```
always-inlining is disabled: recursive functions and indirectbr.

llvm-svn: 153833
```
cdb1f8cf

Fix a pretty scary bug I introduced into the always inliner with · c5bfb3c0

Chandler Carruth authored Apr 01, 2012

a single missing character. Somehow, this had gone untested. I've added
tests for returns-twice logic specifically with the always-inliner that
would have caught this, and fixed the bug.

Thanks to Matt for the careful review and spotting this!!! =D

llvm-svn: 153832

c5bfb3c0

Replace four tiny tests with various uses of grep and not with a single · 1989bb9c
Chandler Carruth authored Apr 01, 2012
```
test and FileCheck.

llvm-svn: 153831
```
1989bb9c
misched: Add finalizeScheduler to complete the target interface. · 779b32a4
Andrew Trick authored Apr 01, 2012
```
llvm-svn: 153827
```
779b32a4
Removing a file that's no longer being used after the recent refactorings · f5becf61
Eli Bendersky authored Apr 01, 2012
```
llvm-svn: 153825
```
f5becf61

Split the LdStGeneral PPC itin. class into LdStLoad and LdStStore. · 59607e63

Hal Finkel authored Apr 01, 2012

Loads and stores can have different pipeline behavior, especially on
embedded chips. This change allows those differences to be expressed.
Except for the 440 scheduler, there are no functionality changes.
On the 440, the latency adjustment is only by one cycle, and so this
probably does not affect much. Nevertheless, it will make a larger
difference in the future and this removes a FIXME from the 440 itin.

llvm-svn: 153821

59607e63

Mar 31, 2012

Add a workaround for building with old versions of clang. · 1eaae507
Rafael Espindola authored Mar 31, 2012
```
llvm-svn: 153820
```
1eaae507
Add a triple to the test. · 77242fa7
Rafael Espindola authored Mar 31, 2012
```
llvm-svn: 153818
```
77242fa7