Commits · 4f56720754c4ce64b37c62c0c935b4993adcb15e · Roger Ferrer / llvm-epi-0.8

Nov 27, 2011

Prevent rotating the blocks of a loop (and thus getting a backedge to be · 4f567207

Chandler Carruth authored Nov 27, 2011

fallthrough) in cases where we might fail to rotate an exit to an outer
loop onto the end of the loop chain.

Having *some* rotation, but not performing this rotation, is the primary
fix of thep performance regression with -enable-block-placement for
Olden/em3d (a whopping 30% regression). Still working on reducing the
test case that actually exercises this and the new rotation strategy out
of this code, but I want to check if this regresses other test cases
first as that may indicate it isn't the correct fix.

llvm-svn: 145195

4f567207

Make our handling of MMX x SSE closer to what gcc does: · a6416a7c

Rafael Espindola authored Nov 27, 2011

* Enabling sse enables mmx.
* Disabling (-mno-mmx) mmx, doesn't disable sse (we got this right already).
* The order in not important. -msse -mno-mmx is the same as -mno-mmx -msse.

llvm-svn: 145194

a6416a7c

rewrite the known problems section. Including a short list of individual bugs... · 080dd7ce

Chris Lattner authored Nov 27, 2011

rewrite the known problems section.  Including a short list of individual bugs per target isn't particularly useful.  Link to the target features matrix.

llvm-svn: 145193

080dd7ce

move the detailed information about the EH rewrite to a comment, Bill is · 4857190a
Chris Lattner authored Nov 27, 2011
```
blog'izing it.

llvm-svn: 145192
```
4857190a
tweak subprojects' section · e9a31c40
Chris Lattner authored Nov 27, 2011
```
llvm-svn: 145191
```
e9a31c40
some random notes. · 25a77906
Chris Lattner authored Nov 27, 2011
```
llvm-svn: 145190
```
25a77906
Add inreg attributes to reference arguments. · 32d8a275
Rafael Espindola authored Nov 27, 2011
```
llvm-svn: 145189
```
32d8a275
remove a test that is using old-style llvm.dbg intrinsics, apparently only · 251d827d
Chris Lattner authored Nov 27, 2011
```
fails on ppc and arm hosts.

llvm-svn: 145188
```
251d827d

Reference initialization with initializer lists. · 29526f09

Sebastian Redl authored Nov 27, 2011

This supports single-element initializer lists for references according to DR1288, as well as creating temporaries and binding to them for other initializer lists.

llvm-svn: 145186

29526f09

Error on non x86 architectures. · 0618d14e
Rafael Espindola authored Nov 27, 2011
```
llvm-svn: 145185
```
0618d14e
Fix file name in comments. · fd03d0b7
Rafael Espindola authored Nov 27, 2011
```
llvm-svn: 145184
```
fd03d0b7

Take two on rotating the block ordering of loops. My previous attempt · 03adbd46

Chandler Carruth authored Nov 27, 2011

was centered around the premise of laying out a loop in a chain, and
then rotating that chain. This is good for preserving contiguous layout,
but bad for actually making sane rotations. In order to keep it safe,
I had to essentially make it impossible to rotate deeply nested loops.
The information needed to correctly reason about a deeply nested loop is
actually available -- *before* we layout the loop. We know the inner
loops are already fused into chains, etc. We lose information the moment
we actually lay out the loop.

The solution was the other alternative for this algorithm I discussed
with Benjamin and some others: rather than rotating the loop
after-the-fact, try to pick a profitable starting block for the loop's
layout, and then use our existing layout logic. I was worried about the
complexity of this "pick" step, but it turns out such complexity is
needed to handle all the important cases I keep teasing out of benchmarks.

This is, I'm afraid, a bit of a work-in-progress. It is still
misbehaving on some likely important cases I'm investigating in Olden.
It also isn't really tested. I'm going to try to craft some interesting
nested-loop test cases, but it's likely to be extremely time consuming
and I don't want to go there until I'm sure I'm testing the correct
behavior. Sadly I can't come up with a way of getting simple, fine
grained test cases for this logic. We need complex loop structures to
even trigger much of it.

llvm-svn: 145183

03adbd46

Revert r145180 as it is causing test failures on all the bots. · 37ab257b

Chandler Carruth authored Nov 27, 2011

Original commit message:
Fixed ObjectFile functions:
- getSymbolOffset() renamed as getSymbolFileOffset()
- getSymbolFileOffset(), getSymbolAddress(), getRelocationAddress() returns same result for ELFObjectFile, MachOObjectFile and COFFObjectFile.
- added getRelocationOffset()
- fixed MachOObjectFile::getSymbolSize()
- fixed MachOObjectFile::getSymbolSection()
- fixed MachOObjectFile::getSymbolOffset() for symbols without section data.

llvm-svn: 145182

37ab257b

Fix an impressive type-o / spell-o Duncan noticed. · 9e466841
Chandler Carruth authored Nov 27, 2011
```
llvm-svn: 145181
```
9e466841

Fixed ObjectFile functions: · 2631f93f

Danil Malyshev authored Nov 27, 2011

- getSymbolOffset() renamed as getSymbolFileOffset()
- getSymbolFileOffset(), getSymbolAddress(), getRelocationAddress() returns same result for ELFObjectFile, MachOObjectFile and COFFObjectFile.
- added getRelocationOffset()
- fixed MachOObjectFile::getSymbolSize()
- fixed MachOObjectFile::getSymbolSection()
- fixed MachOObjectFile::getSymbolOffset() for symbols without section data.

llvm-svn: 145180

2631f93f

Rework a bit of the implementation of loop block rotation to not rely so · a0545809

Chandler Carruth authored Nov 27, 2011

heavily on AnalyzeBranch. That routine doesn't behave as we want given
that rotation occurs mid-way through re-ordering the function. Instead
merely check that there are not unanalyzable branching constructs
present, and then reason about the CFG via successor lists. This
actually simplifies my mental model for all of this as well.

The concrete result is that we now will rotate more loop chains. I've
added a test case from Olden highlighting the effect. There is still
a bit more to do here though in order to regain all of the performance
in Olden.

llvm-svn: 145179

a0545809

Eli managed to kill off llvm.membarrier in llvm 3.0 also, this means · 0bcbde46
Chris Lattner authored Nov 27, 2011
```
that mainline needs no autoupgrade logic for intrinsics yet, woohoo!

llvm-svn: 145178
```
0bcbde46
add some final random notes, I've completed my pass over all the commits. · 3dcdc29d
Chris Lattner authored Nov 27, 2011
```
I'll work on turning this into something intelligible tomorrow.

llvm-svn: 145177
```
3dcdc29d
The llvm.atomic intrinsics *were* removed in LLVM 3.0 (in r141333), remove the · 410f3d7f
Chris Lattner authored Nov 27, 2011
```
autoupgrade logic for 2.9 and before.

llvm-svn: 145176
```
410f3d7f

remove autoupgrade support for old forms of llvm.prefetch and the old · ee471c48

Chris Lattner authored Nov 27, 2011

trampoline forms.  Both of these were correct in LLVM 3.0, and we don't
need to support LLVM 2.9 and earlier in mainline.

llvm-svn: 145174

ee471c48

add some notes. · d5bb9e6c
Chris Lattner authored Nov 27, 2011
```
llvm-svn: 145173
```
d5bb9e6c

remove asmparsing and documentation support for "volatile load", which was... · bc639298

Chris Lattner authored Nov 27, 2011

remove asmparsing and documentation support for "volatile load", which was only produced by LLVM 2.9 and earlier.  LLVM 3.0 and later prefers "load volatile".

llvm-svn: 145172

bc639298

Upgrade syntax of tests using volatile instructions to use 'load volatile'... · 6a144a22

Chris Lattner authored Nov 27, 2011

Upgrade syntax of tests using volatile instructions to use 'load volatile' instead of 'volatile load', which is archaic.

llvm-svn: 145171

6a144a22

some notes. · ebed15e9
Chris Lattner authored Nov 27, 2011
```
llvm-svn: 145170
```
ebed15e9

remove autoupgrade support for really old-style debug info intrinsics. · 90ef78c0

Chris Lattner authored Nov 27, 2011

I think this is the last of autoupgrade that can be removed in 3.1.
Can the atomic upgrade stuff also go?

llvm-svn: 145169

90ef78c0

Use libcxx makefile's do-installhdrs target. <rdar://problem/10397739> · 800b2b42
Bob Wilson authored Nov 27, 2011
```
llvm-svn: 145168
```
800b2b42
remove some old autoupgrade logic · 6aa6c0c3
Chris Lattner authored Nov 27, 2011
```
llvm-svn: 145167
```
6aa6c0c3
remove autoupgrade support for LLVM 2.9 exception stuff. Mainline supports · db891539
Chris Lattner authored Nov 27, 2011
```
LLVM 3.0 and later.

llvm-svn: 145165
```
db891539
remove support for reading llvm 2.9 .bc files. LLVM 3.1 is only compatible back to 3.0 · 1c9e5678
Chris Lattner authored Nov 27, 2011
```
llvm-svn: 145164
```
1c9e5678
add some notes · 74a3e00e
Chris Lattner authored Nov 27, 2011
```
llvm-svn: 145163
```
74a3e00e

Refactor libcxx makefile. No functional changes intended. · 8a3c663e

Bob Wilson authored Nov 27, 2011

Besides cleaning up the repetition in the installhdrs target, the point of this
change is to provide a separate do-installhdrs target that can be used directly
from clang's runtime/libcxx makefile to install a copy of the headers along
with clang.  <rdar://problem/10397739>

llvm-svn: 145162

8a3c663e

Add several new instructions supported by the latest MicroBlaze. · 97b3da54
Wesley Peck authored Nov 27, 2011
```
These instructions are not generated by the backend yet, this will come in a later commit.

llvm-svn: 145161
```
97b3da54

Partially revert r145157 to quiet an unhappy buildbot. · 8e6d9da0

Bob Wilson authored Nov 27, 2011

Removing that buildbot would be a better solution, but this is at least
a temporary workaround.

llvm-svn: 145160

8e6d9da0

Optimize comparison against 0 in conditional instructions. · d2e2e178
Wesley Peck authored Nov 27, 2011
```
Fix a couple of 80-column violations.

llvm-svn: 145159
```
d2e2e178

Introduce a loop block rotation optimization to the new block placement · 9ffb97e6

Chandler Carruth authored Nov 27, 2011

pass. This is designed to achieve one of the important optimizations
that the old code placement pass did, but more simply.

This is a somewhat rough and *very* conservative version of the
transform. We could get a lot fancier here if there are profitable cases
to do so. In particular, this only looks for a single pattern, it
insists that the loop backedge being rotated away is the last backedge
in the chain, and it doesn't provide any means of doing better in-loop
placement due to the rotation. However, it appears that it will handle
the important loops I am finding in the LLVM test suite.

llvm-svn: 145158

9ffb97e6

Merge the install-clang-c target into install-clang. <rdar://problem/10217046> · 4eefd2d5
Bob Wilson authored Nov 27, 2011
```
llvm-svn: 145157
```
4eefd2d5
Move code into anonymous namespaces. · 7ba71be3
Benjamin Kramer authored Nov 26, 2011
```
llvm-svn: 145154
```
7ba71be3

Nov 26, 2011
- Merge 128-bit and 256-bit X86ISD node types for VPERMILPS and VPERMILPD.... · 51280d56
  Craig Topper authored Nov 26, 2011
```
Merge 128-bit and 256-bit X86ISD node types for VPERMILPS and VPERMILPD. Simplify some shuffle lowering code since V1 can never be UNDEF due to canonalizing that occurs when shuffle nodes are created.

llvm-svn: 145153
```
  51280d56
- Rename a couple of options and fix some simple typos. · 69d50404
  Wesley Peck authored Nov 26, 2011
```
llvm-svn: 145152
```
  69d50404
- Add the minimum implementation of cpuid.h. This works on "modern" intel cpus · d086573a
  Rafael Espindola authored Nov 26, 2011
```
and on clang, which seams to handled "=b" correctly even when ebx is the
PIC register.

llvm-svn: 145149
```
  d086573a