- Nov 14, 2011
-
-
Akira Hatanaka authored
N32/64 places all variable arguments in integer registers (or on the stack), regardless of their types, but follows the calling convention of non-variadic functions when it handles fixed arguments. llvm-svn: 144553
-
Akira Hatanaka authored
llvm-svn: 144552
-
Justin Holewinski authored
PTX: Let LLVM use loads/stores for all mem* intrinsics, instead of relying on custom implementations. llvm-svn: 144551
-
Wesley Peck authored
llvm-svn: 144550
-
Akira Hatanaka authored
argument registers on the callee's stack frame, along with functions that set and get it. It is not necessary to add the size of this area when computing stack size in emitPrologue, since it has already been accounted for in PEI::calculateFrameObjectOffsets. llvm-svn: 144549
-
Eric Christopher authored
llvm-svn: 144548
-
Jakob Stoklund Olesen authored
I broke this in r144515, it affected most ARM testers. <rdar://problem/10441389> llvm-svn: 144547
-
Johnny Chen authored
llvm-svn: 144546
-
Johnny Chen authored
llvm-svn: 144545
-
Sean Callanan authored
a single argument. We assumed that the : was omitted from the selector name, but actually Clang adds the : in the one-argument case. llvm-svn: 144544
-
Bob Wilson authored
rdar://problem/10441578 This still seems to be causing some failures. It needs more testing before it gets enabled again. llvm-svn: 144543
-
Jakob Stoklund Olesen authored
llvm-svn: 144542
-
Greg Clayton authored
llvm-svn: 144539
-
Jim Grosbach authored
llvm-svn: 144538
-
Benjamin Kramer authored
llvm-svn: 144537
-
Benjamin Kramer authored
llvm-svn: 144536
-
Daniel Dunbar authored
build/Make: Switch over to using llvm-config-2 for dependencies once more (hopefully the last time), now that it also builds as a build tool. llvm-svn: 144535
-
Chandler Carruth authored
cleans up all the chains allocated during the processing of each function so that for very large inputs we don't just grow memory usage without bound. llvm-svn: 144533
-
Chandler Carruth authored
tests when I forcibly enabled block placement. It is apparently possible for an unanalyzable block to fall through to a non-loop block. I don't actually believe this is correct; I believe that 'canFallThrough' is returning true needlessly for the code construct, and I've left a bit of a FIXME on the verification code to try to track down why this is coming up. Anyway, removing the assert doesn't degrade the correctness of the algorithm. llvm-svn: 144532
-
Chandler Carruth authored
this pass. We're leaving already merged blocks on the worklist, and scanning them again and again only to determine each time through that indeed they aren't viable. We can instead remove them once we're going to have to scan the worklist. This is the easy way to implement removing them. If this remains on the profile (as I somewhat suspect it will), we can get a lot more clever here, as the worklist's order is essentially irrelevant. We can use swapping and fold the two loops to reduce overhead even when there are many blocks on the worklist but only a few of them are removed. llvm-svn: 144531
-
Chandler Carruth authored
time it is queried to compute the probability of a single successor. This makes computing the probability of every successor of a block in sequence... really, really slow. ;] This switches to a linear walk of the successors rather than a quadratic one, one of several quadratic behaviors slowing this pass down. I'm not really thrilled with moving the sum code into the public interface of MBPI, but I don't (at the moment) have ideas for a better interface. The direction I'm thinking in for a better interface is to have MBPI actually retain much more state and make *all* of these queries cheap. That's a lot of work and would require invasive changes. Until then, this seems like the least bad (i.e., least quadratic) solution. Suggestions welcome. llvm-svn: 144530
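The quadratic-vs-linear point can be sketched with plain integers standing in for MBPI state: if each per-edge query re-sums all successor weights, asking for every successor is O(n²); summing once and reusing the total is O(n). Names here are illustrative, not the actual MBPI interface:

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

struct Succ {
  uint32_t Weight;
};

// Linear: one pass to sum the weights, one pass to scale each weight
// by the shared total.
std::vector<double> allProbabilities(const std::vector<Succ> &Succs) {
  uint64_t Sum = 0;
  for (const Succ &S : Succs)
    Sum += S.Weight; // single summation, reused for every successor
  std::vector<double> P;
  for (const Succ &S : Succs)
    P.push_back(Sum ? double(S.Weight) / double(Sum) : 0.0);
  return P;
}
```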
-
Tobias Grosser authored
llvm-svn: 144529
-
Tobias Grosser authored
llvm-svn: 144528
-
Chandler Carruth authored
correctly handle blocks whose successor weights sum to more than UINT32_MAX. This is slightly less efficient, but the entire thing is already linear on the number of successors. Calling it within any hot routine is a mistake, and indeed no one is calling it. It also simplifies the code. llvm-svn: 144527
-
Chandler Carruth authored
the sum of the edge weights not overflowing uint32, and crashed when they did. This is generally safe, as BranchProbabilityInfo tries to provide this guarantee. However, the CFG can get modified during codegen in a way that grows the *sum* of the edge weights. This doesn't seem unreasonable (imagine just adding more blocks, all with the default weight of 16), but it is hard to come up with a case that actually triggers 32-bit overflow. Fortunately, the single-source GCC build is good at this. The solution isn't very pretty, but it's no worse than the previous code. We're already summing all of the edge weights on each query; we can sum them, check for an overflow, compute a scale, and sum them again. I've included a *greatly* reduced test case out of the GCC source that triggers it. It's a pretty lame test, as it clearly is just barely triggering the overflow. I'd like to have something much more definitive, but I don't understand the fundamental pattern that triggers an explosion in the edge weight sums. The buggy code is duplicated within this file. I'll collapse the copies into a single implementation in a subsequent commit. llvm-svn: 144526
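The sum-check-scale-resum approach described above can be sketched as follows: accumulate the weights in 64 bits, and if the total exceeds UINT32_MAX, divide every weight by a scale so the rescaled sum fits in 32 bits again. This mirrors the described strategy, not the exact LLVM code:

```cpp
#include <cassert>
#include <cstdint>
#include <limits>
#include <vector>

// Sum edge weights so the result always fits in uint32, rescaling
// only when the true 64-bit sum would overflow.
uint32_t scaledSum(const std::vector<uint32_t> &Weights) {
  uint64_t Sum = 0;
  for (uint32_t W : Weights)
    Sum += W; // first pass: exact 64-bit sum, checks for overflow
  // Scale is 1 when no overflow; otherwise large enough that the
  // rescaled sum fits in 32 bits.
  uint64_t Scale = Sum / std::numeric_limits<uint32_t>::max() + 1;
  uint64_t Rescaled = 0;
  for (uint32_t W : Weights)
    Rescaled += W / Scale; // second pass: sum the scaled weights
  return static_cast<uint32_t>(Rescaled);
}
```

When no overflow occurs, `Scale` is 1 and the result is the exact sum; rescaling loses a little precision, which is acceptable since the weights are only used as relative probabilities.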
-
Craig Topper authored
Add AVX2 version of instructions to load folding tables. Also add a bunch of missing SSE/AVX instructions. llvm-svn: 144525
-
Argyrios Kyrtzidis authored
otherwise we may crash. llvm-svn: 144524
-
Chandler Carruth authored
expensive the most useful interface to this analysis is. Fun story -- it's also not correct. That's getting fixed in another patch. llvm-svn: 144523
-
Craig Topper authored
Add neverHasSideEffects, mayLoad, and mayStore to many patternless SSE/AVX instructions. Remove MMX check from LowerVECTOR_SHUFFLE since MMX vector types won't go through it anyway. llvm-svn: 144522
-
Nico Weber authored
llvm-svn: 144521
-
Argyrios Kyrtzidis authored
llvm-svn: 144520
-
Argyrios Kyrtzidis authored
should have been already emitted. llvm-svn: 144519
-
Chad Rosier authored
offsets. rdar://10412592 llvm-svn: 144518
-
Jakob Stoklund Olesen authored
llvm-svn: 144517
-
Chandler Carruth authored
get loop info structures associated with them, and so we need some way to make forward progress selecting and placing basic blocks. The technique used here is pretty brutal: it just scans the list of blocks looking for the first unplaced candidate. It keeps placing blocks like this until the CFG becomes tractable. The cost is somewhat unfortunate; it requires allocating a vector of all basic block pointers eagerly. I have some ideas about how to simplify and optimize this, but I'm trying to get the logic correct first. Thanks to Benjamin Kramer for the reduced test case out of GCC. Sadly, there are other bugs that GCC is tickling that I'm reducing and working on now. llvm-svn: 144516
-
Jakob Stoklund Olesen authored
It's more natural to use the actual end points. llvm-svn: 144515
-
Argyrios Kyrtzidis authored
llvm-svn: 144514
-
- Nov 13, 2011
-
-
Chandler Carruth authored
when I was reading through the code for style. llvm-svn: 144513
-
Jakob Stoklund Olesen authored
This makes no difference for normal defs, but early clobber dead defs now look like: [Slot_EarlyClobber; Slot_Dead) instead of: [Slot_EarlyClobber; Slot_Register). Live ranges for normal dead defs look like: [Slot_Register; Slot_Dead) as before. llvm-svn: 144512
-
Craig Topper authored
llvm-svn: 144511
-