Commits · 67107ea1af126a253a27fe812d38b715f54b7148 · Roger Ferrer / llvm-epi-0.8

Nov 17, 2013

Fix ndebug-build unused variable in loop rerolling · 67107ea1
Hal Finkel authored Nov 17, 2013
```
llvm-svn: 194941
```
67107ea1
Use right address space pointer size · 36f5eb59
Matt Arsenault authored Nov 17, 2013
```
llvm-svn: 194940
```
36f5eb59

Hal Finkel authored Nov 16, 2013

This adds a loop rerolling pass: the opposite of (partial) loop unrolling. The
transformation aims to take loops like this:

for (int i = 0; i < 3200; i += 5) {
  a[i]     += alpha * b[i];
  a[i + 1] += alpha * b[i + 1];
  a[i + 2] += alpha * b[i + 2];
  a[i + 3] += alpha * b[i + 3];
  a[i + 4] += alpha * b[i + 4];
}

and turn them into this:

for (int i = 0; i < 3200; ++i) {
  a[i] += alpha * b[i];
}

and loops like this:

for (int i = 0; i < 500; ++i) {
  x[3*i] = foo(0);
  x[3*i+1] = foo(0);
  x[3*i+2] = foo(0);
}

and turn them into this:

for (int i = 0; i < 1500; ++i) {
  x[i] = foo(0);
}

There are two motivations for this transformation:

  1. Code-size reduction (especially relevant, obviously, when compiling for
code size).

  2. Providing greater choice to the loop vectorizer (and generic unroller) to
choose the unrolling factor (and a better ability to vectorize). The loop
vectorizer can take vector lengths and register pressure into account when
choosing an unrolling factor, for example, and a pre-unrolled loop limits that
choice. This is especially problematic if the manual unrolling was optimized
for a machine different from the current target.

The current implementation is limited to single basic-block loops only. The
rerolling recognition should work regardless of how the loop iterations are
intermixed within the loop body (subject to dependency and side-effect
constraints), but the significant restriction is that the order of the
instructions in each iteration must be identical. This seems sufficient to
capture all current use cases.

This pass is not currently enabled by default at any optimization level.

llvm-svn: 194939

bf45efde

ObjectiveC ARC. More validation of toll-free bridging of · 2c312128
Fariborz Jahanian authored Nov 16, 2013
```
CF objects with objc_bridge'ing annotaiton.
// rdar://15454846

llvm-svn: 194938
```
2c312128

Nov 16, 2013
- The WebKit_JS CC preserves the same registers as the C CC. · 565acf92
  Juergen Ributzka authored Nov 16, 2013
```
llvm-svn: 194936
```
  565acf92
- Apply the InstCombine fptrunc sqrt optimization to llvm.sqrt · 12100bf7
  Hal Finkel authored Nov 16, 2013
```
InstCombine, in visitFPTrunc, applies the following optimization to sqrt calls:

  (fptrunc (sqrt (fpext x))) -> (sqrtf x)

but does not apply the same optimization to llvm.sqrt. This is a problem
because, to enable vectorization, Clang generates llvm.sqrt instead of sqrt in
fast-math mode, and because this optimization is being applied to sqrt and not
applied to llvm.sqrt, sometimes the fast-math code is slower.

This change makes InstCombine apply this optimization to llvm.sqrt as well.

This fixes the specific problem in PR17758, although the same underlying issue
(optimizations applied to libcalls are not applied to intrinsics) exists for
other optimizations in SimplifyLibCalls.

llvm-svn: 194935
```
  12100bf7
- Fix assert on unaligned access to global with different address space size. · dfb3e709
  Matt Arsenault authored Nov 16, 2013
```
llvm-svn: 194934
```
  dfb3e709
- Fix codegen for null different sized pointer. · 19231e63
  Matt Arsenault authored Nov 16, 2013
```
llvm-svn: 194932
```
  19231e63
- ScopDetection: Improve formatting · 378a9f2b
  Tobias Grosser authored Nov 16, 2013
```
llvm-svn: 194931
```
  378a9f2b
- ObjectiveC ARC. Validate toll free bridge casting · 8a0210e5
  Fariborz Jahanian authored Nov 16, 2013
```
of ObjectiveC objects to CF types when CF type
has the objc_bridge attribute.

llvm-svn: 194930
```
  8a0210e5
- ScalarEvolution: Warn if the result of setFlags/clearFlags is unused. · c6f95576
  Benjamin Kramer authored Nov 16, 2013
```
This was a source of bugs in the past.

llvm-svn: 194929
```
  c6f95576
- Annotate APInt methods where it's not clear whether they are in place with warn_unused_result. · 5f2768c3
  Benjamin Kramer authored Nov 16, 2013
```
Fix ScalarEvolution bugs uncovered by this.

llvm-svn: 194928
```
  5f2768c3
- R600: Make dot_4 instructions predicable · 745d4298
  Vincent Lejeune authored Nov 16, 2013
```
llvm-svn: 194927
```
  745d4298
- Use array_pod_sort instead of std::sort · 0c8d604f
  Duncan P. N. Exon Smith authored Nov 16, 2013
```
Per Rafael's review of r194514.

llvm-svn: 194926
```
  0c8d604f
- InstCombine: fold (A >> C) == (B >> C) --> (A^B) < (1 << C) for constant Cs. · 03f3e248
  Benjamin Kramer authored Nov 16, 2013
```
This is common in bitfield code.

llvm-svn: 194925
```
  03f3e248
- Fix filename in header comment · 38fc2e7a
  Duncan P. N. Exon Smith authored Nov 16, 2013
```
llvm-svn: 194924
```
  38fc2e7a
- prepend LLVM to all Polly* libs · 3d1806b9
  Sebastian Pop authored Nov 16, 2013
```
llvm-svn: 194923
```
  3d1806b9
- factor out code in shouldEnablePolly · 8d6cca19
  Sebastian Pop authored Nov 16, 2013
```
to be able to call the same functionality from registerPollyEarlyAsPossiblePasses
and registerPollyOptLevel0Passes.

llvm-svn: 194922
```
  8d6cca19
- move MayAliasSet.cpp into lib/Analysis · 4915ccbe
  Sebastian Pop authored Nov 16, 2013
```
llvm-svn: 194921
```
  4915ccbe
- Remove unused but set variable. · 847c1d90
  Benjamin Kramer authored Nov 16, 2013
```
llvm-svn: 194920
```
  847c1d90
- Move remaining %clang_cc1 tests out of test/Driver · b504417b
  Alp Toker authored Nov 16, 2013
```
clang -cc1 skips the driver so it never made sense to include these with the
Driver tests.

Basic type tests and flag tests generally both go in Frontend.

Now that the final -cc1 tests have been moved out of test/Driver, add a
local substitution to enforce and detect future mistakes.

These miscategorized tests were probably the source of confusion in r194817.

llvm-svn: 194919
```
  b504417b
- gtest-death-test.cc: Move ~DeathTestFactory() to unbreak cygming build since r194865. · f8d6c690
  NAKAMURA Takumi authored Nov 16, 2013
```
llvm-svn: 194918
```
  f8d6c690
- Debug Info Verifier: remove un-used argument in verifyDebugInfo. · 23662907
  Manman Ren authored Nov 16, 2013
```
No functionality change.

llvm-svn: 194917
```
  23662907
- If a replaceable global operator new/delete is marked inline, don't warn if · fa27bc4c
  Richard Smith authored Nov 16, 2013
```
it's also __attribute__((used)), since that undoes the problematic part of
'inline'.

llvm-svn: 194916
```
  fa27bc4c
- ObjetiveC ARC. Start diagnosing invalid toll free bridging. · f07183ce
  Fariborz Jahanian authored Nov 16, 2013
```
// rdar://15454846.

llvm-svn: 194915
```
  f07183ce
- Move the entire debug print loop into DEBUG_WITH_TYPE. · b37c431d
  Rui Ueyama authored Nov 16, 2013
```
No functionality change.

llvm-svn: 194914
```
  b37c431d
- Replace one more magic number with sizeof(). · a3ada6b0
  Rui Ueyama authored Nov 16, 2013
```
llvm-svn: 194913
```
  a3ada6b0
- Add a new SBThread::GetExtendedBacktraceOriginatingIndexID() method · 8ee9cb58
  Jason Molenda authored Nov 16, 2013
```
(and same thing to Thread base class) which can be used when looking
at an ExtendedBacktrace thread; it will try to find the IndexID() of
the original thread that was executing this backtrace when it was
recorded.  If lldb can't find a record of that thread, it will return
the same value as IndexID() for the ExtendedBacktrace thread.

llvm-svn: 194912
```
  8ee9cb58
- Use early continue. · 5dcabbc9
  Rui Ueyama authored Nov 16, 2013
```
llvm-svn: 194911
```
  5dcabbc9
- Style fixes, brought to you by clang-format · 1c84d804
  Tobias Grosser authored Nov 16, 2013
```
llvm-svn: 194910
```
  1c84d804
- Simplify. No functionality change. · e4d20ab7
  Rui Ueyama authored Nov 16, 2013
```
llvm-svn: 194909
```
  e4d20ab7
- Replace duplicate code with calls to getOrPushAttribute(). · 4072d91a
  Rui Ueyama authored Nov 16, 2013
```
llvm-svn: 194908
```
  4072d91a
- X86: Make specifying avx2 simpler on Darwin with '-arch' · 82eee268
  Jim Grosbach authored Nov 16, 2013
```
Teach the '-arch' command line option to enable the compiler-friendly
features of core-avx2 CPUs on Darwin. Pass the information along in the
target triple like Darwin+ARM does.

llvm-svn: 194907
```
  82eee268
- X86: Encode the 'h' cpu subtype in the MachO header for x86. · 664d148a
  Jim Grosbach authored Nov 16, 2013
```
llvm-svn: 194906
```
  664d148a
- Downgrade the Error on an 'inline' operator new or delete to an ExtWarn. Some · 13dfdc88
  Richard Smith authored Nov 16, 2013
```
projects are relying on such (questionable) practices, so we should give them
a way to opt out of this diagnostic.

llvm-svn: 194905
```
  13dfdc88
- Mention address space related changes in release notes. · b8342261
  Matt Arsenault authored Nov 16, 2013
```
llvm-svn: 194904
```
  b8342261
- Use correct size for address space in BasicAA. · a8fe22ba
  Matt Arsenault authored Nov 16, 2013
```
The tests just hit this with a different sized
address space since I haven't figured out how
to use this to break it.

I thought I committed this a long time ago,
and I'm not sure why missing this hasn't caused
any problems.

llvm-svn: 194903
```
  a8fe22ba
- DwarfCompileUnit: Push type safety of DIDescriptor through CompileUnit::createAndAddDIE. · 52c5020d
  David Blaikie authored Nov 16, 2013
```
llvm-svn: 194902
```
  52c5020d
- DwarfCompileUnit: Remove unnecessary OwningPtr<T>::get() call · eb0338fe
  David Blaikie authored Nov 16, 2013
```
llvm-svn: 194901
```
  eb0338fe
- Consumed analysis: track state of temporary objects. · 68cc3f13
  DeLesley Hutchins authored Nov 16, 2013
```
Earlier versions discarded the state too soon, and did not track state changes,
e.g. when passing a temporary to a move constructor.  Patch by
chris.wailes@gmail.com; review and minor fixes by delesley.

llvm-svn: 194900
```
  68cc3f13