Commits · 508dd9b94c7c310a42982fae9b22fc21ebcdd482 · Roger Ferrer / llvm-epi

Dec 17, 2014

tsan: add disabled test case for issue 87 · 508dd9b9
Dmitry Vyukov authored Dec 17, 2014
```
llvm-svn: 224422
```
508dd9b9
Teach lit.cfg to recognize -windows-gnu in addition to -mingw32. · f6309716
Yaron Keren authored Dec 17, 2014
```
llvm-svn: 224421
```
f6309716

Peter Collingbourne authored Dec 17, 2014

Patch by Andrew Wilkins!

canAvoidElementLoad and canAvoidLoad were incorrectly
eliding loads when an index expression is used as an
another array index expression. This led to a panic.

See comments on https://github.com/go-llvm/llgo/issues/175

Test Plan: lit test added

Differential Revision: http://reviews.llvm.org/D6676

llvm-svn: 224420

1f89ffdf

clang-format: Fix incorrect calculation of token lenghts. · 0580ff0e
Daniel Jasper authored Dec 17, 2014
```
This led, e.g. to break JavaScript regex literals too early.

llvm-svn: 224419
```
0580ff0e
Added 5 more tests related to sink store revision 224247 · 028e966a
Elena Demikhovsky authored Dec 17, 2014
```
- by Ella Bolshinsky

http://reviews.llvm.org/D6420

llvm-svn: 224418
```
028e966a

Strength reduce intrinsics with overflow into regular arithmetic operations if possible. · a451b9b0

Erik Eckstein authored Dec 17, 2014

Some intrinsics, like s/uadd.with.overflow and umul.with.overflow, are already strength reduced.
This change adds other arithmetic intrinsics: s/usub.with.overflow, smul.with.overflow.
It completes the work on PR20194.

llvm-svn: 224417

a451b9b0

Revert "Linker: Drop superseded subprograms" · 92731d26

Duncan P. N. Exon Smith authored Dec 17, 2014

This reverts commit r224389.  Based on feedback from the bots, the
assertion seems to be going off *more* often, not less (previously I was
just seeing it in an internal bootstrap, now it's happening in public
builds too).

http://lab.llvm.org:8080/green/job/clang-stage2-configure-Rlto_build/936/
http://lab.llvm.org:8011/builders/sanitizer-x86_64-linux-bootstrap/builds/5325

Reverting in order to investigate.

llvm-svn: 224416

92731d26

Add parsing of 'foo@local". · 0c0d5def

Justin Hibbits authored Dec 17, 2014

Summary:
Currently, it supports generating, but not parsing, this expression.
Test added as well.

Test Plan: New test added, no regressions due to this.

Reviewers: hfinkel

Reviewed By: hfinkel

Subscribers: llvm-commits

Differential Revision: http://reviews.llvm.org/D6672

llvm-svn: 224415

0c0d5def

Remove a debugging assert. · 5f060309

Rafael Espindola authored Dec 17, 2014

Sorry for the noise, I have no idea how it survived to the final version.

llvm-svn: 224414

5f060309

Remove unused includes and out of date comment. NFC. · 839353bc
Rafael Espindola authored Dec 17, 2014
```
llvm-svn: 224413
```
839353bc
Fix the windows build. · 81adfb5c
Rafael Espindola authored Dec 17, 2014
```
llvm-svn: 224412
```
81adfb5c

Sema: Don't dyn_cast a null pointer in CheckUsingDeclQualifier · 4d2de1b0

David Majnemer authored Dec 17, 2014

This code was written with the intent that a pointer could be null but
we dyn_cast'd it anyway.  Change the dyn_cast to a dyn_cast_or_null.

This fixes PR21933.

llvm-svn: 224411

4d2de1b0

Refactor and simplify the code reading /proc/cpuinfo. NFC. · 97935a91
Rafael Espindola authored Dec 17, 2014
```
llvm-svn: 224410
```
97935a91
RegisterCoalescer: Sprinkle some const modifiers. · f4a72cd0
Matthias Braun authored Dec 17, 2014
```
llvm-svn: 224409
```
f4a72cd0

llvm-lto: Add testing coverage for local contexts · f9abf4fb

Duncan P. N. Exon Smith authored Dec 17, 2014

Add coverage in `llvm-lto` for the API exposed by libLTO to create
modules in local contexts.

The goal here isn't to test the symbol-related API extensively, just to
confirm that these modules work at all.  (I'll be shifting code around
soon that should be NFC and I realized there was no test coverage.)

llvm-svn: 224408

f9abf4fb

Delete debugging cruft that crept in with r223802. · 52ee5e44
Nick Lewycky authored Dec 17, 2014
```
llvm-svn: 224407
```
52ee5e44

[ASan] Re-structure the allocator code. NFC. · b2dcac0b

Alexey Samsonov authored Dec 17, 2014

Introduce "Allocator" object, which contains all the bits and pieces
ASan allocation machinery actually use: allocator from sanitizer_common,
quarantine, fallback allocator and quarantine caches, fallback mutex.

This step is a preparation to adding more state to this object. We want
to reduce dependency of Allocator on commandline flags and be able to
"safely" modify its behavior (such as the size of the redzone) at
runtime.

llvm-svn: 224406

b2dcac0b

InstSimplify: shl nsw/nuw undef, %V -> undef · 65c52ae8

David Majnemer authored Dec 17, 2014

We can always choose an value for undef which might cause %V to shift
out an important bit except for one case, when %V is zero.

However, shl behaves like an identity function when the right hand side
is zero.

llvm-svn: 224405

65c52ae8

Make ValueEnumerator::print use OS for metadata too. Noticed by inspection. · ee0a3a7a
Nick Lewycky authored Dec 17, 2014
```
llvm-svn: 224404
```
ee0a3a7a

Parse: Consume tokens more carefully in CheckForLParenAfterColonColon · 6ca445e0

David Majnemer authored Dec 17, 2014

We would consume the lparen even if it wasn't followed by an identifier
or a star-identifier pair.

This fixes PR21815.

llvm-svn: 224403

6ca445e0

[CodeGenPrepare] Reapply r224351 with a fix for the assertion failure: · fc2201e9

Quentin Colombet authored Dec 17, 2014

The type promotion helper does not support vector type, so when make
such it does not kick in in such cases.

Original commit message:
[CodeGenPrepare] Move sign/zero extensions near loads using type promotion.

This patch extends the optimization in CodeGenPrepare that moves a sign/zero
extension near a load when the target can combine them. The optimization may
promote any operations between the extension and the load to make that possible.

Although this optimization may be beneficial for all targets, in particular
AArch64, this is enabled for X86 only as I have not benchmarked it for other
targets yet.


** Context **

Most targets feature extended loads, i.e., loads that perform a zero or sign
extension for free. In that context it is interesting to expose such pattern in
CodeGenPrepare so that the instruction selection pass can form such loads.
Sometimes, this pattern is blocked because of instructions between the load and
the extension. When those instructions are promotable to the extended type, we
can expose this pattern.


** Motivating Example **

Let us consider an example:
define void @foo(i8* %addr1, i32* %addr2, i8 %a, i32 %b) {
  %ld = load i8* %addr1
  %zextld = zext i8 %ld to i32
  %ld2 = load i32* %addr2
  %add = add nsw i32 %ld2, %zextld
  %sextadd = sext i32 %add to i64
  %zexta = zext i8 %a to i32
  %addza = add nsw i32 %zexta, %zextld
  %sextaddza = sext i32 %addza to i64
  %addb = add nsw i32 %b, %zextld
  %sextaddb = sext i32 %addb to i64
  call void @dummy(i64 %sextadd, i64 %sextaddza, i64 %sextaddb)
  ret void
}

As it is, this IR generates the following assembly on x86_64:
[...]
  movzbl  (%rdi), %eax   # zero-extended load
  movl  (%rsi), %es      # plain load
  addl  %eax, %esi       # 32-bit add
  movslq  %esi, %rdi     # sign extend the result of add
  movzbl  %dl, %edx      # zero extend the first argument
  addl  %eax, %edx       # 32-bit add
  movslq  %edx, %rsi     # sign extend the result of add
  addl  %eax, %ecx       # 32-bit add
  movslq  %ecx, %rdx     # sign extend the result of add
[...]
The throughput of this sequence is 7.45 cycles on Ivy Bridge according to IACA.

Now, by promoting the additions to form more extended loads we would generate:
[...]
  movzbl  (%rdi), %eax   # zero-extended load
  movslq  (%rsi), %rdi   # sign-extended load
  addq  %rax, %rdi       # 64-bit add
  movzbl  %dl, %esi      # zero extend the first argument
  addq  %rax, %rsi       # 64-bit add
  movslq  %ecx, %rdx     # sign extend the second argument
  addq  %rax, %rdx       # 64-bit add
[...]
The throughput of this sequence is 6.15 cycles on Ivy Bridge according to IACA.

This kind of sequences happen a lot on code using 32-bit indexes on 64-bit
architectures.

Note: The throughput numbers are similar on Sandy Bridge and Haswell.


** Proposed Solution **

To avoid the penalty of all these sign/zero extensions, we merge them in the
loads at the beginning of the chain of computation by promoting all the chain of
computation on the extended type. The promotion is done if and only if we do not
introduce new extensions, i.e., if we do not degrade the code quality.
To achieve this, we extend the existing “move ext to load” optimization with the
promotion mechanism introduced to match larger patterns for addressing mode
(r200947).
The idea of this extension is to perform the following transformation:
ext(promotableInst1(...(promotableInstN(load))))
=>
promotedInst1(...(promotedInstN(ext(load))))

The promotion mechanism in that optimization is enabled by a new TargetLowering
switch, which is off by default. In other words, by default, the optimization
performs the “move ext to load” optimization as it was before this patch.


** Performance **

Configuration: x86_64: Ivy Bridge fixed at 2900MHz running OS X 10.10.
Tested Optimization Levels: O3/Os
Tests: llvm-testsuite + externals.
Results:
- No regression beside noise.
- Improvements:
CINT2006/473.astar:  ~2%
Benchmarks/PAQ8p: ~2%
Misc/perlin: ~3%

The results are consistent for both O3 and Os.

<rdar://problem/18310086>

llvm-svn: 224402

fc2201e9

Add missing testcase from r224388. · b9be608f
Richard Smith authored Dec 17, 2014
```
llvm-svn: 224401
```
b9be608f
Add printing the LC_ENCRYPTION_INFO_64 load command with llvm-objdump’s -private-headers · 57538299
Kevin Enderby authored Dec 17, 2014
```
and add tests for the two AArch64 binaries.

llvm-svn: 224400
```
57538299
PR21875: codegen for non-type template parameters of nullptr_t type · 8b979f01
David Blaikie authored Dec 17, 2014
```
llvm-svn: 224399
```
8b979f01

[CallGraph] Make sure the edges are not missed due to re-declarations · 87d404d4

Anna Zaks authored Dec 17, 2014

A patch by Daniel DeFreez!

We were previously dropping edges on re-declarations. Store the
canonical declarations in the graph to ensure that different
references to the same function end up reflected with the same call graph
node.

(Note, this might lead to performance fluctuation because call graph
is used to determine the function analysis order.)

llvm-svn: 224398

87d404d4

Revert "[CodeGenPrepare] Move sign/zero extensions near loads using type promotion." · 04b69f89
Reid Kleckner authored Dec 17, 2014
```
This reverts commit r224351. It causes assertion failures when building
ICU.

llvm-svn: 224397
```
04b69f89
Rename asan_allocator2.cc to asan_allocator.cc · 2c31cc3c
Alexey Samsonov authored Dec 17, 2014
```
llvm-svn: 224396
```
2c31cc3c

[ASan] Introduce SetCanPoisonMemory() function. · 91bb25f5

Alexey Samsonov authored Dec 17, 2014

SetCanPoisonMemory()/CanPoisonMemory() functions are now used
instead of "poison_heap" flag to determine if ASan is allowed
to poison the shadow memory. This allows to hot-patch this
value in runtime (e.g. during ASan activation) without introducing
a data race.

llvm-svn: 224395

91bb25f5

PR21909: Don't try (and crash) to generate debug info for explicit... · 0317bc9e

David Blaikie authored Dec 16, 2014

PR21909: Don't try (and crash) to generate debug info for explicit instantiations of explicit specializations.

llvm-svn: 224394

0317bc9e

SelectionDAG switch lowering: use 'unsigned' to count destination popularity · 224cb82a

Hans Wennborg authored Dec 16, 2014

SwitchInst::getNumCases() returns unsinged, so using uint64_t to count cases
seems unnecessary.

Also fix a missing CHECK in the test case.

llvm-svn: 224393

224cb82a

Add the ability to tag one or more breakpoints with a name. These · 5e09c8c3

Jim Ingham authored Dec 16, 2014

names can then be used in place of breakpoint id's or breakpoint id 
ranges in all the commands that operate on breakpoints.

<rdar://problem/10103959>

llvm-svn: 224392

5e09c8c3

[Hexagon] Updating doubleword shift usages to new versions. · aa1bade7
Colin LeMahieu authored Dec 16, 2014
```
llvm-svn: 224391
```
aa1bade7
Add printing the LC_ENCRYPTION_INFO load command with llvm-objdump’s -private-headers. · 0804f467
Kevin Enderby authored Dec 16, 2014
```
llvm-svn: 224390
```
0804f467

Linker: Drop superseded subprograms · 87590268

Duncan P. N. Exon Smith authored Dec 16, 2014

When a function gets replaced by `ModuleLinker`, drop superseded
subprograms.  This ensures that the "first" subprogram pointing at a
function is the same one that `!dbg` references point at.

This is a stop-gap fix for PR21910.  Notably, this fixes Release+Asserts
bootstraps that are currently asserting out in
`LexicalScopes::initialize()` due to the explicit instantiations in
`lib/IR/Dominators.cpp` eventually getting replaced by -argpromotion.

llvm-svn: 224389

87590268

DR1684: a constexpr member function need not be a member of a literal class type. · d52186ff
Richard Smith authored Dec 16, 2014
```
llvm-svn: 224388
```
d52186ff
Fix test cases given Clang's improved location information. · 5413abf8
David Blaikie authored Dec 16, 2014
```
llvm-svn: 224387
```
5413abf8

Try typo correction on all initialization arguments and be less · 938204aa

Kaelyn Takata authored Dec 16, 2014

pessimistic about when to do so.

This also fixes PR21905 as the initialization argument was no longer
viewed as being type dependent due to the TypoExpr being type-cast.

llvm-svn: 224386

938204aa

Dec 16, 2014

DebugInfo: Generalize debug info location handling · bf22a4ea

David Blaikie authored Dec 16, 2014

This is a more scalable (fixed in mostly one place, rather than many
places that will need constant improvement/maintenance) solution to
several commits I've made recently to increase source fidelity for
subexpressions.

This resetting had to be done at the DebugLoc level (not the
SourceLocation level) to preserve scoping information (if the resetting
was done with CGDebugInfo::EmitLocation, it would've caused the tail end
of an expression's codegen to end up in a potentially different scope
than the start, even though it was at the same source location). The
drawback to this is that it might leave CGDebugInfo out of sync. Ideally
CGDebugInfo shouldn't have a duplicate sense of the current
SourceLocation, but for now it seems it does... - I don't think I'm
going to tackle removing that just now.

I expect this'll probably cause some more buildbot fallout & I'll
investigate that as it comes up.

Also these sort of improvements might be starting to show a weakness/bug
in LLVM's line table handling: we don't correctly emit is_stmt for
statements, we just put it on every line table entry. This means one
statement split over multiple lines appears as multiple 'statements' and
two statements on one line (without column info) are treated as one
statement.

I don't think we have any IR representation of statements that would
help us distinguish these cases and identify the beginning of each
statement - so that might be something we need to add (possibly to the
lexical scope chain - a scope for each statement). This does cause some
problems for GDB and possibly other DWARF consumers.

llvm-svn: 224385

bf22a4ea

fix typo, add spaces; NFC · 494a625f
Sanjay Patel authored Dec 16, 2014
```
llvm-svn: 224384
```
494a625f

[X86][SSE] Vector double -> float conversion memory folding (cvtpd2ps) · bf1e0790

Simon Pilgrim authored Dec 16, 2014

Added a missing memory folding relationship for the (V)CVTPD2PS instruction - we can safely fold these for stack reloads.

Differential Revision: http://reviews.llvm.org/D6663

llvm-svn: 224383

bf1e0790