Commits · 8cfcf586bbf5e83daff8aecd193b742002e961c1 · Lorenzo Albano / LLVM bpEVL

May 23, 2016

[X86][SSE] Added cvtdq2pd/cvtps2pd generic IR tests · 8cfcf586
Simon Pilgrim authored May 23, 2016
```
Added D20528 implementations as well as existing x86 intrinsics versions

llvm-svn: 270494
```

Make sure TestRedefinitionsInInlines.py actually inlines. · 1245c2b3

Chaoren Lin authored May 23, 2016

Reviewers: spyffe

Subscribers: lldb-commits

Differential Revision: http://reviews.llvm.org/D20540

llvm-svn: 270493


PrologEpilogInserter: Avoid an infinite loop when MinCSFrameIndex == 0 · 4a57bb5a

Justin Bogner authored May 23, 2016

Before r269750 we did the comparisons in this loop in signed ints so
that it DTRT when MinCSFrameIndex was 0. This was changed because it's
now possible for MinCSFrameIndex to be UINT_MAX, but that introduced a
bug when we were comparing `>= 0` - this is tautological in unsigned.

Rework the comparisons here to avoid issues with unsigned wrapping.

No test. I couldn't find a way to get any of the StackGrowsUp in-tree
targets to reach the code that sets MinCSFrameIndex.

llvm-svn: 270492
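
The wrap-around hazard described in this message can be reproduced outside LLVM. The following is a minimal sketch, not the PrologEpilogInserter code itself: `visitDescending` and its parameters are hypothetical names chosen for illustration. It walks an index range downward using only unsigned arithmetic and tests for the last index before decrementing, so it never relies on a `>= 0` comparison.

```cpp
#include <cassert>
#include <vector>

// Hypothetical helper, not LLVM code: returns Max, Max-1, ..., Min.
//
// The naive form `for (unsigned I = Max; I >= Min; --I)` never terminates
// when Min == 0: `I >= 0` is always true for an unsigned value, and `--I`
// wraps from 0 to UINT_MAX.
std::vector<unsigned> visitDescending(unsigned Min, unsigned Max) {
  std::vector<unsigned> Visited;
  if (Min > Max)
    return Visited; // Empty range: nothing to visit.

  for (unsigned I = Max;; --I) {
    Visited.push_back(I);
    if (I == Min)
      break; // Exit before the decrement that could wrap.
  }
  assert(Visited.back() == Min && "loop must end exactly at Min");
  return Visited;
}
```

With `Min == 0` the loop still stops after visiting index 0, because the exit test runs before the decrement that would wrap.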


Add printing of the Mach-O (__LLVM,__bundle) xar archive file section "verbosely" · 9873e2c4

Kevin Enderby authored May 23, 2016

to llvm-objdump. This section is created with the -fembed-bitcode option.

This requires the use of libxar, and the CMake and lit support were crafted by
Chris Bieneman!

rdar://26202242

llvm-svn: 270491


xfail TestRedefinitionsInline on Windows. · 76f3def5
Zachary Turner authored May 23, 2016
```
llvm-svn: 270490
```
[X86][SSE] Use shuffle/sext instead of deprecated (+ auto-upgraded) pmovsxwd intrinsic call · f615191f
Simon Pilgrim authored May 23, 2016
```
llvm-svn: 270489
```

We have many radars showing that stepping through C++ code can result in slow steps. · c2267787

Greg Clayton authored May 23, 2016

One of the things slowing us down is that the ItaniumABILanguageRuntime class doesn't cache a vtable-to-type mapping. As a result, on every step and for every variable, we read the first pointer in any C++ type that could be dynamic and look up the symbol, possibly in every symbol file. Some symbol files on Darwin can end up with thousands of .o files when using DWARF in .o files, so thousands of .o files are searched each time.

This fix caches lldb_private::Address (the resolved vtable symbol address in section + offset format) to TypeAndOrName instances inside the one ItaniumABILanguageRuntime in a process. This allows caching of dynamic types and stops us from always doing deep searches in each file.

<rdar://problem/18890778>

llvm-svn: 270488
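
The caching idea can be sketched in isolation. This is an illustration under simplifying assumptions, not LLDB's implementation: `DynamicTypeCache` is a hypothetical class, a plain 64-bit integer stands in for the section-plus-offset `lldb_private::Address` key, and a string stands in for `TypeAndOrName`. The expensive symbol search is modelled as a callback that runs only on a cache miss.

```cpp
#include <cassert>
#include <cstdint>
#include <functional>
#include <string>
#include <unordered_map>
#include <utility>

// Hypothetical stand-in for a per-process cache from a vtable address to
// the dynamic type resolved for it.
class DynamicTypeCache {
public:
  using Resolver = std::function<std::string(std::uint64_t)>;

  explicit DynamicTypeCache(Resolver Lookup) : SlowLookup(std::move(Lookup)) {
    assert(SlowLookup && "resolver must be callable");
  }

  // Returns the cached type name, running the slow lookup only the first
  // time a given vtable address is seen.
  const std::string &lookup(std::uint64_t VTableAddr) {
    auto It = Cache.find(VTableAddr);
    if (It != Cache.end())
      return It->second;
    ++SlowLookups;
    return Cache.emplace(VTableAddr, SlowLookup(VTableAddr)).first->second;
  }

  unsigned slowLookupCount() const { return SlowLookups; }

private:
  Resolver SlowLookup;
  std::unordered_map<std::uint64_t, std::string> Cache;
  unsigned SlowLookups = 0;
};
```

Repeated steps over variables of the same dynamic type then cost one hash lookup each instead of a search across symbol files.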


[Kaleidoscope] Add an initial "Building an ORC JIT" tutorial chapter. · 7331cc37

Lang Hames authored May 23, 2016

This is a work in progress - the chapter text is incomplete, though
the example code compiles and runs.

Feedback and patches are, as usual, most welcome.

llvm-svn: 270487


[SPARC] Fix 8 and 16-bit atomic load and store. · fdcc727d

James Y Knight authored May 23, 2016

They were accidentally using the 32-bit load/store instruction for
8/16-bit operations, due to incorrect patterns.

(8/16-bit cmpxchg and atomicrmw will be fixed in subsequent changes)

llvm-svn: 270486


Modify emitTypeInformation to use MemoryTypeTableBuilder, take 2 · 2280f932

Reid Kleckner authored May 23, 2016

This effectively reverts commit r270389 and re-lands r270106, but it's
almost a rewrite.

The behavior change in r270106 was that we could no longer assume that
each LF_FUNC_ID record got its own type index. This patch adds a map
from DINode* to TypeIndex, so we can stop making that assumption.

This change also emits padding bytes between type records similar to the
way MSVC does. The size of the type record includes the padding bytes.

llvm-svn: 270485
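
The padding rule in the last paragraph can be sketched as follows. This is an assumption-laden illustration, not the LLVM CodeView emitter: `padRecord` is a hypothetical helper, and the alignment and filler byte are left as parameters because the message does not state the values used.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical helper: appends filler bytes until the record length is a
// multiple of Alignment, then returns the padded length. The returned size
// is what would be reported for the record, padding included.
std::size_t padRecord(std::vector<std::uint8_t> &Record, std::size_t Alignment,
                      std::uint8_t Filler) {
  assert(Alignment != 0 && "alignment must be non-zero");
  while (Record.size() % Alignment != 0)
    Record.push_back(Filler);
  return Record.size();
}
```

For example, a 5-byte record padded to an assumed 4-byte boundary grows to 8 bytes, and 8 is the size recorded for it.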


[CUDA] Add -fcuda-approx-transcendentals flag. · 91f6f07b

Justin Lebar authored May 23, 2016

Summary:
This lets us emit e.g. sin.approx.f32.  See
http://docs.nvidia.com/cuda/parallel-thread-execution/#floating-point-instructions-sin

Reviewers: rnk

Subscribers: tra, cfe-commits

Differential Revision: http://reviews.llvm.org/D20493

llvm-svn: 270484


[profile] clean up runtime warnings. · 66a89196

Xinliang David Li authored May 23, 2016

 o make warning messages more meaningful to users.
 o add suggestions to fix the problem.
 o limit the maximum number of warnings printed.

llvm-svn: 270483


Fix filtering of prior declarations when checking for a tag redeclaration to · cc1b82be

Richard Smith authored May 23, 2016

map to the redecl context for both decls, not just one of them, and to properly
check that the decl contexts are equivalent.

llvm-svn: 270482


InsertPointAnalysis: Move current live interval from being a class member · f3c8f532
Wei Mi authored May 23, 2016
```
to query interfaces argument; NFC

Differential Revision: http://reviews.llvm.org/D20532

llvm-svn: 270481
```
tune lowering parameter for small apps (sjeng) · e4520760
Xinliang David Li authored May 23, 2016
```
llvm-svn: 270480
```

[InstCombine] Fix assertion when bitcast is converted to gep · 00e7092f

Gerolf Hoflehner authored May 23, 2016

When an aggregate contains an opaque type its size cannot be
determined. This triggers an "Invalid GetElementPtrInst indices for type" assert
in function checkGEPType. The fix suppresses the conversion in this case.

http://reviews.llvm.org/D20319

llvm-svn: 270479


[LoopUnroll] Enable advanced unrolling analysis by default. · be080fc5

Michael Zolotukhin authored May 23, 2016

Summary:
This patch turns on LoopUnrollAnalyzer by default. To mitigate compile
time regressions, I chose very conservative thresholds for now. Later we
can make them more aggressive, but it might require being smarter in
which loops we're optimizing. E.g. currently the biggest issue is that
with more aggressive thresholds we unroll many cold loops, which
increases compile time for no performance benefit (performance of those
loops is improved, but it doesn't matter since they are cold).

Test results for compile time (using 4 samples to reduce noise):
```
MultiSource/Benchmarks/VersaBench/ecbdes/ecbdes                  5.19%
SingleSource/Benchmarks/Polybench/medley/reg_detect/reg_detect   4.19%
MultiSource/Benchmarks/FreeBench/fourinarow/fourinarow           3.39%
MultiSource/Applications/JM/lencod/lencod                        1.47%
MultiSource/Benchmarks/Fhourstones-3_1/fhourstones3_1           -6.06%
```

I didn't see any performance changes in the testsuite, but it improves
some internal tests.

Reviewers: hfinkel, chandlerc

Subscribers: llvm-commits, mzolotukhin

Differential Revision: http://reviews.llvm.org/D20482

llvm-svn: 270478


Fix DEBUG logs in MachineLICM. · f6f4a2a9

Justin Lebar authored May 23, 2016

Summary:
MBBs don't necessarily have a name (in my experience, they almost never
do), in which case this logging is quite unhelpful.  The number seems to
work well.

Reviewers: iteratee

Subscribers: llvm-commits

Differential Revision: http://reviews.llvm.org/D20533

llvm-svn: 270477


add cmake files to Xcode project · 562469cc

Todd Fiala authored May 23, 2016

This makes it easier to use Xcode revision diffing tools on them.

llvm-svn: 270476


[codeview] Refactor symbol records to use same pattern as types. · a78ecd1e

Zachary Turner authored May 23, 2016

This will pave the way to introduce a full-fledged symbol visitor
similar to how we have a type visitor, thus allowing the same
dumping code to be used in llvm-readobj and llvm-pdbdump.

Differential Revision: http://reviews.llvm.org/D20384
Reviewed By: rnk

llvm-svn: 270475


Removed the m_decl_objects map from ClangASTContext. · 5ba3215f

Sean Callanan authored May 23, 2016

m_decl_objects is problematic because it assumes that each VarDecl has a unique
variable associated with it.  This is not the case in inline contexts.

Also the information in this map can be reconstructed very easily without
maintaining the map.  The rest of the testsuite passes with this change, and I've
added a testcase covering the inline contexts affected by this.

<rdar://problem/26278502>

llvm-svn: 270474


Committing for http://reviews.llvm.org/D20365 · 86d5f8ad
Mads Ravn authored May 23, 2016
```
llvm-svn: 270473
```
Committing for http://reviews.llvm.org/D20365 · dfa3b3d3
Mads Ravn authored May 23, 2016
```
llvm-svn: 270472
```

Remove dead code. · fa2f307c

Rui Ueyama authored May 23, 2016

The dead declarations made MSVC warn on explicit template
instantiations of the classes.

llvm-svn: 270471


Committing for http://reviews.llvm.org/D20365 · d01743a3
Mads Ravn authored May 23, 2016
```
llvm-svn: 270470
```
fix typo; NFC · 8099fb7e
Sanjay Patel authored May 23, 2016
```
llvm-svn: 270469
```

Fork performance improvements · b044e4fa

Jonathan Peyton authored May 23, 2016

Most of this is modifications to check for differences before updating data
fields in team struct. There is also some rearrangement of the team struct.

Patch by Diego Caballero

Differential Revision: http://reviews.llvm.org/D20487

llvm-svn: 270468
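
The check-before-update pattern mentioned above might look like the following sketch. `TeamInfo`, `updateIfChanged`, and `refreshTeam` are hypothetical names, not libomp's, and the real team struct has many more fields. Skipping a store when the value is unchanged is a common way to avoid dirtying cache lines that other threads read; that is one plausible source of the speedup, though the message does not say so.

```cpp
#include <cassert>

// Hypothetical, heavily reduced team descriptor.
struct TeamInfo {
  int NumThreads = 0;
  int Level = 0;
};

// Writes Value into Field only when it differs. Returns true if a store
// actually happened.
template <typename T>
bool updateIfChanged(T &Field, const T &Value) {
  if (Field == Value)
    return false;
  Field = Value;
  return true;
}

// Refreshes the descriptor for a new fork and reports how many fields were
// really written.
int refreshTeam(TeamInfo &Team, int NumThreads, int Level) {
  assert(NumThreads > 0 && "a team needs at least one thread");
  int Writes = 0;
  Writes += updateIfChanged(Team.NumThreads, NumThreads);
  Writes += updateIfChanged(Team.Level, Level);
  return Writes;
}
```

When consecutive forks request the same configuration, `refreshTeam` performs only comparisons and no stores.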


use range-loop; NFCI · 13a0d498
Sanjay Patel authored May 23, 2016
```
llvm-svn: 270467
```
llvm-dwp: More error handling around invalid compressed sections · 05f84cd3
David Blaikie authored May 23, 2016
```
llvm-svn: 270466
```
fix formatting; NFC · e8dc090a
Sanjay Patel authored May 23, 2016
```
llvm-svn: 270465
```

Allow unit testing on Windows · 1ab887d4

Jonathan Peyton authored May 23, 2016

These changes allow testing on Windows using clang.exe.
There are two main changes:
1. Only link to -lm when it actually exists on the system
2. Create basic versions of pthread_create() and pthread_join() for Windows.
   They are not POSIX compliant by any stretch but will allow any existing
   and future tests to use pthread_create() and pthread_join() for testing
   interactions of libomp with OS threads.

Differential Revision: http://reviews.llvm.org/D20391

llvm-svn: 270464


[WebAssembly] Speed up LiveIntervals updating. · 6c8f20d7

Dan Gohman authored May 23, 2016

Use the more specific LiveInterval::removeSegment instead of
LiveInterval::shrinkToUses when we know the specific range that's
being removed.

llvm-svn: 270463


llvm-dwp: Ensure compressed sections are preserved long enough for use in the string pool · 2e9bd893

David Blaikie authored May 23, 2016

Now that the string pool is referential rather than maintaining its own
copy of string data, compressed sections (well, technically only the
debug_str section*) need to be preserved for the lifetime of the pool to
match.

* I'm not currently optimizing for memory footprint with compressed
  input - the major memory limit I'm hitting is on dwp+dwp merge steps
  and we aren't currently compressing contents in dwp files, just in the
  .dwo inputs.

llvm-svn: 270462


Address post-commit review feedback to r270457 · 7eca8a3d

David Majnemer authored May 23, 2016

Add two tests which show our error handling behavior for invalid
parameters in the layout_version and empty_bases attributes.

Amend our documentation to make it more clear that
__declspec(empty_bases) and __declspec(layout_version) can only apply to
classes, structs, and unions.

llvm-svn: 270461


Always rerun all tests on Windows. · fa7f9482

Zachary Turner authored May 23, 2016

There is flakiness somewhere in the core infrastructure on Windows,
so to get the buildbot reliably green we need to mark all tests
as flaky.

llvm-svn: 270460


[Hexagon] Move some debug-only variable declarations into DEBUG · 4a3e285e
Krzysztof Parzyszek authored May 23, 2016
```
llvm-svn: 270459
```

Clang support for __is_assignable intrinsic · b3d96882

David Majnemer authored May 23, 2016

MSVC now supports the __is_assignable type trait intrinsic,
to enable easier and more efficient implementation of the
Standard Library's is_assignable trait.
As of Visual Studio 2015 Update 3, the VC Standard Library
implementation uses the new intrinsic unconditionally.

The implementation is pretty straightforward due to the previously
existing is_nothrow_assignable and is_trivially_assignable.
We handle __is_assignable via the same code as the other two except
that we skip the extra checks for nothrow or triviality.

Patch by Dave Bartolomeo!

Differential Revision: http://reviews.llvm.org/D20492

llvm-svn: 270458
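
What the trait reports can be seen through the standard `std::is_assignable`, which this sketch uses instead of calling the `__is_assignable` intrinsic directly. `NotAssignable` and `canAssign` are hypothetical names introduced for the example.

```cpp
#include <string>
#include <type_traits>

// Hypothetical type whose copy assignment is deleted.
struct NotAssignable {
  NotAssignable &operator=(const NotAssignable &) = delete;
};

// Thin wrapper over the standard trait: true when the expression
// `std::declval<To>() = std::declval<From>()` is well-formed.
template <typename To, typename From>
constexpr bool canAssign() {
  return std::is_assignable<To, From>::value;
}

static_assert(canAssign<int &, int>(), "an int lvalue accepts an int");
static_assert(!canAssign<int, int>(), "an int prvalue cannot be assigned to");
static_assert(canAssign<std::string &, const char *>(),
              "std::string has an assignment operator taking const char *");
static_assert(!canAssign<NotAssignable &, const NotAssignable &>(),
              "a deleted operator= makes the assignment ill-formed");
```

The nothrow and trivial variants answer the same question with an extra condition on top, which matches the message's description of sharing the code path and skipping the extra checks.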


[MS ABI] Implement __declspec(empty_bases) and __declspec(layout_version) · cd3ebfe2

David Majnemer authored May 23, 2016

The layout_version attribute is pretty straightforward: use the layout
rules from version XYZ of MSVC when used like
struct __declspec(layout_version(XYZ)) S {};

The empty_bases attribute is more interesting.  It tries to get the C++
empty base optimization to fire more often by tweaking the MSVC ABI
rules in subtle ways:
1. Disable the leading and trailing zero-sized object flags if a class
   is marked __declspec(empty_bases) and is empty.

   This means that given:
   struct __declspec(empty_bases) A {};
   struct __declspec(empty_bases) B {};
   struct C : A, B {};

   'C' will have size 1 and nvsize 0 despite not being annotated
   __declspec(empty_bases).

2. When laying out virtual or non-virtual bases, disable the injection
   of padding between classes if the most derived class is marked
   __declspec(empty_bases).

   This means that given:
   struct A {};
   struct B {};
   struct __declspec(empty_bases) C : A, B {};

   'C' will have size 1 and nvsize 0.

3. When calculating the offset of a non-virtual base, choose offset zero
   if the most derived class is marked __declspec(empty_bases) and the
   base is empty _and_ has an nvsize of 0.

   Because of the ABI rules, this does not mean that empty bases
   reliably get placed at offset 0!

   For example:
   struct A {};
   struct B {};
   struct __declspec(empty_bases) C : A, B { virtual ~C(); };

   'C' will be pointer sized to account for the vfptr at offset 0.
   'A' and 'B' will _not_ be at offset 0 despite being empty!
   Instead, they will be located right after the vfptr.

   This occurs due to the interaction between non-virtual base layout
   and virtual function pointer injection: injection occurs after the
   nv-bases and shifts them down by the size of a pointer.

llvm-svn: 270457


SBValue::CreateValueFromData didn’t check whether the SBType passed into it is... · 64c034da

Enrico Granata authored May 23, 2016

SBValue::CreateValueFromData didn’t check whether the SBType passed into it is in fact a valid type - this can lead to LLDB crashing upon access

Committing on behalf of Sebastian Theophil

llvm-svn: 270456


Do not split mergeable sections if they are gc'ed. · b91bf1a9

Rui Ueyama authored May 23, 2016

Previously, mergeable sections' constructors did more than just
set member variables; they split section contents into small
pieces. This is not always a computationally cheap task, because if
the section is a mergeable string section, it needs to scan the
entire section to split it at NUL characters.

If a section would be thrown away by GC, that cost ended up
being a waste of time. It is going to be a larger problem if the
section is compressed -- the whole time to uncompress it and
split it up is going to be a waste.

Luckily, we can defer section splitting until after GC. We just have
to remember which offsets are in use during GC and apply that later.
This patch implements it.

Differential Revision: http://reviews.llvm.org/D20516

llvm-svn: 270455
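
The deferral described above can be sketched in a few lines. This models the idea only and is not lld's code: `MergeableStrings`, `markLive`, and `splitLive` are hypothetical names, and the section is represented as a `std::string` with embedded NUL terminators. The constructor merely stores the data, GC records the offsets it found in use, and splitting happens afterwards.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <utility>
#include <vector>

// Hypothetical model of a mergeable string section.
class MergeableStrings {
public:
  // Cheap construction: no scanning, no splitting.
  explicit MergeableStrings(std::string Contents) : Data(std::move(Contents)) {}

  // Called during GC for each offset that something still references.
  void markLive(std::size_t Offset) {
    assert(Offset < Data.size() && "offset outside the section");
    LiveOffsets.push_back(Offset);
  }

  // Called after GC: splits at NUL characters and keeps only the pieces
  // that contain at least one live offset.
  std::vector<std::string> splitLive() const {
    std::vector<std::string> Pieces;
    std::size_t Start = 0;
    while (Start < Data.size()) {
      std::size_t End = Data.find('\0', Start);
      if (End == std::string::npos)
        End = Data.size();
      for (std::size_t Offset : LiveOffsets) {
        if (Offset >= Start && Offset <= End) {
          Pieces.push_back(Data.substr(Start, End - Start));
          break;
        }
      }
      Start = End + 1;
    }
    return Pieces;
  }

private:
  std::string Data;
  std::vector<std::size_t> LiveOffsets;
};
```

If GC discards the whole section, `splitLive` is never called, so the scan (and, for a compressed section, the decompression that would precede it) is skipped entirely.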
