Commits · a69e5b8b9d32801ed0db5fe2c112a9f81808ddee · Roger Ferrer / llvm-epi-0.8

Jan 20, 2014

Update StackProtector when coloring merges stack slots · a69e5b8b

Hal Finkel authored Jan 20, 2014

StackProtector keeps a ValueMap of alloca instructions to layout kind tags for
use by PEI and other later passes. When stack coloring replaces one alloca with
a bitcast to another one, the key replacement in this map does not work.
Instead, provide an interface to manage this updating directly. This seems like
an improvement over the old behavior, where the layout map would not get
updated at all when the stack slots were merged. In practice, however, there is
likely no observable difference because PEI only did anything special with
'large array' kinds, and if one large array is merged with another, then the
replacement should already have been a large array.

This is an attempt to unbreak the clang-x86_64-darwin11-RA builder.

llvm-svn: 199684

a69e5b8b

[X86] Teach how to combine a vselect into a movss/movsd · 450d1661

Andrea Di Biagio authored Jan 20, 2014

Add target specific rules for combining vselect dag nodes into movss/movsd
when possible.

If the vector type of the vselect dag node in input is either MVT::v4i32 or
MVT::v4f32, then try to fold according to rules:

  1) fold (vselect (build_vector (0, -1, -1, -1)), A, B) -> (movss A, B)
  2) fold (vselect (build_vector (-1, 0, 0, 0)), A, B) -> (movss B, A)

If the vector type of the vselect dag node in input is either MVT::v2i64 or
MVT::v2f64 (and we have SSE2), then try to fold according to rules:

  3) fold (vselect (build_vector (0, -1)), A, B) -> (movsd A, B)
  4) fold (vselect (build_vector (-1, 0)), A, B) -> (movsd B, A)
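
The four folds above can be checked with a small lane-level model. This is only a sketch: `Vec`, `vselect`, and `movScalarLow` are hypothetical helpers written for illustration, not LLVM APIs. It assumes that a nonzero mask lane selects from the first vselect operand, and that the movss/movsd node takes lane 0 from its second operand and the remaining lanes from its first.

```cpp
#include <array>
#include <cassert>
#include <cstddef>

// Hypothetical helper type for illustration; not an LLVM API.
template <typename T, std::size_t N>
using Vec = std::array<T, N>;

// vselect: lane I comes from A when the mask lane is nonzero (-1), else from B.
template <typename T, std::size_t N>
Vec<T, N> vselect(const Vec<int, N> &Mask, const Vec<T, N> &A,
                  const Vec<T, N> &B) {
  Vec<T, N> R{};
  for (std::size_t I = 0; I < N; ++I)
    R[I] = Mask[I] != 0 ? A[I] : B[I];
  return R;
}

// Assumed model of the movss/movsd node: lane 0 comes from the second
// operand, every other lane comes from the first operand.
template <typename T, std::size_t N>
Vec<T, N> movScalarLow(const Vec<T, N> &First, const Vec<T, N> &Second) {
  static_assert(N >= 1, "need at least one lane");
  Vec<T, N> R = First;
  R[0] = Second[0];
  return R;
}
```

Under this model, a mask of (0, -1, -1, -1) yields lane 0 from B and lanes 1 to 3 from A, which is what `movScalarLow(A, B)` produces. Swapping the mask polarity swaps the operands.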

llvm-svn: 199683

450d1661

ObjectiveC driver. reinstate -fno-objc-legacy-dispatch behavior · 15f60cb2
Fariborz Jahanian authored Jan 20, 2014
```
when the deployment target is 10.5. // rdar://15852259

llvm-svn: 199682
```
15f60cb2

Debug info: On ARM ensure that all __TEXT sections come before the · 671af5ca

Adrian Prantl authored Jan 20, 2014

optional DWARF sections, so compiling with -g does not result in
different code being generated for PC-relative loads.

This is reapplying a diet r197922 (__TEXT-only).

llvm-svn: 199681

671af5ca

Revert "Debug info: On ARM ensure that the data sections come before the" · 1a89924d

Adrian Prantl authored Jan 20, 2014

Cut back on the cargo cult. The order of __DATA sections doesn't affect
generated code.

This reverts commit r197922.

llvm-svn: 199680

1a89924d

Adding a bit of documentation that was missed with r198883 (when... · b39a9b8b

Aaron Ballman authored Jan 20, 2014

Adding a bit of documentation that was missed with r198883 (when ParseArgumentsAsUnevaluated was added).

llvm-svn: 199679

b39a9b8b

Allow SMUL_LOHI and UMUL_LOHI to be narrowed to MUL on targets where MUL is... · fb00d5bc

Owen Anderson authored Jan 20, 2014

Allow SMUL_LOHI and UMUL_LOHI to be narrowed to MUL on targets where MUL is Custom rather than Legal. Even if the target is doing some kind of expansion for MUL, it's pretty much guaranteed to be more efficient than whatever it does for SMUL_LOHI or UMUL_LOHI!

llvm-svn: 199678

fb00d5bc

Exposed a declarative way to specify that an attribute can be duplicated when... · b9023ed0

Aaron Ballman authored Jan 20, 2014

Exposed a declarative way to specify that an attribute can be duplicated when merging attributes on a declaration. This replaces some hard-coded functionality from Sema.

llvm-svn: 199677

b9023ed0

Remove some hard-coded specialness for thread-safety attributes from the... · 9a99e0da

Aaron Ballman authored Jan 20, 2014

Remove some hard-coded specialness for thread-safety attributes from the parser, and make it more declarative. If an attribute is allowed to appear on a function definition when late parsed, it can now use the FunctionDefinition attribute subject. It's treated as a FunctionDecl for most purposes, except it also gets exposed on the AttributeList so that it can be used while parsing.

llvm-svn: 199676

9a99e0da

Remove the useless pseudo instructions VDUPfdf and VDUPfqf, replacing them... · 43ccae1b
James Molloy authored Jan 20, 2014
```
Remove the useless pseudo instructions VDUPfdf and VDUPfqf, replacing them with patterns to match VDUPLN.

llvm-svn: 199675
```
43ccae1b
[CMake] LLVMProcessSources.cmake: Add include(CMakeParseArguments). · db2a4af3
NAKAMURA Takumi authored Jan 20, 2014
```
I didn't realize that cmake_parse_arguments() would require explicit inclusion.

llvm-svn: 199674
```
db2a4af3
clang-format: Properly format custom options in protocol buffer definitions. · 929b1db2
Daniel Jasper authored Jan 20, 2014
```
Before:
  option(my_option) = "abc";

After:
  option (my_option) = "abc";

llvm-svn: 199672
```
929b1db2
Formatting cleanups; no functional changes. · 3f5f3e79
Aaron Ballman authored Jan 20, 2014
```
llvm-svn: 199671
```
3f5f3e79
Add a triple. Should fix the 64 bit bots. · 4f5f0b78
Rafael Espindola authored Jan 20, 2014
```
llvm-svn: 199670
```
4f5f0b78
Make the test more strict. · a69024aa
Rafael Espindola authored Jan 20, 2014
```
llvm-svn: 199669
```
a69024aa
Whitespace. · 6515aef0
NAKAMURA Takumi authored Jan 20, 2014
```
llvm-svn: 199667
```
6515aef0

Fixing a typo (turned out to be harmless since the default priority values are... · f28e4993

Aaron Ballman authored Jan 20, 2014

Fixing a typo (turned out to be harmless since the default priority values are the same between the two attributes).

llvm-svn: 199666

f28e4993

Remove virtual methods that were added in 2009 and still had 1 implementation. · 63582d60
Rafael Espindola authored Jan 20, 2014
```
llvm-svn: 199665
```
63582d60

Since the diagnostics engine understands Attr objects, this code is no longer... · 05a63787

Aaron Ballman authored Jan 20, 2014

Since the diagnostics engine understands Attr objects, this code is no longer required -- we can just pass in the attribute directly.

llvm-svn: 199664

05a63787

Making some minor improvements to r199626. · fc1951c5
Aaron Ballman authored Jan 20, 2014
```
llvm-svn: 199663
```
fc1951c5

HasFunctionProto is a more strict version of FunctionLike. Since attribute... · 9e013515

Aaron Ballman authored Jan 20, 2014

HasFunctionProto is a more strict version of FunctionLike. Since attribute subjects are inclusive (passing a single subject test means no subject-related diagnostic will fire), these two subjects should not be combined.

llvm-svn: 199662

9e013515

Fix misched-aa-colored.ll to require asserts (trying again) · 9ff54e1f
Hal Finkel authored Jan 20, 2014
```
Perhaps it needs to be in caps.

llvm-svn: 199661
```
9ff54e1f
clang-format: Leave 2 empty lines in Google's JavaScript style. · a55544a6
Daniel Jasper authored Jan 20, 2014
```
As per the style guide, two lines are required between top-level
elements.

llvm-svn: 199660
```
a55544a6
Fix misched-aa-colored.ll to require asserts. · a6bcadeb
Hal Finkel authored Jan 20, 2014
```
-misched=shuffle is only available in builds with assertions enabled (it is compiled out under NDEBUG). Maybe we should change that.

llvm-svn: 199659
```
a6bcadeb

Update IR when merging slots in stack coloring · cd9569c1

Hal Finkel authored Jan 20, 2014

The way that stack coloring updated MMOs when merging stack slots, while
correct, is suboptimal, and is incompatible with the use of AA during
instruction scheduling. The solution, which involves the use of const_cast (and
more importantly, updating the IR from within an MI-level pass), obviously
requires some explanation:

When the stack coloring pass was originally committed, the code in
ScheduleDAGInstrs::buildSchedGraph tracked possible alias sets by using
GetUnderlyingObject, and all load/store and store/store memory control
dependencies were added between SUs at the object level (where only one
object, that returned by GetUnderlyingObject, was used to identify the object
associated with each MMO). When stack coloring merged stack slots, it would
replace MMOs derived from the remapped alloca with the alloca with which the
remapped alloca was being replaced. Because ScheduleDAGInstrs only used single
objects, and tracked alias sets at the object level, this was a fine solution.

In r169744, (Andy and) I updated the code in ScheduleDAGInstrs to use
GetUnderlyingObjects, and track alias sets using, potentially, multiple
underlying objects for each MMO. This was done, primarily, to provide the
ability to look through PHIs, and provide better scheduling for
induction-variable-dependent loads and stores inside loops. At this point, the
MMO-updating code in stack coloring became suboptimal, because it would clear
the MMOs for (i.e. completely pessimize) all instructions for which r169744
might help in scheduling. Updating the IR directly is the simplest fix for this
(and the one with, by far, the least compile-time impact), but others are
possible (we could give each MMO a small vector of potential values, or make
use of a remapping table, constructed from MFI, inside ScheduleDAGInstrs).

Unfortunately, replacing all MMO values derived from the remapped alloca with
the base replacement alloca fundamentally breaks our ability to use AA during
instruction scheduling (which is critical to performance on some targets). The
reason is that the original MMO might have had an offset (either constant or
dynamic) from the base remapped alloca, and that offset is not present in the
updated MMO. One possible way around this would be to use
GetPointerBaseWithConstantOffset, and update not only the MMO's value, but also
its offset based on the original offset. Unfortunately, this solution would
only handle constant offsets, and for safety (because AA is not completely
restricted to deducing relationships with constant offsets), we would need to
clear all MMOs without constant offsets over the entire function. This would be
an even worse pessimization than the current single-object restriction. Any
other solution would involve passing around a vector of remapped allocas, and
teaching AA to use it, introducing additional complexity and overhead into AA.
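
The offset problem described above can be illustrated with a deliberately simplified model. Everything here is hypothetical: `MemRef`, `noAlias`, and the two remap functions are illustrative names, not LLVM data structures, and the real MMO refers to an IR value rather than storing an object id. The sketch assumes two stack objects that have been merged into one slot and an alias query that compares byte ranges within the same object.

```cpp
#include <cassert>

// Hypothetical model of a memory operand: a stack object id plus a byte range.
// These names are illustrative and are not LLVM data structures.
struct MemRef {
  int Object;
  unsigned Offset;
  unsigned Size;
};

// Stand-in alias query: accesses to the same object are independent only
// when their byte ranges are disjoint. Different objects never alias.
bool noAlias(const MemRef &A, const MemRef &B) {
  if (A.Object != B.Object)
    return true;
  return A.Offset + A.Size <= B.Offset || B.Offset + B.Size <= A.Offset;
}

// Remap that swaps in the replacement object but forgets the original offset.
MemRef remapDroppingOffset(MemRef M, int From, int To) {
  if (M.Object == From) {
    M.Object = To;
    M.Offset = 0;
  }
  return M;
}

// Remap that swaps in the replacement object and keeps the offset.
MemRef remapKeepingOffset(MemRef M, int From, int To) {
  if (M.Object == From)
    M.Object = To;
  return M;
}
```

With object 2 merged into object 1, an access to bytes 8..11 of object 2 and an access to bytes 8..11 of object 1 touch the same memory. In this model the offset-dropping remap makes the query report them as independent, while the offset-preserving remap reports the conflict.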

Instead, when remapping an alloca, we replace all IR uses of that alloca as
well (optionally inserting a bitcast as necessary). This is even more efficient
than the old MMO-updating code in the stack coloring pass (because it removes
the need to call GetUnderlyingObject on all MMO values), removes the
single-object pessimization in the default configuration, and enables the
correct use of AA during instruction scheduling (all without any additional
overhead).

LLVM now no longer miscompiles itself on x86_64 when using -enable-misched
-enable-aa-sched-mi -misched-bottomup=0 -misched-topdown=0 -misched=shuffle!
Fixed PR18497.

Because the alloca replacement is now done at the IR level, unless the MMO
directly refers to the remapped alloca, the change cannot be seen at the MI
level. As a result, there is no good way to fix test/CodeGen/X86/pr14090.ll.

llvm-svn: 199658

cd9569c1

Track multiple stores per object when using AA in ScheduleDAGInstrs · a228a818

Hal Finkel authored Jan 20, 2014

When using AA to break false chain dependencies, we need to track multiple
stores per object in ScheduleDAGInstrs. Historically, we tracked potential alias
chains at the object level, and so all loads of an object would retain
dependencies on any store to that object. With AA, however, this is not
sufficient: non-overlapping stores and loads to the same object all need to be
tested for dependencies separately; we cannot test all loads to an object
against only the last store (see PR18497 for an explicit example).
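
A small model shows why remembering only the last store is not enough once an alias query can rule out dependencies. This is a sketch with hypothetical names (`Access`, `mayOverlap`, and the two `dependsOn...` functions); it is not the ScheduleDAGInstrs code and not the example from PR18497. The alias query is assumed to be a plain byte-range overlap test within one object.

```cpp
#include <cassert>
#include <vector>

// Hypothetical model: an access to bytes [Offset, Offset + Size) of one object.
struct Access {
  unsigned Offset;
  unsigned Size;
};

// Stand-in for an alias query: two accesses to the same object conflict
// only if their byte ranges overlap.
bool mayOverlap(const Access &A, const Access &B) {
  return A.Offset < B.Offset + B.Size && B.Offset < A.Offset + A.Size;
}

// Old bookkeeping: only the most recent store to the object is remembered.
bool dependsOnLastStoreOnly(const std::vector<Access> &Stores,
                            const Access &Load) {
  return !Stores.empty() && mayOverlap(Stores.back(), Load);
}

// New bookkeeping: the load is tested against every earlier store.
bool dependsOnAnyStore(const std::vector<Access> &Stores, const Access &Load) {
  for (const Access &S : Stores)
    if (mayOverlap(S, Load))
      return true;
  return false;
}
```

Given stores to bytes 0..3 and then 4..7, a later load of bytes 0..3 does not overlap the last store, so the last-store-only check finds no dependency even though the load must stay after the first store.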

To mitigate any unwelcome compile-time impact when not using AA, only one store
is kept in the list per object when not using AA.

This, along with a stack coloring change to come shortly, will provide a test
case, fix PR18497 (and allow LLVM to compile itself using -enable-aa-sched-mi
on x86-64).

llvm-svn: 199657

a228a818

[msandr] Access app TLS directly in native exec mode. · e98f9099

Evgeniy Stepanov authored Jan 20, 2014

In optimized hybrid execution we do not use DynamoRIO private loader, which
mangles TLS access, so we can access the application's TLS directly.

Patch by Qin Zhao.

llvm-svn: 199655

e98f9099

[x86] Fix disassembly of MOV16ao16 et al. · caaa2850

David Woodhouse authored Jan 20, 2014

The addition of IC_OPSIZE_ADSIZE in r198759 wasn't quite complete. It
also turns out to have been unnecessary. The disassembler handles the
AdSize prefix for itself, and doesn't care about the difference between
(e.g.) MOV8ao8 and MOV8ao8_16 definitions. So just let them coexist and
don't worry about it.

llvm-svn: 199654

caaa2850

[x86] Fix 16-bit disassembly of JCXZ/JECXZ · 9c74fdb8
David Woodhouse authored Jan 20, 2014
```
llvm-svn: 199653
```
9c74fdb8

[x86] Rename MOVSD/STOSD/LODSD/OUTSD to MOVSL/STOSL/LODSL/OUTSL · 3442f342

David Woodhouse authored Jan 20, 2014

The disassembler has a special case for 'L' vs. 'W' in its heuristic for
checking for 32-bit and 16-bit equivalents. We could expand the heuristic,
but better just to be consistent in using the 'L' suffix.

llvm-svn: 199652

3442f342

[x86] Fix disassembly of callw instruction · 70ced3e0

David Woodhouse authored Jan 20, 2014

Not quite sure why this was marked isAsmParserOnly, but it means that the
disassembler can't see it either.

llvm-svn: 199651

70ced3e0

[x86] Fix 16-bit handling of OpSize bit · 5cf4c675

David Woodhouse authored Jan 20, 2014

When disassembling in 16-bit mode the meaning of the OpSize bit is
inverted. Instructions found in the IC_OPSIZE context will actually
*not* have the 0x66 prefix, and instructions in the IC context will
have the 0x66 prefix. Make use of the existing special-case handling
for the 0x66 prefix being in the wrong place, to cope with this.
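
The inversion can be written down as a pair of small functions. This is an illustrative sketch only: `Mode`, `effectiveOperandBits`, and `usesOpSizeContext` are hypothetical names, not the disassembler's own types or functions, and the commit itself reuses the existing special-case handling instead of a helper like this. It assumes the IC_OPSIZE context holds the forms whose operand size is the non-default one in 32-bit mode.

```cpp
#include <cassert>

// Hypothetical names for illustration; these are not the disassembler's types.
enum class Mode { Bits16, Bits32 };

// Effective operand size in bits for an instruction with a 16/32-bit operand,
// given the decoding mode and whether a 0x66 prefix was seen.
unsigned effectiveOperandBits(Mode M, bool Has66Prefix) {
  const unsigned Default = (M == Mode::Bits16) ? 16 : 32;
  const unsigned Other = (M == Mode::Bits16) ? 32 : 16;
  return Has66Prefix ? Other : Default;
}

// Whether the instruction should be looked up in the OpSize context.
// In 32-bit mode that is when the prefix is present; in 16-bit mode the
// meaning is inverted, so it is when the prefix is absent.
bool usesOpSizeContext(Mode M, bool Has66Prefix) {
  return (M == Mode::Bits16) != Has66Prefix;
}
```

In both modes the OpSize context ends up selecting the 16-bit operand form, which is why the prefix test flips when decoding 16-bit code.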

llvm-svn: 199650

5cf4c675

[x86] Infer disassembler mode from SubtargetInfo feature bits · 7dd21824

David Woodhouse authored Jan 20, 2014

Aside from cleaning up the code, this also adds support for the -code16
environment and actually enables the MODE_16BIT mode that was previously
not accessible.

There is no point adding any testing for 16-bit yet though; basically
nothing will work because we aren't handling the OpSize prefix correctly
for 16-bit mode.

llvm-svn: 199649

7dd21824

[x86] Support i386-*-*-code16 triple for emitting 16-bit code · 71d15eda
David Woodhouse authored Jan 20, 2014
```
llvm-svn: 199648
```
71d15eda

[PM] Wire up the Verifier for the new pass manager and connect it to the · 4d35631a

Chandler Carruth authored Jan 20, 2014

various opt verifier commandline options.

Mostly mechanical wiring of the verifier to the new pass manager.
Exercises one of the more unusual aspects of it -- a pass can be either
a module or function pass interchangeably. If this is ever problematic,
we can make things more constrained, but for things like the verifier
where there is an "obvious" applicability at both levels, it seems
convenient.

This is the next-to-last piece of basic functionality left to make the
opt commandline driving of the new pass manager minimally functional for
testing and further development. There is still a lot to be done there
(notably the factoring into .def files to kill the current boilerplate
code) but it is relatively uninteresting. The only interesting bit left
for minimal functionality is supporting the registration of analyses.
I'm planning on doing that on top of the .def file switch mostly because
the boilerplate for the analyses would be significantly worse.

llvm-svn: 199646

4d35631a

ARM: add tlsldo relocation · e51c8138

Kai Nacke authored Jan 20, 2014

Add support for the symbol(tlsldo) relocation. This is required in order to 
solve PR18554.

Reviewed by R. Golin, A. Korobeynikov.

llvm-svn: 199644

e51c8138

[ARM] Add ACLE enum/wchar size predefines · 0f28f0cf
Bradley Smith authored Jan 20, 2014
```
llvm-svn: 199642
```
0f28f0cf
[CMake] Apply ADDITIONAL_HEADERS introduced in r199639. · bf6d1efb
NAKAMURA Takumi authored Jan 20, 2014
```
llvm-svn: 199640
```
bf6d1efb

[CMake] llvm_process_sources: Introduce a parameter, ADDITIONAL_HEADERS. · 6acf320a

NAKAMURA Takumi authored Jan 20, 2014

ADDITIONAL_HEADERS is intended to list header files as a hint for IDEs.

For example:
  add_llvm_library(LLVMSupport
    Host.cpp
    ADDITIONAL_HEADERS
      Unix/Host.inc
      Windows/Host.inc
    )

llvm-svn: 199639

6acf320a

[ARM] Do not generate Tag_DIV_use=AllowDIVExt when hardware div is... · 10e76a4e

Artyom Skrobov authored Jan 20, 2014

[ARM] Do not generate Tag_DIV_use=AllowDIVExt when hardware div is non-optional: it should have the default value of AllowDIVIfExists

llvm-svn: 199638

10e76a4e