Commits · a4451d88ee456304c26d552749aea6a7f5154bde · Lorenzo Albano / LLVM bpEVL

Jan 18, 2020

Consolidate internal denormal flushing controls · a4451d88

Matt Arsenault authored Nov 01, 2019

Currently there are 4 different mechanisms for controlling denormal
flushing behavior, and about as many equivalent frontend controls.

- AMDGPU uses the fp32-denormals and fp64-f16-denormals subtarget features
- NVPTX uses the nvptx-f32ftz attribute
- ARM directly uses the denormal-fp-math attribute
- Other targets indirectly use denormal-fp-math in one DAGCombine
- cl-denorms-are-zero has a corresponding denorms-are-zero attribute

AMDGPU wants a distinct control for f32 flushing from f16/f64, and as
far as I can tell the same is true for NVPTX (based on the attribute
name).

Work on consolidating these into the denormal-fp-math attribute, and a
new type specific denormal-fp-math-f32 variant. Only ARM seems to
support the two different flush modes, so this is overkill for the
other use cases. Ideally we would error on the unsupported
positive-zero mode on other targets from somewhere.

Move the logic for selecting the flush mode into the compiler driver,
instead of handling it in cc1. denormal-fp-math/denormal-fp-math-f32
are now both cc1 flags, but denormal-fp-math-f32 is not yet exposed as
a user flag.

-cl-denorms-are-zero, -fcuda-flush-denormals-to-zero and
-fno-cuda-flush-denormals-to-zero will be mapped to
-fp-denormal-math-f32=ieee or preserve-sign rather than the old
attributes.

Stop emitting the denorms-are-zero attribute for the OpenCL flag. It
has no in-tree users. The meaning would also be target dependent, such
as the AMDGPU choice to treat this as only meaning allow flushing of
f32 and not f16 or f64. The naming is also potentially confusing,
since DAZ in other contexts refers to instructions implicitly treating
input denormals as zero, not necessarily flushing output denormals to
zero.

This also does not attempt to change the behavior for the current
attribute. The LangRef now states that the default is ieee behavior,
but this is inaccurate for the current implementation. The clang
handling is slightly hacky to avoid touching the existing
denormal-fp-math uses. Fixing this will be left for a future patch.

AMDGPU is still using the subtarget feature to control the denormal
mode, but the new attribute are now emitted. A future change will
switch this and remove the subtarget features.

a4451d88

Remove redundant CXXScopeSpec from TemplateIdAnnotation. · a42fd84c

Richard Smith authored Jan 17, 2020

A TemplateIdAnnotation represents only a template-id, not a
nested-name-specifier plus a template-id. Don't make a redundant copy of
the CXXScopeSpec and store it on the template-id annotation.

This slightly improves error recovery by more properly handling the case
where we would form an invalid CXXScopeSpec while parsing a typename
specifier, instead of accidentally putting the token stream into a
broken "annot_template_id with a scope specifier, but with no preceding
annot_cxxscope token" state.

a42fd84c

Jan 17, 2020

[xray] Allow instrumenting only function entry and/or only function exit · 97ba4830

Ian Levesque authored Jan 17, 2020

Extend -fxray-instrumentation-bundle to split function-entry and
function-exit into two separate options, so that it is possible to
instrument only function entry or only function exit.  For use cases
that only care about one or the other this will save significant overhead
and code size.

Differential Revision: https://reviews.llvm.org/D72890

97ba4830

[clang][xray] Add -fxray-ignore-loops option · 1d62be24

Ian Levesque authored Jan 17, 2020

XRay allows tuning by minimum function size, but also always instruments
functions with loops in them. If the minimum function size is set to a
large value the loop instrumention ends up causing most functions to be
instrumented anyway. This adds a new flag, -fxray-ignore-loops, to disable
the loop detection logic.

Differential Revision: https://reviews.llvm.org/D72873

1d62be24

Move the sysroot attribute from DIModule to DICompileUnit · 7b30370e

Adrian Prantl authored Jan 14, 2020

[this re-applies c0176916
 with the correct commit message and phabricator link]

This addresses point 1 of PR44213.
https://bugs.llvm.org/show_bug.cgi?id=44213

The DW_AT_LLVM_sysroot attribute is used for Clang module debug info,
to allow LLDB to import a Clang module from source. Currently it is
part of each DW_TAG_module, however, it is the same for all modules in
a compile unit. It is more efficient and less ambiguous to store it
once in the DW_TAG_compile_unit.

This should have no effect on DWARF consumers other than LLDB.

Differential Revision: https://reviews.llvm.org/D71732

7b30370e

Revert "Rename DW_AT_LLVM_isysroot to DW_AT_LLVM_sysroot" · c17aee67
Adrian Prantl authored Jan 17, 2020
```
This reverts commit 12e47947.

I accidentally landed this patch with the wrong commit message ...
```
c17aee67
[OPENMP]Improve debug locations in OpenMP regions. · c33ba8c1
Alexey Bataev authored Jan 17, 2020
```
Emit more precise debug locations for the OpenMP outlined regions.
```
c33ba8c1
Update clang test. · 90bdb037
Alina Sbirlea authored Jan 17, 2020

90bdb037
[InterfaceStubs][test] Add -triple to clang/test/InterfaceStubs/externstatic.c to make it robust · d0038012
Fangrui Song authored Jan 17, 2020
```
llvm-nm on Linux prints 0 line while llvm-nm on macOS prints 1 line.
```
d0038012

[clang] Set function attributes on SEH filter functions correctly. · ecfd6d3e

Sanne Wouda authored Dec 13, 2019

Summary:
When compiling with -munwind-tables, the SEH filter funclet needs the uwtable
function attribute, which gets automatically added if we use
SetInternalFunctionAttributes.  The filter funclet is internal so this seems
appropriate.

Reviewers: rnk

Subscribers: cfe-commits

Tags: #clang

Differential Revision: https://reviews.llvm.org/D72786

ecfd6d3e

Reland "[llvm-nm] Don't report "no symbols" error for files that contain symbols" · a9f0025a
Fangrui Song authored Jan 17, 2020

a9f0025a
[test] Fix tests after D52810 · 932b5d6f
Fangrui Song authored Jan 17, 2020

932b5d6f

Rename DW_AT_LLVM_isysroot to DW_AT_LLVM_sysroot · 12e47947

Adrian Prantl authored Jan 14, 2020

This is a purely cosmetic change that is NFC in terms of the binary
output. I bugs me that I called the attribute DW_AT_LLVM_isysroot
since the "i" is an artifact of GCC command line option syntax
(-isysroot is in the category of -i options) and doesn't carry any
useful information otherwise.

This attribute only appears in Clang module debug info.

Differential Revision: https://reviews.llvm.org/D71722

12e47947

Add __warn_memset_zero_len builtin as a workaround for glibc issue · d2934179

serge-sans-paille authored Jan 16, 2020

Glibc issue: https://sourceware.org/bugzilla/show_bug.cgi?id=25399
The fix consist in considering the missing function as a builtin lowered to a nop.

Differential Revision: https://reviews.llvm.org/D72869

d2934179

Reapply Allow system header to provide their own implementation of some builtin · d437fba8

serge-sans-paille authored Jan 16, 2020

This reverts commit 3d210ed3.

See https://reviews.llvm.org/D71082 for the patch and discussion that make it
possible to reapply this patch.

d437fba8

Don't dump IR output from this test to stdout. · 01a6cd47
Richard Smith authored Jan 16, 2020

01a6cd47
Add extra test file forgotten in 45d70806 . · b78e8e0d
Richard Smith authored Jan 16, 2020

b78e8e0d

[modules] Do not cache invalid state for modules that we attempted to load. · 83f4c3af

Volodymyr Sapsai authored Jan 16, 2020

Partially reverts 0a2be46c as it turned
out to cause redundant module rebuilds in multi-process incremental builds.
When a module was getting out of date, all compilation processes started at the
same time were marking it as `ToBuild`. So each process was building the same
module instead of checking if it was built by someone else and using that
result. In addition to the work duplication, contention on the same .pcm file
wasn't making builds faster.

Note that for a single-process build this change would cause redundant module
reads and validations. But reading a module is faster than building it and
multi-process builds are more common than single-process. So I'm willing to
make such a trade-off.

rdar://problem/54395127

Reviewed By: dexonsmith

Differential Revision: https://reviews.llvm.org/D72860

83f4c3af

[OPENMP]Do not emit RTTI descriptor for NVPTX devices. · 25b542c6

Alexey Bataev authored Jan 16, 2020

Need to disable emission of RTTI descriptors for NVPTX devices to be
able to use dynamic classes without unresolved symbols at link stage.

25b542c6

AMDGPU: Update clang test · 9b549f26
Matt Arsenault authored Jan 16, 2020

9b549f26

Jan 16, 2020

[Hexagon] Update autogenerated intrinsic info in clang · 6f3effbb
Krzysztof Parzyszek authored Jan 16, 2020
```
In addition to that, use target features to validate intrinsic
availability on a given target.
```
6f3effbb
[Hexagon] Add preprocessor test for hexagonv66 · 7f5f6ff5
Krzysztof Parzyszek authored Jan 16, 2020

7f5f6ff5

[HIP][AMDGPU] expand printf when compiling HIP to AMDGPU · ed181efa

Sameer Sahasrabuddhe authored Aug 22, 2019

Summary:
This change implements the expansion in two parts:
- Add a utility function emitAMDGPUPrintfCall() in LLVM.
- Invoke the above function from Clang CodeGen, when processing a HIP
  program for the AMDGPU target.

The printf expansion has undefined behaviour if the format string is
not a compile-time constant. As a sufficient condition, the HIP
ToolChain now emits -Werror=format-nonliteral.

Reviewed By: arsenm

Differential Revision: https://reviews.llvm.org/D71365

ed181efa

PR42694 Support explicit(bool) in older language modes as an extension. · 45d70806

Richard Smith authored Jan 15, 2020

This needs somewhat careful disambiguation, as C++2a explicit(bool) is a
breaking change. We only enable it in cases where the source construct
could not possibly be anything else.

45d70806

Fix pack deduction to only deduce the arity of packs that are actually · e8f198dd

Richard Smith authored Jan 15, 2020

expanded by the deduced pack.

We recently started also deducing the arity of separately-expanded packs
that are merely mentioned within the pack in question, which is
incorrect.

e8f198dd

Revert "Further implement CWG 2292" · 44560762

Amy Huang authored Jan 15, 2020

This reverts commit ee0f1f1e because it
causes an error on valid code.
See https://reviews.llvm.org/rGee0f1f1edc3ec0d4e698d50cc3180217448802b7.

44560762

[OPENMP]Use regular processing of vtable used when TU is a prefix. · b841b9e9

Alexey Bataev authored Jan 15, 2020

If current kind of the translation unit is TU_Prefix and it is not
complete, cannot decide what to do with virtual members/table at that
time, need to delay it to later stages.

b841b9e9

Revert "Allow system header to provide their own implementation of some builtin" · 3d210ed3

Amy Huang authored Jan 15, 2020

This reverts commit 921f871a because it
causes libc++ code to trigger __warn_memset_zero_len.

See https://reviews.llvm.org/D71082.

3d210ed3

Jan 15, 2020

Revert "[OPENMP]Do not use RTTI by default for NVPTX devices." · 6b29aa21
Alexey Bataev authored Jan 15, 2020
```
This reverts commit 23058f9d. It breaks
builds of cuda code somehow in some cases.
```
6b29aa21

PR17164: Change clang's default behavior from -flax-vector-conversions=all to... · b72a8c65

Richard Smith authored May 08, 2019

PR17164: Change clang's default behavior from -flax-vector-conversions=all to -flax-vector-conversions=integer.

Summary:
See proposal on cfe-dev:
http://lists.llvm.org/pipermail/cfe-dev/2019-April/062030.html

Reviewers: SjoerdMeijer, eli.friedman

Subscribers: kristof.beyls, cfe-commits

Tags: #clang

Differential Revision: https://reviews.llvm.org/D67678

b72a8c65

Replace CLANG_SPAWN_CC1 env var with a driver mode flag · 8e5018e9

Nico Weber authored Jan 15, 2020

Flags are clang's default UI is flags.

We can have an env var in addition to that, but in D69825 nobody has yet
mentioned why this needs an env var, so omit it for now.  If someone
needs to set the flag via env var, the existing CCC_OVERRIDE_OPTIONS
mechanism works for it (set CCC_OVERRIDE_OPTIONS=+-fno-integrated-cc1
for example).

Also mention the cc1-in-process change in the release notes.

Also spruce up the test a bit so it actually tests something :)

Differential Revision: https://reviews.llvm.org/D72769

8e5018e9

[ARM][MVE][Intrinsics] Add VMINAQ, VMINNMAQ, VMAXAQ, VMAXNMAQ intrinsics. · da9d57d2

Mark Murray authored Jan 13, 2020

Summary: Add VMINAQ, VMINNMAQ, VMAXAQ, VMAXNMAQ intrinsics and unit tests.

Reviewers: simon_tatham, miyuki, dmgreen

Subscribers: kristof.beyls, hiraditya, cfe-commits, llvm-commits

Tags: #clang, #llvm

Differential Revision: https://reviews.llvm.org/D72761

da9d57d2

Fix bot by adjusting wildcard matching · 76b92cc7

Teresa Johnson authored Jan 15, 2020

I noticed one bot failure due to
24a00ef2 because the wildcard matching
was not working as intended, fixed it to act similar to other checks of
CGSCCToFunctionPassAdaptor.

76b92cc7

Restore "[ThinLTO] Add additional ThinLTO pipeline testing with new PM" · 24a00ef2

Teresa Johnson authored Jan 13, 2020

This restores 2af97be8 (reverted at
6288f86e), with all the fixes I had
applied at the time, along with a new fix for non-determinism in the
ordering of a couple of passes due to being accessed as parameters on
the same call.

I've also added --dump-input=fail to the new tests so I can more
thoroughly fix any additional failures.

24a00ef2

[clang] New __attribute__((__clang_arm_mve_strict_polymorphism)). · ada01d1b

Simon Tatham authored Jan 15, 2020

This is applied to the vector types defined in <arm_mve.h> for use
with the intrinsics for the ARM MVE vector architecture.

Its purpose is to inhibit lax vector conversions, but only in the
context of overload resolution of the MVE polymorphic intrinsic
functions. This solves an ambiguity problem with polymorphic MVE
intrinsics that take a vector and a scalar argument: the scalar
argument can often have the wrong integer type due to default integer
promotions or unsuffixed literals, and therefore, the type of the
vector argument should be considered trustworthy when resolving MVE
polymorphism.

As part of the same change, I've added the new attribute to the
declarations generated by the MveEmitter Tablegen backend (and
corrected a namespace issue with the other attribute while I was
there).

Reviewers: aaron.ballman, dmgreen

Reviewed By: aaron.ballman

Subscribers: kristof.beyls, JDevlieghere, cfe-commits

Tags: #clang

Differential Revision: https://reviews.llvm.org/D72518

ada01d1b

Further implement CWG 2292 · ee0f1f1e

Soumi Manna authored Jan 15, 2020

The core issue is that simple-template-id is ambiguous between class-name
and type-name. This fixes PR43966.

ee0f1f1e

[Lexer] Allow UCN for dollar symbol '\u0024' in identifiers when using... · a90ea386

Scott Egerton authored Jan 15, 2020

[Lexer] Allow UCN for dollar symbol '\u0024' in identifiers when using -fdollars-in-identifiers flag.

Summary:
Previously, the -fdollars-in-identifiers flag allows the '$' symbol to be used
in an identifier but the universal character name equivalent '\u0024' is not
allowed.
This patch changes this, so that \u0024 is valid in identifiers.

Reviewers: rsmith, jordan_rose

Reviewed By: rsmith

Subscribers: dexonsmith, simoncook, cfe-commits

Tags: #clang

Differential Revision: https://reviews.llvm.org/D71758

a90ea386

Revert "[RISCV] Add Clang frontend support for Bitmanip extension" · cbe681bd
Scott Egerton authored Jan 15, 2020
```
This reverts commit 57cf6ee9.
```
cbe681bd
Fix up ms-pch-macro.c test to pass on non-Windows · c42116cc
Reid Kleckner authored Jan 14, 2020

c42116cc

[Driver][X86] Add -malign-branch* and -mbranches-within-32B-boundaries · 5ca24d09

Fangrui Song authored Jan 09, 2020

These driver options perform some checking and delegate to MC options -x86-align-branch* and -x86-branches-within-32B-boundaries.

Reviewed By: skan

Differential Revision: https://reviews.llvm.org/D72463

5ca24d09