Commits · d9e39d53b6ebe528d0c5728a9dd95e227856bfee · Lorenzo Albano / LLVM bpEVL

Jun 12, 2015

[ARM] Disabling vfp4 should disable fp16 · d9e39d53

John Brawn authored Jun 12, 2015

ARMTargetParser::getFPUFeatures should disable fp16 whenever it
disables vfp4, as otherwise something like -mcpu=cortex-a7 -mfpu=none
leaves us with fp16 enabled (though the only effect that will have is
a wrong build attribute).

Differential Revision: http://reviews.llvm.org/D10397

llvm-svn: 239599

d9e39d53

Jun 10, 2015

Add explicit -mtriple=arm-unknown to... · e6c1b567

NAKAMURA Takumi authored Jun 09, 2015

Add explicit -mtriple=arm-unknown to llvm/test/CodeGen/ARM/disable-tail-calls.ll, to satisfy *-win32.

llvm-svn: 239442

e6c1b567

Jun 09, 2015

Remove DisableTailCalls from TargetOptions and the code in resetTargetOptions · d9699bc7

Akira Hatanaka authored Jun 09, 2015

that was resetting it.

Remove the uses of DisableTailCalls in subclasses of TargetLowering and use
the value of function attribute "disable-tail-calls" instead. Also,
unconditionally add pass TailCallElim to the pipeline and check the function
attribute at the start of runOnFunction to disable the pass on a per-function
basis. 
 
This is part of the work to remove TargetMachine::resetTargetOptions, and since
DisableTailCalls was the last non-fast-math option that was being reset in that
function, we should be able to remove the function entirely after the work to
propagate IR-level fast-math flags to DAG nodes is completed.

Out-of-tree users should remove the uses of DisableTailCalls and make changes
to attach attribute "disable-tail-calls"="true" or "false" to the functions in
the IR.

rdar://problem/13752163

Differential Revision: http://reviews.llvm.org/D10099

llvm-svn: 239427

d9699bc7

Jun 08, 2015

[ARM] Pass a callback to FunctionPass constructors to enable skipping execution · 4a61619f

Akira Hatanaka authored Jun 08, 2015

on a per-function basis.

Previously some of the passes were conditionally added to ARM's pass pipeline
based on the target machine's subtarget. This patch makes changes to add those
passes unconditionally and execute them conditonally based on the predicate
functor passed to the pass constructors. This enables running different sets of
passes for different functions in the module.

rdar://problem/20542263

Differential Revision: http://reviews.llvm.org/D8717

llvm-svn: 239325

4a61619f

Jun 05, 2015

[ARM] Add support for -sp- FPUs and FPU none to TargetParser · 985c04e8

John Brawn authored Jun 05, 2015

These are added mainly for the benefit of clang, but this also means that they
are now allowed in .fpu directives and we emit the correct .fpu directive when
single-precision-only is used.

Differential Revision: http://reviews.llvm.org/D10238

llvm-svn: 239151

985c04e8

Jun 03, 2015

ARM: Thumb2 LDRD/STRD supports independent input/output regs · 125c9f5f

Matthias Braun authored Jun 03, 2015

The existing code would unnecessarily break LDRD/STRD apart with
non-adjacent registers, on thumb2 this is not necessary.

Ideally on thumb2 we shouldn't match for ldrd/strd pre-regalloc anymore
as there is not reason to set register hints anymore, changing that is
something for a future patch however.

Differential Revision: http://reviews.llvm.org/D9694

Recommiting after the revert in r238821, the buildbot still failed with
the patch removed so there seems to be another reason for the breakage.

llvm-svn: 238935

125c9f5f

Jun 02, 2015

Revert "ARM: Thumb2 LDRD/STRD supports independent input/output regs" · 3a7bec86

Renato Golin authored Jun 02, 2015

This reverts commit r238795, as it broke the Thumb2 self-hosting buildbot.

Since self-hosting issues with Clang are hard to investigate, I'm taking the
liberty to revert now, so we can investigate it offline.

llvm-svn: 238821

3a7bec86

ARM: Thumb2 LDRD/STRD supports independent input/output regs · e20dc1cd

Matthias Braun authored Jun 01, 2015

The existing code would unnecessarily break LDRD/STRD apart with
non-adjacent registers, on thumb2 this is not necessary.

Ideally on thumb2 we shouldn't match for ldrd/strd pre-regalloc anymore
as there is not reason to set register hints anymore, changing that is
something for a future patch however.

Differential Revision: http://reviews.llvm.org/D9694

llvm-svn: 238795

e20dc1cd

Jun 01, 2015
- Re-commit of r238201 with fix for building with shared libraries. · 85fd06d3
  Luke Cheeseman authored Jun 01, 2015
```
llvm-svn: 238739
```
  85fd06d3
May 31, 2015

ARM: recommit r237590: allow jump tables to be placed as constant islands. · a603c407

Tim Northover authored May 31, 2015

The original version didn't properly account for the base register
being modified before the final jump, so caused miscompilations in
Chromium and LLVM. I've fixed this and tested with an LLVM self-host
(I don't have the means to build & test Chromium).

The general idea remains the same: in pathological cases jump tables
can be too far away from the instructions referencing them (like other
constants) so they need to be movable.

Should fix PR23627.

llvm-svn: 238680

a603c407

May 26, 2015
- Revert "Re-commit changes in r237579 with fix for bug breaking windows builds." · bfecc066
  Diego Novillo authored May 26, 2015
```
This reverts commit r238201 to fix linking problems in x86 Linux
http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20150525/278413.html

llvm-svn: 238223
```
  bfecc066
- Re-commit changes in r237579 with fix for bug breaking windows builds. · a5d053d6
  Luke Cheeseman authored May 26, 2015
```
llvm-svn: 238201
```
  a5d053d6
May 23, 2015

Stop resetting NoFramePointerElim in TargetMachine::resetTargetOptions. · ddf76aa3

Akira Hatanaka authored May 23, 2015

This is part of the work to remove TargetMachine::resetTargetOptions.

In this patch, instead of updating global variable NoFramePointerElim in
resetTargetOptions, its use in DisableFramePointerElim is replaced with a call
to TargetFrameLowering::noFramePointerElim. This function determines on a
per-function basis if frame pointer elimination should be disabled.

There is no change in functionality except that cl:opt option "disable-fp-elim"
can now override function attribute "no-frame-pointer-elim".

llvm-svn: 238080

ddf76aa3

May 22, 2015
- Revert r237590, "ARM: allow jump tables to be placed as constant islands." · 7e814d10
  Peter Collingbourne authored May 21, 2015
```
Caused a miscompile of the Android port of Chromium, details
forthcoming.

llvm-svn: 237972
```
  7e814d10
May 20, 2015

[Target/ARM] Only enable OptimizeBarrierPass at -O1 and above. · 141b2891

Davide Italiano authored May 20, 2015

Ideally this is going to be and LLVM IR pass (shared, among others
with AArch64), but for the time being just enable it if consumers
ask us for optimization and not unconditionally.

Discussed with Tim Northover on IRC.

llvm-svn: 237837

141b2891

May 18, 2015

ARM: allow jump tables to be placed as constant islands. · 12c41af0

Tim Northover authored May 18, 2015

Previously, they were forced to immediately follow the actual branch
instruction. This was usually OK (the LEAs actually accessing them got emitted
nearby, and weren't usually separated much afterwards). Unfortunately, a
sufficiently nasty phi elimination dumps many instructions right before the
basic block terminator, and this can increase the range too much.

This patch frees them up to be placed as usual by the constant islands pass,
and consequently has to slightly modify the form of TBB/TBH tables to refer to
a PC-relative label at the final jump. The other jump table formats were
already position-independent.

rdar://20813304

llvm-svn: 237590

12c41af0

Revert r237579, as it broke windows buildbots · 6cb23465
Oliver Stannard authored May 18, 2015
```
llvm-svn: 237583
```
6cb23465

[LLVM - ARM/AArch64] Add ACLE special register intrinsics · 0c553afe

Oliver Stannard authored May 18, 2015

This patch implements LLVM support for the ACLE special register intrinsics in
section 10.1, __arm_{w,r}sr{,p,64}.

This patch is intended to lower the read/write_register instrinsics, used to
implement the special register intrinsics in the clang patch for special
register intrinsics (see http://reviews.llvm.org/D9697), to ARM specific
instructions MRC,MCR,MSR etc. to allow reading an writing of coprocessor
registers in AArch32 and AArch64. This is done by inspecting the register
string passed to the intrinsic and then lowering to the appropriate
instruction.

Patch by Luke Cheeseman.

Differential Revision: http://reviews.llvm.org/D9699

llvm-svn: 237579

0c553afe

May 14, 2015

[CodeGen] Use standard -not gnueabi- naming for f16 libcalls on Darwin. · 6402ad27

Ahmed Bougacha authored May 14, 2015

Other targets probably should as well.  Since r237161, compiler-rt has
both, but I don't see why anything other than gnueabi would use a
gnueabi naming scheme.

llvm-svn: 237324

6402ad27

May 13, 2015

CodeGen: ignore DEBUG_VALUE nodes in KILL tagging · ee13fbe8

Saleem Abdulrasool authored May 12, 2015

DEBUG_VALUE nodes do not take part in code generation.  Ignore them when
performing KILL updates.  Addresses PR23486.

llvm-svn: 237211

ee13fbe8

May 12, 2015

Changed renaming of local symbols by inserting a dot vefore the numeric suffix. · d79dfcbc
Sunil Srivastava authored May 12, 2015
```
One code change and several test changes to match that
details in http://reviews.llvm.org/D9481

llvm-svn: 237150
```
d79dfcbc

[ARM] Use AEABI aligned function variants · 70605f7d

John Brawn authored May 12, 2015

AEABI defines aligned variants of memcpy etc. that can be faster than
the default version due to not having to do alignment checks. When
emitting target code for these functions make use of these aligned
variants if possible. Also convert memset to memclr if possible.

Differential Revision: http://reviews.llvm.org/D8060

llvm-svn: 237127

70605f7d

Migrate existing backends that care about software floating point · 824f42f2

Eric Christopher authored May 12, 2015

to use the information in the module rather than TargetOptions.

We've had and clang has used the use-soft-float attribute for some
time now so have the backends set a subtarget feature based on
a particular function now that subtargets are created based on
functions and function attributes.

For the one middle end soft float check go ahead and create
an overloadable TargetLowering::useSoftFloat function that
just checks the TargetSubtargetInfo in all cases.

Also remove the command line option that hard codes whether or
not soft-float is set by using the attribute for all of the
target specific test cases - for the generic just go ahead and
add the attribute in the one case that showed up.

llvm-svn: 237079

824f42f2

May 09, 2015

[Fast-ISel] Don't mark the first use of a remat constant as killed. · d54fb899

Pete Cooper authored May 09, 2015

When emitting something like 'add x, 1000' if we remat the 1000 then we should be able to
mark the vreg containing 1000 as killed. Given that we go bottom up in fast-isel, a later
use of 1000 will be higher up in the BB and won't kill it, or be impacted by the lower kill.

However, rematerialised constant expressions aren't generated bottom up. The local value save area
grows downwards. This means that if you remat 2 constant expressions which both use 1000 then the
first will kill it, then the second, which is *lower* in the BB will read a killed register.

This is the case in the attached test where the 2 GEPs both need to generate 'add x, 6680' for the constant offset.

Note that this commit only makes kill flag generation conservative. There's nothing else obviously wrong with
the local value save area growing downwards, and in fact it needs to for handling arbitrarily complex constant expressions.

However, it would be nice if there was a solution which would let us generate more accurate kill flags, or just kill flags completely.

llvm-svn: 236922

d54fb899

ScheduleDAGInstrs: In functions with tail calls PseudoSourceValues are not... · f54b73d6

Arnold Schwaighofer authored May 08, 2015

ScheduleDAGInstrs: In functions with tail calls PseudoSourceValues are not non-aliasing distinct objects

The code that builds the dependence graph assumes that two PseudoSourceValues
don't alias. In a tail calling function two FixedStackObjects might refer to the
same location. Worse 'immutable' fixed stack objects like function arguments are
not immutable and will be clobbered.

Change this so that a load from a FixedStackObject is not invariant in a tail
calling function and don't return a PseudoSourceValue for an instruction in tail
calling functions when building the dependence graph so that we handle function
arguments conservatively.

Fix for PR23459.

rdar://20740035

llvm-svn: 236916

f54b73d6

May 08, 2015

[Fast-ISel] Clear kill flags on registers replaced by updateValueMap. · e4bb07ec

Pete Cooper authored May 08, 2015

When selecting an extract instruction, we don't actually generate code but instead work out which register we are reading, and rewrite uses of the extract def to the source register. This is done via updateValueMap,.

However, its possible that the source register we are rewriting *to* to also have uses. If those uses are after a kill of the value we are rewriting *from* then we have uses after a kill and the verifier fails.

This code checks for the case where the to register is also used, and if so it clears all kill on the from register. This is conservative, but better that always clearing kills on the from register.

llvm-svn: 236897

e4bb07ec

May 07, 2015

Clear kill flags in tail duplication. · ba593ad3

Pete Cooper authored May 07, 2015

If we duplicate an instruction then we must also clear kill flags on any uses we rewrite.
Otherwise we might be killing a register which was used in other BBs.

For example, here the entry BB ended up with these instructions, the ADD having been tail duplicated.

	%vreg24<def> = t2ADDri %vreg10<kill>, 1, pred:14, pred:%noreg, opt:%noreg; GPRnopc:%vreg24 rGPR:%vreg10
	%vreg22<def> = COPY %vreg10; GPR:%vreg22 rGPR:%vreg10

	The copy here is inserted after the add and so needs vreg10 to be live.

llvm-svn: 236782

ba593ad3

Handle dead defs in the if converter. · 27483915

Pete Cooper authored May 06, 2015

We had code such as this:
  r2 = ...
  t2Bcc

label1:
  ldr ... r2

label2;
  return r2<dead, def>

The if converter was transforming this to
   r2<def> = ...
   return [pred] r2<dead,def>
   ldr <r2, kill>
   return

which fails the machine verifier because the ldr now reads from a dead def.

The fix here detects dead defs in stepForward and passes them back to the caller in the clobbers list.  The caller then clears the dead flag from the def is the value is live.

llvm-svn: 236660

27483915

Fix incorrect kill flags in fastisel. · 54085cdc

Pete Cooper authored May 06, 2015

If called twice in the same BB on the same constant, FastISel::fastEmit_ri_ was marking the materialized vreg as killed on each use, instead of only the last use.

Change this to only mark the last use as killed by making earlier uses check if the vreg is already used elsewhere.

llvm-svn: 236650

54085cdc

May 06, 2015

[ARM] Fast-Isel was incorrectly selecting <2 x double> adds. · d927c6ea

Pete Cooper authored May 06, 2015

With neon enabled, we reach SelectBinaryFPOp and are able to get registers for a <2 x double> add.

However, we shouldn't actually attempt arithmetic on it as ARMIselLowering says "v2f64 is legal so that QR subregs can be extracted as f64 elements, but neither Neon nor VFP support any arithmetic operations on it."

This commit disables SelectBinaryFPOp for any vector types. There's already a FIXME to try handle neon. Doing so would require fixing this conditional which isn't safe for vectors 'VT == MVT::f64 || VT == MVT::i64'

llvm-svn: 236609

d927c6ea

[ARM] generate VMAXNM/VMINNM for a compare followed by a select, in safe math mode too · 3f8eae92
Artyom Skrobov authored May 06, 2015
```
llvm-svn: 236590
```
3f8eae92

[ARM][FastISel] Use TST #1 instead of CMP #0 for select. · e8d0c4cc

Ahmed Bougacha authored May 06, 2015

Since r234249, i1 are sext instead of zext; because of that, doing
"CMP rN, #0; IT EQ/NE" isn't correct anymore.

"TST #1" is the conservatively correct alternative - the tradeoff being
that it doesn't have a 16-bit encoding -, so use that instead.

llvm-svn: 236569

e8d0c4cc

Fix IfConverter to handle regmask machine operands. · ce9ad757

Pete Cooper authored May 05, 2015

Note, this is a recommit of r236515 after fixing an error in r236514. The buildbot ran fast enough that it picked up r236514 prior to r236515 and threw an error. r236515 itself ran 'make check' without errors.

Original commit message follows:

A regmask (typically seen on a call) clobbers the set of registers it lists. The IfConverter, in UpdatePredRedefs, was handling register defs, but not regmasks.

These are slightly different to a def in that we need to add both an implicit use and def to appease the machine verifier. Otherwise, uses after the if converted call could think they are reading an undefined register.

Reviewed by Matthias Braun and Quentin Colombet.

llvm-svn: 236550

ce9ad757

May 05, 2015

Revert "Fix IfConverter to handle regmask machine operands." · 05b84d41

Pete Cooper authored May 05, 2015

This reverts commit b27413cbfd78d959c18e713bfa271fb69e6b3303 (ie r236515).

This is to get the bots green while i investigate the failures.

llvm-svn: 236517

05b84d41

Fix IfConverter to handle regmask machine operands. · 6ebc2077

Pete Cooper authored May 05, 2015

A regmask (typically seen on a call) clobbers the set of registers it lists. The IfConverter, in UpdatePredRedefs, was handling register defs, but not regmasks.

Reviewed by Matthias Braun and Quentin Colombet.

llvm-svn: 236515

6ebc2077

May 01, 2015

ARM: Align functions containing Thumb-2 jump tables to 4 bytes. · d27d3a15

Peter Collingbourne authored May 01, 2015

Functions with jump tables need an alignment of 4 because they use the ADR
instruction, which aligns the PC to 4 bytes before adding an offset.

Differential Revision: http://reviews.llvm.org/D9424

llvm-svn: 236327

d27d3a15

[ARM][TEST] Strengthen test against smarter reg alloc. · 65b5b01d
Quentin Colombet authored May 01, 2015
```
Follow-up of r236247.

rdar://problem/20770899

llvm-svn: 236296
```
65b5b01d

[ARM] optimizeSelect should clear kill flags. · 2127b00c

Pete Cooper authored Apr 30, 2015

If we move an instruction from one block down to a MOVC and predicate it,
then the original instruction could be moved in to a loop.  In this case,
its invalid for any kill flags to remain on there.

Fails with -verfy-machineinstrs.

rdar://problem/20752113

llvm-svn: 236290

2127b00c

Commute the internal flag on MachineOperands. · 451755d3

Pete Cooper authored Apr 30, 2015

When commuting a thumb instruction in the size reduction pass, thumb
instructions are represented as a bundle and so some operands may be marked
as internal.  The internal flag has to move with the operand when commuting.

This test is sensitive to register allocation so can't specifically check that
this error was happening, but so long as it continues to pass with -verify then
hopefully its still ok.

rdar://problem/20752113

llvm-svn: 236282

451755d3

Don't always apply kill flag in thumb2 ABS pseudo expansion. · 5111881c

Pete Cooper authored Apr 30, 2015

The expansion for t2ABS was always setting the kill flag on the rsb instruction.
It should instead only be set on rsb if it was set on the original ABS instruction.

rdar://problem/20752113

llvm-svn: 236272

5111881c