Commits · 715528be0bffc028da31ed647f9415269656fc46 · Roger Ferrer / llvm-epi-0.8

Aug 19, 2013

DebugInfo: don't emit zero-length names for parameters · 715528be

David Blaikie authored Aug 19, 2013

We check this in many/all other cases, just missed this one it seems.
Perhaps it'd be worth unifying this so we never emit zero-length
DW_AT_names.

llvm-svn: 188649

715528be

Remove SpecialCaseList::findCategory. · 03c3324c
Peter Collingbourne authored Aug 19, 2013
```
It turned out that I didn't need this for DFSan.

llvm-svn: 188646
```
03c3324c

Aug 18, 2013

ARM: make sure we keep inline asm operands tied. · 55349a29

Tim Northover authored Aug 18, 2013

When patching inlineasm nodes to use GPRPair for 64-bit values, we
were dropping the information that two operands were tied, which
effectively broke the live-interval of vregs affected.

llvm-svn: 188643

55349a29

AVX-512: Added VMOVD, VMOVQ, VMOVSS, VMOVSD instructions. · 3ce8dbba
Elena Demikhovsky authored Aug 18, 2013
```
llvm-svn: 188637
```
3ce8dbba
Make more of the lowering helpers static. Also use MVT instead of EVT in a couple places. · e6861c9c
Craig Topper authored Aug 18, 2013
```
llvm-svn: 188629
```
e6861c9c
Remove unused stdio.h includes · 8b2a3d1f
Dmitri Gribenko authored Aug 18, 2013
```
llvm-svn: 188626
```
8b2a3d1f

Go through the really awkward dance required to delete the memory · 67ff8b71

Chandler Carruth authored Aug 18, 2013

allocated by setupterm. Without this, some folks are seeing leaked
memory whenever this routine is called more than once. Thanks to Craig
Topper for the report.

llvm-svn: 188615

67ff8b71

Fix SCEVExpander creating distinct duplicate PHI entries · 3f5279cc

Hal Finkel authored Aug 18, 2013

This fixes SCEVExpander so that it does not create multiple distinct induction
variables for duplicate PHI entries. Specifically, given some code like this:

do.body6:                                         ; preds = %do.body6, %do.body6, %if.then5
  %end.0 = phi i8* [ undef, %if.then5 ], [ %incdec.ptr, %do.body6 ], [ %incdec.ptr, %do.body6 ]
...

Note that it is legal to have multiple entries for a basic block so long as the
associated value is the same. So the above input is okay, but expanding an
AddRec in this loop could produce code like this:

do.body6:                                         ; preds = %do.body6, %do.body6, %if.then5
  %indvar = phi i64 [ %indvar.next, %do.body6 ], [ %indvar.next1, %do.body6 ], [ 0, %if.then5 ]
  %end.0 = phi i8* [ undef, %if.then5 ], [ %incdec.ptr, %do.body6 ], [ %incdec.ptr, %do.body6 ]
...
  %indvar.next = add i64 %indvar, 1
  %indvar.next1 = add i64 %indvar, 1

And this is not legal because there are two PHI entries for %do.body6 each with
a distinct value.
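
The legality rule above can be sketched as a small check. This is a simplified Python model, not LLVM's verifier: a PHI is represented as a list of (value, predecessor block) pairs, and the helper name `phi_is_legal` is hypothetical.

```python
def phi_is_legal(incoming):
    """Return True if every predecessor block maps to a single value.

    `incoming` is a list of (value, predecessor_block) pairs, a simplified
    stand-in for the operands of an LLVM PHI node.
    """
    seen = {}
    for value, block in incoming:
        # setdefault records the first value seen for this block.
        if seen.setdefault(block, value) != value:
            return False
    return True


# %end.0 from the message: %do.body6 appears twice with the same value.
end_0 = [("undef", "%if.then5"),
         ("%incdec.ptr", "%do.body6"),
         ("%incdec.ptr", "%do.body6")]

# %indvar as the expander could emit it: two distinct values for %do.body6.
indvar = [("%indvar.next", "%do.body6"),
          ("%indvar.next1", "%do.body6"),
          ("0", "%if.then5")]
```

Under this model `end_0` passes the check and `indvar` fails it, which matches the two cases described in the message.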

Unfortunately, I don't have an in-tree test case.

llvm-svn: 188614

3f5279cc

Aug 17, 2013

PR 16899: Do not modify the basic block using the iterator, but keep the · 8e3050db
Joerg Sonnenberger authored Aug 17, 2013
```
next value. This avoids crashes due to invalidation.

Patch by Joey Gouly.

llvm-svn: 188605
```
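
The "keep the next value" pattern from this message can be illustrated with a toy linked list. This is only a sketch in Python, not the patched LLVM code: `Node`, `Block`, and `erase_matching` are hypothetical names, and the list merely stands in for a basic block's instruction list.

```python
class Node:
    def __init__(self, name):
        self.name = name
        self.prev = None
        self.next = None


class Block:
    """Minimal doubly linked list standing in for a basic block."""

    def __init__(self, names):
        self.head = None
        tail = None
        for name in names:
            node = Node(name)
            if tail is None:
                self.head = node
            else:
                tail.next = node
                node.prev = tail
            tail = node

    def erase(self, node):
        # Unlink the node and clear its links, so a stale reference to it
        # can no longer be used to continue the walk.
        if node.prev is None:
            self.head = node.next
        else:
            node.prev.next = node.next
        if node.next is not None:
            node.next.prev = node.prev
        node.prev = node.next = None

    def names(self):
        out = []
        node = self.head
        while node is not None:
            out.append(node.name)
            node = node.next
        return out


def erase_matching(block, should_erase):
    node = block.head
    while node is not None:
        nxt = node.next  # keep the next value before touching the list
        if should_erase(node.name):
            block.erase(node)
        node = nxt
```

Reading `node.next` only after `erase` would stop the walk early here, because the erased node no longer points anywhere; saving it first keeps the traversal valid.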
8e3050db
R600: Fix possible use of an uninitialized variable · 59ed08b2
Tom Stellard authored Aug 17, 2013
```
Spotted by Nick Lewycky!

llvm-svn: 188599
```
59ed08b2
R600: Expand vector FRINT ops · b249b757
Tom Stellard authored Aug 16, 2013
```
Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>
llvm-svn: 188598
```
b249b757
R600: Expand vector FFLOOR ops · ad3aff24
Tom Stellard authored Aug 16, 2013
```
Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>
llvm-svn: 188597
```
ad3aff24
R600: Expand vector float operations for both SI and R600 · a92ff879
Tom Stellard authored Aug 16, 2013
```
Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>
llvm-svn: 188596
```
a92ff879
ARM: Properly constrain comparison fastisel register classes. · d7866790
Jim Grosbach authored Aug 16, 2013
```
Ongoing 'make the verifier happy' improvements to ARM fast-isel.

rdar://12594152

llvm-svn: 188595
```
d7866790

ARM: Fast-isel register class constrain for extends. · 3fa74910

Jim Grosbach authored Aug 16, 2013

Properly constrain the operand register class for instructions used
in [sz]ext expansion. Update more tests to use the verifier now that
we're getting the register classes correct.

rdar://12594152

llvm-svn: 188594

3fa74910

ARM: Fix more fast-isel verifier failures. · 06c2a681

Jim Grosbach authored Aug 16, 2013

Teach the generic instruction selection helper functions to constrain
the register classes of their input operands. For non-physical register
references, the generic code needs to be careful not to mess that up
when replacing references to result registers. As the comment indicates
for MachineRegisterInfo::replaceRegWith(), it's important to call
constrainRegClass() first.

rdar://12594152

llvm-svn: 188593

06c2a681

ARM: Clean up fast-isel machine verifier errors. · d69f3ed9

Jim Grosbach authored Aug 16, 2013

Lots of machine verifier errors result from using a plain GPR regclass
for incoming argument copies. A more restrictive rGPR class is more
appropriate since it more accurately represents what's happening, plus
it lines up better with isel later on so the verifier is happier.
Reduces the number of ARM fast-isel tests not running with the verifier
enabled by over half.

rdar://12594152

llvm-svn: 188592

d69f3ed9

Fix a subtle difference between running clang vs llc for mips16. · 0eae85fb

Reed Kotler authored Aug 16, 2013

This regards how mips16 is viewed. It's not really a target type but
there has always been a target for it in the td files. It's more properly
-mcpu=mips32 -mattr=+mips16 . This is how clang treats it but we have
always had the -mcpu=mips16 which I probably should delete now but it will
require updating all the .ll test cases for mips16. In this case it changed
how we decide if we have a count bits instruction and whether instruction
lowering should then expand ctlz. Now that we have dual mode compilation,
-mattr=+mips16 really just indicates the initial processor mode that
we are compiling for. (It is also possible to have -mcpu=64 -mattr=+mips16
but as far as I know, nobody has even built such a processor, though there
is an architecture manual for this).

llvm-svn: 188586

0eae85fb

Actually, use GNU inline asm for cpuid with clang · bf4f9ebb

Reid Kleckner authored Aug 16, 2013

Clang doesn't support the MSVC __cpuid intrinsic yet, and fixing that is
blocked on some fairly complicated issues.

llvm-svn: 188584

bf4f9ebb

Aug 16, 2013

DebugInfo: Allow the addition of other (such as static data) members to a record type after construction · d4e106e3

David Blaikie authored Aug 16, 2013

Plus a type cleanup & minor fix to enumerate members of declarations.

llvm-svn: 188577

d4e106e3

[PowerPC] Preparatory refactoring for making prologue and epilogue · 8893a3d1

Bill Schmidt authored Aug 16, 2013

safe on PPC32 SVR4 ABI

[Patch and following text by Mark Minich; committing on his behalf.]

There are FIXME's in PowerPC/PPCFrameLowering.cpp, method
PPCFrameLowering::emitPrologue() related to "negative offsets of R1"
on PPC32 SVR4. They're true, but the real issue is that on PPC32 SVR4
(and any ABI without a Red Zone), no spills may be made until after
the stackframe is claimed, which also includes the LR spill which is
at a positive offset. The same problem exists in emitEpilogue(),
though there's no FIXME for it. I intend to fix this issue, making
LLVM-compiled code finally safe for use on SVR4/EABI/e500 32-bit
platforms (including in particular, OS-free embedded systems & kernel
code, where interrupts may share the same stack as user code).

In preparation for making these changes, to make the diffs for the
functional changes less cluttered, I am providing the non-functional
refactorings in two stages:

Stage 1 does some minor fluffy refactorings to pull multiple method
calls up into a single bool, creating named bools for repeated uses of
obscure logic, moving some code up earlier because either stage 2 or
my final version will require it earlier, and rewording/adding some
comments. My stage 1 changes can be characterized as primarily fluffy
cleanup, the purpose of which may be unclear until the stage 2 or
final changes are made.

My stage 2 refactorings combine the separate PPC32 & PPC64 logic,
which is currently performed by largely duplicate code, into a single
flow, with the differences handled by a group of constants initialized
early in the methods.

This submission is for my stage 1 changes. There should be no
functional changes whatsoever; this is a pure refactoring.

llvm-svn: 188573

8893a3d1

Fixed RuntimeDyldELF absolute relocations. · ad6d349f

Richard Mitton authored Aug 16, 2013

If an ELF relocation is pointed at an absolute address, it will have a symbol ID of zero.
RuntimeDyldELF::processRelocationRef was not previously handling this case, and was instead trying to handle it as a section-relative fixup.

I think this is the right fix here, but my elf-fu is poor on some of the more exotic platforms, so I'd appreciate it if anyone with greater knowledge could verify this.

llvm-svn: 188572

ad6d349f

Switching to using a helper function instead of manually converting the string to UTF-8. · b16cf535
Aaron Ballman authored Aug 16, 2013
```
llvm-svn: 188566
```
b16cf535
Removing unused functionality. · d9fd87bd
Aaron Ballman authored Aug 16, 2013
```
llvm-svn: 188565
```
d9fd87bd
InstCombine: Use isAllOnesValue() instead of explicit -1. · d0de8ace
Jim Grosbach authored Aug 16, 2013
```
llvm-svn: 188563
```
d0de8ace

R600/SI: Add pattern for xor of i1 · 8522270d

Michel Danzer authored Aug 16, 2013



Fixes two recent piglit regressions with radeonsi.

Reviewed-by: Tom Stellard <thomas.stellard@amd.com>
llvm-svn: 188559

8522270d

R600/SI: Fix broken encoding of DS_WRITE_B32 · 20680b1c

Michel Danzer authored Aug 16, 2013



The logic in SIInsertWaits::getHwCounts() only really made sense for SMRD
instructions, and trying to shoehorn it into handling DS_WRITE_B32 caused
it to corrupt the encoding of that by clobbering the first operand with
the second one.

Undo that damage and only apply the SMRD logic to SMRD instructions.

Fixes some derivatives-related piglit regressions with radeonsi.

Reviewed-by: Tom Stellard <thomas.stellard@amd.com>
llvm-svn: 188558

20680b1c

Reverted test commit (r188556) · 6b32f892
Daniel Sanders authored Aug 16, 2013
```
llvm-svn: 188557
```
6b32f892
Test commit. Just a blank line · 7a2c9bc8
Daniel Sanders authored Aug 16, 2013
```
llvm-svn: 188556
```
7a2c9bc8
R600: Allocate memoperand in the MachineFunction so it doesn't leak. · a8eecee1
Benjamin Kramer authored Aug 16, 2013
```
llvm-svn: 188555
```
a8eecee1
Updating function comments; no functional changes intended. · dcd57573
Aaron Ballman authored Aug 16, 2013
```
llvm-svn: 188554
```
dcd57573

When initializing the PIC global base register on ARM/ELF add pc to fix the address. · 30920666

Benjamin Kramer authored Aug 16, 2013

This unbreaks PIC with fast isel on ELF targets (PR16717). The output matches
what GCC and SDag do for PIC but may not cover all of the many flavors of PIC
that exist.

llvm-svn: 188551

30920666

Add support for Thumb2 literal loads with negative zero offset · 46c1bcb4

Mihai Popa authored Aug 16, 2013

Thumb2 literal loads use an offset encoding which allows for 
negative zero. This fixes parsing and encoding so that #-0 
is correctly processed. The parser represents #-0 as INT32_MIN.
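
As a rough illustration of why a sentinel is needed, the sketch below splits a parsed offset into a direction bit and a 12-bit magnitude. It is an assumption-laden Python model, not the LLVM encoder: the function name and the `(u_bit, imm12)` return shape are hypothetical, and only the use of INT32_MIN for #-0 comes from the message above.

```python
INT32_MIN = -(2 ** 31)  # sentinel the message says the parser uses for #-0


def encode_literal_offset(offset):
    """Split a parsed offset into (u_bit, imm12).

    u_bit is 1 when the offset is added to the base and 0 when it is
    subtracted. The field names are illustrative assumptions.
    """
    if offset == INT32_MIN:
        # "#-0": subtract a magnitude of zero.
        return 0, 0
    u_bit = 1 if offset >= 0 else 0
    magnitude = abs(offset)
    if magnitude > 0xFFF:
        raise ValueError("offset does not fit in 12 bits")
    return u_bit, magnitude
```

A plain integer cannot distinguish #0 from #-0, so without the sentinel both would produce the "add" form.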

llvm-svn: 188549

46c1bcb4

Fix Thumb2 aliasing complementary instructions taking modified immediates · cf276b2c

Mihai Popa authored Aug 16, 2013

There are many Thumb instructions which take 12-bit immediates encoded in a special
8-bit value + 4-bit rotator form. Not all numbers are representable, and it's legal
to transform an assembly instruction to be able to encode the immediate.

For example: AND and BIC are complementary instructions; one can switch the AND
to a BIC as long as the immediate is complemented. 

The intent is to switch one instruction into its complementary one when the immediate
cannot be encoded in the form requested in the original assembly and when the 
complementary immediate is encodable.

The patch addresses two issues:
1. definition of t2SOImmNot immediate - it has to check that the original value is
not encoded naturally
2. t2AND and t2BIC instruction aliases which should use the Thumb2 SOImm operand 
rather than the ARM one.
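
The intended selection can be sketched in Python. This is a model of the idea only, not the TableGen definitions touched by the patch: `is_t2_modified_imm` and `select_and_or_bic` are hypothetical helpers, and the encodability test follows the commonly documented Thumb2 modified-immediate patterns (a plain byte, the three replicated-byte forms, or an 8-bit value with its top bit set rotated right by 8 to 31).

```python
MASK32 = 0xFFFFFFFF


def rol32(value, amount):
    """Rotate a 32-bit value left by `amount` bits."""
    amount %= 32
    return ((value << amount) | (value >> (32 - amount))) & MASK32


def is_t2_modified_imm(value):
    """Return True if `value` fits the Thumb2 modified-immediate form."""
    value &= MASK32
    if value <= 0xFF:
        return True                      # 0x000000XY
    low = value & 0xFF
    if value == (low | (low << 16)):
        return True                      # 0x00XY00XY
    high = (value >> 8) & 0xFF
    if value == ((high << 8) | (high << 24)):
        return True                      # 0xXY00XY00
    if value == low * 0x01010101:
        return True                      # 0xXYXYXYXY
    # An 8-bit value with its top bit set, rotated right by 8..31.
    return any(0x80 <= rol32(value, rot) <= 0xFF for rot in range(8, 32))


def select_and_or_bic(imm):
    """Prefer the natural AND encoding; fall back to BIC with ~imm."""
    imm &= MASK32
    if is_t2_modified_imm(imm):
        return ("and", imm)
    inverted = ~imm & MASK32
    if is_t2_modified_imm(inverted):
        return ("bic", inverted)
    return None  # neither form can encode the immediate
```

Checking the natural encoding first mirrors issue 1: the complemented form should only be chosen when the original value cannot be encoded as written.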

llvm-svn: 188548

cf276b2c

[SystemZ] Use SRST to implement strlen and strnlen · 0dec06a2
Richard Sandiford authored Aug 16, 2013
```
It would also make sense to use it for memchr; I'm working on that now.

llvm-svn: 188547
```
0dec06a2
[SystemZ] Use MVST to implement strcpy and stpcpy · bb83a50f
Richard Sandiford authored Aug 16, 2013
```
llvm-svn: 188546
```
bb83a50f
[SystemZ] Use CLST to implement strcmp · ca232710
Richard Sandiford authored Aug 16, 2013
```
llvm-svn: 188544
```
ca232710

[SystemZ] Fix handling of 64-bit memcmp results · e3827751

Richard Sandiford authored Aug 16, 2013

Generalize r188163 to cope with return types other than MVT::i32, just
as the existing visitMemCmpCall code did.  I've split this out into a
subroutine so that it can be used for other upcoming patches.

I also noticed that I'd used the wrong API to record the out chain.
It's a load that uses DAG.getRoot() rather than getRoot(), so the out
chain should go on PendingLoads.  I don't have a testcase for that because
we don't do any interesting scheduling on z yet.

llvm-svn: 188540

e3827751

[SystemZ] Fix sign of integer memcmp result · a5901257

Richard Sandiford authored Aug 16, 2013

r188163 used CLC to implement memcmp.  Code that compares the result
directly against zero can test the CC value produced by CLC, but code
that needs an integer result must use IPM.  The sequence I'd used was:

   ipm <reg>
   sll <reg>, 2
   sra <reg>, 30

but I'd forgotten that this inverts the order, so that CC==1 ("less")
becomes an integer greater than zero, and CC==2 ("greater") becomes
an integer less than zero.  This sequence should only be used if the
CLC arguments are reversed to compensate.  The problem then is that
the branch condition must also be reversed when testing the CLC
result directly.

Rather than do that, I went for a different sequence that works with
the natural CLC order:

   ipm <reg>
   srl <reg>, 28
   rll <reg>, <reg>, 31

One advantage of this is that it doesn't clobber CC.  A disadvantage
is that any sign extension to 64 bits must be done separately,
rather than being folded into the shifts.
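
The effect of the two sequences on the sign can be checked with a small simulation. This is a Python sketch under stated assumptions: the helpers model 32-bit register operations only, `ipm` places the condition code in bits 28-29 and the program mask in bits 24-27 of the low word with the top two bits zero, and side effects of the real instructions (such as SRA setting CC) are ignored. All function names are hypothetical.

```python
MASK32 = 0xFFFFFFFF


def ipm(cc, program_mask=0, low24=0):
    """Model IPM on the low 32 bits: 00 | CC | program mask | old low bits."""
    return ((cc & 0x3) << 28) | ((program_mask & 0xF) << 24) | (low24 & 0xFFFFFF)


def sll(x, n):
    return (x << n) & MASK32


def srl(x, n):
    return (x & MASK32) >> n


def as_signed(x):
    return x - (1 << 32) if x & 0x80000000 else x


def sra(x, n):
    # Arithmetic shift: shift the signed interpretation, then re-mask.
    return (as_signed(x) >> n) & MASK32


def rll(x, n):
    n %= 32
    return ((x << n) | (x >> (32 - n))) & MASK32


def old_sequence(cc, program_mask=0):
    """ipm; sll 2; sra 30"""
    return as_signed(sra(sll(ipm(cc, program_mask), 2), 30))


def new_sequence(cc, program_mask=0):
    """ipm; srl 28; rll 31"""
    return as_signed(rll(srl(ipm(cc, program_mask), 28), 31))
```

In this model `old_sequence(1)` is 1 and `old_sequence(2)` is -2, so "less" comes out positive, while `new_sequence(1)` is negative and `new_sequence(2)` is 1, matching the natural CLC order.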

llvm-svn: 188538

a5901257

This patch implements wait instruction for mips. Examples are added in test files. · 2df9ee6e
Vladimir Medic authored Aug 16, 2013
```
llvm-svn: 188537
```
2df9ee6e