- Mar 17, 2013
-
-
Rafael Espindola authored
Patch by Stephen Checkoway. llvm-svn: 177233
-
Michael Gottesman authored
This test makes sure that the ObjCARC escape analysis looks at the uses of instructions that copy the block pointer value, checking all four cases where that can occur. llvm-svn: 177232
-
Hal Finkel authored
This change cleans up two issues with Altivec register spilling:
1. The spilling code was inefficient (using two instructions, an add and a load, when just one would do).
2. The code assumed that r0 would always be available (true for now, but this will change).
The new code handles VR spilling just like GPR spills but forced into r+r mode. As a result, when any VR spills are present, we must now always allocate the register-scavenger spill slot. llvm-svn: 177231
-
Hal Finkel authored
As pointed out by Bill in response to r177160, these two FIXMEs can also be removed. llvm-svn: 177229
-
- Mar 16, 2013
-
-
Hal Finkel authored
As a follow-up to r158719, remove PPCRegisterInfo::avoidWriteAfterWrite. Jakob pointed out in response to r158719 that this callback is currently unused, so this has no effect (and the speedups that I thought I had observed as a result of implementing this function must have been noise). llvm-svn: 177228
-
Andrew Trick authored
Implicit defs are not currently positional and not modeled by the per-operand machine model. Unfortunately, we treat defs that are part of the architectural instruction description, like flags, the same as other implicit defs. Really, they should have a fixed MachineInstr layout and probably shouldn't be "implicit" at all. For now, we'll change the default latency to be the max operand latency. That will give flag setting operands full latency for x86 folded loads. Other kinds of "fake" implicit defs don't occur prior to regalloc anyway, and we would like them to go away postRegAlloc as well. llvm-svn: 177227
-
Andrew Trick authored
We always supported a mixture of the old itinerary model and the new per-operand model, but it required a level of indirection to map itinerary classes to SchedRW lists. This was done for ARM A9. Now we want to define x86 SchedRW lists, with the goal of removing its itinerary classes, but still support the itineraries in the meantime. When I originally developed the model, Atom did not have itineraries, so there was no reason to expect this requirement. llvm-svn: 177226
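The indirection in question is, presumably, the ItinRW mapping from TargetSchedule.td, which lets a processor keep its itinerary classes while attaching SchedRW lists to them. A rough sketch of the ARM A9 usage (the specific names here are recalled for illustration, not taken from this commit):
let SchedModel = CortexA9Model in {
  // Map legacy itinerary classes onto per-operand SchedWrite lists.
  def : ItinRW<[WriteALU], [IIC_iALUi, IIC_iALUr]>;
}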
-
Sean Silva authored
llvm-svn: 177224
-
Aaron Ballman authored
llvm-svn: 177223
-
Craig Topper authored
Previously we weren't skipping the VVVV encoded register. Based on patch by Michael Liao. llvm-svn: 177221
-
Jakob Stoklund Olesen authored
Since almost all X86 instructions can fold loads, use a multiclass to define register/memory pairs of SchedWrites. An X86FoldableSchedWrite represents the register version of an instruction. It holds a reference to the SchedWrite to use when the instruction folds a load. This will be used inside multiclasses that define rr and rm instruction versions together. llvm-svn: 177210
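A self-contained TableGen sketch of the pattern described above (the SchedWrite class is a stand-in here, and the pair/def names are illustrative rather than quoted from the commit):
class SchedWrite;
class X86FoldableSchedWrite : SchedWrite {
  SchedWrite Folded;  // SchedWrite to use when the instruction folds a load
}
multiclass X86SchedWritePair {
  def Ld : SchedWrite;                  // memory (folded-load) version
  def NAME : X86FoldableSchedWrite {    // register version
    let Folded = !cast<SchedWrite>(NAME # "Ld");
  }
}
defm WriteALU : X86SchedWritePair;      // yields WriteALU and WriteALULd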
-
- Mar 15, 2013
-
-
Jakob Stoklund Olesen authored
Don't require instructions to inherit Sched<...>. Sometimes it is more convenient to say:
  let SchedRW = ... in { ... }
which is now possible. llvm-svn: 177199
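A minimal, self-contained sketch of the two styles (the classes and names below are made-up stand-ins, not definitions from the tree):
// Stand-ins for the real SchedWrite/Sched/Instruction classes.
class SchedWrite;
def WriteALU : SchedWrite;
class Sched<list<SchedWrite> schedrw> { list<SchedWrite> SchedRW = schedrw; }
class Inst { list<SchedWrite> SchedRW = []; }

// Old style: each instruction inherits Sched<...> individually.
def ADDrr : Inst, Sched<[WriteALU]>;

// New style: one let block annotates a whole group of instructions.
let SchedRW = [WriteALU] in {
  def SUBrr : Inst;
  def ANDrr : Inst;
}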
-
Daniel Dunbar authored
- The previous implementation always constructed the StringMap entry, even if the key was present in the set. llvm-svn: 177178
-
Michael J. Spencer authored
llvm-svn: 177175
-
Arnold Schwaighofer authored
I was too pessimistic in r177105. Vector selects that fit into a legal register type lower just fine. I was misled by the code fragment that I was using. The stores/loads that I saw in those cases came from lowering a conditional that was loaded from an address. Changing the code fragment to:
%T0_3 = type <8 x i18>
%T1_3 = type <8 x i1>
define void @func_blend3(%T0_3* %loadaddr, %T0_3* %loadaddr2,
                         %T1_3* %blend, %T0_3* %storeaddr) {
  %v0 = load %T0_3* %loadaddr
  %v1 = load %T0_3* %loadaddr2
==> FROM:
  ;%c = load %T1_3* %blend
==> TO:
  %c = icmp slt %T0_3 %v0, %v1
==> USE:
  %r = select %T1_3 %c, %T0_3 %v0, %T0_3 %v1
  store %T0_3 %r, %T0_3* %storeaddr
  ret void
}
revealed this mistake. radar://13403975 llvm-svn: 177170
-
Silviu Baranga authored
Adding an A15 specific optimization pass for interactions between S/D/Q registers. The pass handles all the required transformations pre-regalloc. llvm-svn: 177169
-
Benjamin Kramer authored
Fixes PR15520. llvm-svn: 177167
-
Hal Finkel authored
Unaligned access is supported on PPC for non-vector types, and is generally more efficient than manually expanding the loads and stores. A few of the existing test cases were using expanded unaligned loads and stores to test other features (like load/store with update), and for these test cases, unaligned access remains disabled. llvm-svn: 177160
-
Arnold Schwaighofer authored
A vector fptrunc and fpext simply gets split into scalar instructions. radar://13192358 llvm-svn: 177159
-
Hal Finkel authored
In preparation for the addition of other SIMD ISA extensions (such as QPX) we need to make sure that all Altivec patterns are properly predicated on having Altivec support. No functionality change intended (one test case needed to be updated because it assumed that Altivec intrinsics would be supported without enabling Altivec support). llvm-svn: 177152
-
Alexey Samsonov authored
llvm-svn: 177144
-
Craig Topper authored
Use NumBaseBits in a few more places in SmallBitVector instead of recalculating it. No functional change. llvm-svn: 177142
-
Rafael Espindola authored
This is a very late complement to r130637, which fixed this on x86_64. Fixes pr15448. Since it looks like every ELF architecture uses this encoding when using CFI, make it the default for ELF. Just exclude mips64el. It has a lovely .ll -> .o test (ef_frame.ll) that checks that nothing changes in the binary content of the .eh_frame produced by llc. Oblige it. llvm-svn: 177141
-
Hal Finkel authored
For spills into a large stack frame, the FI-elimination code uses the register scavenger to obtain a free GPR for use with an r+r-addressed load or store. When there are no available GPRs, the scavenger gets one by using its spill slot. Previously, we were not always allocating that spill slot and the RS would assert when the spill slot was needed. I don't currently have a small test that triggered the assert, but I've created a small regression test that verifies that the spill slot is now added when the stack frame is sufficiently large. llvm-svn: 177140
-
Eric Christopher authored
issues. llvm-svn: 177136
-
Eric Christopher authored
llvm-svn: 177135
-
Nadav Rotem authored
llvm-svn: 177131
-
Nadav Rotem authored
llvm-svn: 177130
-
David Blaikie authored
(these were added in r177089) llvm-svn: 177129
-
Akira Hatanaka authored
llvm-svn: 177128
-
- Mar 14, 2013
-
-
Andrew Trick authored
This is the other half of r177122 that I meant to commit at the same time. llvm-svn: 177123
-
Jakob Stoklund Olesen authored
The new InstrSchedModel is easier to use than the instruction itineraries. It will be used to model instruction latency and throughput in modern Intel microarchitectures like Sandy Bridge. InstrSchedModel should be able to coexist with instruction itinerary classes, but for cleanliness we should switch the Atom processor model to the new InstrSchedModel as well. llvm-svn: 177122
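For context, a per-processor model under the new scheme is declared roughly like this (a sketch assuming the standard TargetSchedule.td definitions; the field values are placeholders, not the actual Sandy Bridge numbers):
def SandyBridgeModel : SchedMachineModel {
  let IssueWidth = 4;            // micro-ops issued per cycle
  let LoadLatency = 4;           // optimistic cache-hit load latency
  let MispredictPenalty = 16;    // branch-mispredict cost in cycles
  let Itineraries = NoItineraries;
}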
-
Reed Kotler authored
See the Mips16ISelLowering.cpp patch for a use of this. For now the extra code in Mips16ISelLowering.cpp is a nop but is used for test purposes. Mips32 registers are set up and then removed, and then the Mips16 registers are set up. Normally you need to add register classes and then call computeRegisterProperties. llvm-svn: 177120
-
Arnold Schwaighofer authored
Also remove some unneeded function attributes. llvm-svn: 177114
-
Chad Rosier authored
the win64 calling convention. rdar://13423768 llvm-svn: 177113
-
Andrew Trick authored
This allows arbitrary groups of processor resources. Using something in a subset automatically counts against the superset. Currently, this only works if the superset is also a ProcResGroup as opposed to a SuperUnit. This allows SandyBridge to be expressed naturally, which will be checked in shortly.
def SBPort01  : ProcResGroup<[SBPort0, SBPort1]>;
def SBPort15  : ProcResGroup<[SBPort1, SBPort5]>;
def SBPort23  : ProcResGroup<[SBPort2, SBPort3]>;
def SBPort015 : ProcResGroup<[SBPort0, SBPort1, SBPort5]>;
llvm-svn: 177112
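To sketch how such a group is consumed (assuming the standard TargetSchedule.td definitions plus a SandyBridgeModel and a WriteALU SchedWrite, none of which are part of this commit):
let SchedModel = SandyBridgeModel in {
  // An ALU micro-op can issue on port 0, 1, or 5, so its write is resourced
  // on the SBPort015 group; a write resourced on SBPort0 alone would also
  // count against SBPort01 and SBPort015, the groups that contain it.
  def : WriteRes<WriteALU, [SBPort015]> { let Latency = 1; }
}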
-
Hal Finkel authored
This is a generic function (derived from PEI); moving it into MachineFrameInfo eliminates a current redundancy between the ARM and AArch64 backends, and will allow it to be used by the PowerPC target code. No functionality change intended. llvm-svn: 177111
-
Hal Finkel authored
Add the current PEI register scavenger as a parameter to the processFunctionBeforeFrameFinalized callback. This change is necessary in order to allow the PowerPC target code to set the register scavenger frame index after the save-area offset adjustments performed by processFunctionBeforeFrameFinalized. Only after these adjustments have been made is it possible to estimate the size of the stack frame. llvm-svn: 177108
-
Hal Finkel authored
Make requiresFrameIndexScavenging return true, and create virtual registers in the spilling code instead of using the register scavenger directly. This makes the target-level code simpler, and importantly, delays the scavenging until after callee-saved register processing (which will be important for later changes). Also cleans up trackLivenessAfterRegAlloc (makes it inline in the header with the other related functions). This makes it clear that it always returns true. No functionality change intended. llvm-svn: 177107
-
Hal Finkel authored
We used to add a spill slot for the register scavenger whenever the function has a frame pointer. This is unnecessarily conservative: We may need the spill slot for dynamic stack allocations, and functions with dynamic stack allocations always have a FP, but we might also have a FP for other reasons (such as the user explicitly disabling frame-pointer elimination), and we don't necessarily need a spill slot for those functions. The structsinregs test needed adjustment because it disables FP elimination. llvm-svn: 177106
-