Commits · 49374087f5cfdbdd47e398db455094421fab38ad · Roger Ferrer / llvm-epi-0.8

Mar 18, 2013

R600/SI: implement SI.load.const intrinsic · 49374087

Christian Konig authored Mar 18, 2013



Signed-off-by: Christian König <christian.koenig@amd.com>
Reviewed-by: Tom Stellard <thomas.stellard@amd.com>
llvm-svn: 177273

49374087

R600/SI: enable all S_LOAD and S_BUFFER_LOAD opcodes · 9c7afd11

Christian Konig authored Mar 18, 2013



Signed-off-by: Christian König <christian.koenig@amd.com>
Reviewed-by: Tom Stellard <thomas.stellard@amd.com>
llvm-svn: 177272

9c7afd11

R600/SI: fix inserting waits for all defines · f1fd5fad

Christian Konig authored Mar 18, 2013



Unfortunately the previous fix for inserting waits for unordered
defines wasn't sufficient, cause it's possible that even ordered
defines are only partially used (or not used at all).

Signed-off-by: Christian König <christian.koenig@amd.com>
Reviewed-by: Tom Stellard <thomas.stellard@amd.com>
llvm-svn: 177271

f1fd5fad

[asan] when creating string constants, set unnamed_attr and align 1 so that... · 10cc12f2

Kostya Serebryany authored Mar 18, 2013

[asan] when creating string constants, set unnamed_attr and align 1 so that equal strings are merged by the linker. Observed up to 1% binary size reduction. Thanks to Anton Korobeynikov for the suggestion

llvm-svn: 177264

10cc12f2

Mark internal classes as POD-like to get better behavior out of · f74654d2
Chandler Carruth authored Mar 18, 2013
```
SmallVector and DenseMap.

This speeds up SROA by 25% on PR15412.

llvm-svn: 177259
```
f74654d2

TLS support for MinGW targets. · 3e7005f1

Anton Korobeynikov authored Mar 18, 2013

MinGW is almost completely compatible to MSVC, with the exception of the _tls_array global not being available.

Patch by David Nadlinger!

llvm-svn: 177257

3e7005f1

Windows TLS: Section name prefix to ensure correct order · 2810a0ab

Anton Korobeynikov authored Mar 18, 2013

The linker sorts the .tls$<xyz> sections by name, and we need
to make sure any extra sections we produce (e.g. for weak globals) 
always end up between .tls$AAA and .tls$ZZZ, even if the name 
starts with e.g. an underscore.

Patch by David Nadlinger!

llvm-svn: 177256

2810a0ab

[asan] while generating the description of a global variable, emit the module... · bd016bb6

Kostya Serebryany authored Mar 18, 2013

[asan] while generating the description of a global variable, emit the module name in a separate field, thus not duplicating this information if every description. This decreases the binary size (observed up to 3%). https://code.google.com/p/address-sanitizer/issues/detail?id=168 . This changes the asan API version. llvm-part

llvm-svn: 177254

bd016bb6

[asan] don't instrument functions with available_externally linkage. This... · 6b5b58de

Kostya Serebryany authored Mar 18, 2013

[asan] don't instrument functions with available_externally linkage. This saves a bit of compile time and reduces the number of redundant global strings generated by asan (https://code.google.com/p/address-sanitizer/issues/detail?id=167) 

llvm-svn: 177250

6b5b58de

Extract a method. · 57a86508

Jakob Stoklund Olesen authored Mar 18, 2013

This computes the type of an instruction operand or result based on the
records in the instruction's ins and outs lists.

llvm-svn: 177244

57a86508

Post process ADC/SBB and use a shorter encoding if they use a sign extended immediate. · 0498b88d
Craig Topper authored Mar 18, 2013
```
llvm-svn: 177243
```
0498b88d
Refactor some duplicated code into helper functions. · 7e9a1cb1
Craig Topper authored Mar 18, 2013
```
llvm-svn: 177242
```
7e9a1cb1

Mar 17, 2013

Fix the build broken in r177239 · 5f78b37a

David Blaikie authored Mar 17, 2013

Seems some accidental C++11 crept in there. Reported by the C++98 buildbots.

llvm-svn: 177241

5f78b37a

Reduced dont-infinite-loop-during-block-escape-analysis.ll with bugpoint and... · a8b60a4f

Michael Gottesman authored Mar 17, 2013

Reduced dont-infinite-loop-during-block-escape-analysis.ll with bugpoint and moved it to retain-block-escape-analysis.ll.

*NOTE* I verified that the original bug behind
dont-infinite-loop-during-block-escape-analysis.ll occurs when using opt on
retain-block-escape-analysis.ll.

llvm-svn: 177240

a8b60a4f

Split out filename & directory from DIFile to start generalizing over DIScopes · 8fb82245

David Blaikie authored Mar 17, 2013

This is the first step to making all DIScopes have a common metadata prefix (so
that things (using directives, for example) that can appear in any scope can be
added to that common prefix). DIFile is itself a DIScope so the common prefix
of all DIScopes cannot be a DIFile - instead it's the raw filename/directory
name pair.

llvm-svn: 177239

8fb82245

Generalize debug info test to be resilient to changes in metadata node numbering · 2e488d1f
David Blaikie authored Mar 17, 2013
```
llvm-svn: 177238
```
2e488d1f
Improve DIFile debug info annotation by letting it fallback to DIScope · 08fb5457
David Blaikie authored Mar 17, 2013
```
llvm-svn: 177236
```
08fb5457
Use ArrayRef<MVT::SimpleValueType> when possible. · 13d4a07f
Jakob Stoklund Olesen authored Mar 17, 2013
```
Not passing vector references around makes it possible to use
SmallVector in most places.

llvm-svn: 177235
```
13d4a07f
To avoid symbol clash, undefine PPC here. PPC may be predefined on some hosts. · 37ef20d3
Sylvestre Ledru authored Mar 17, 2013
```
llvm-svn: 177234
```
37ef20d3
Build LLVMgold.so on FreeBSD using cmake. · bd5bd89e
Rafael Espindola authored Mar 17, 2013
```
Patch by Stephen Checkoway.

llvm-svn: 177233
```
bd5bd89e

The promised test case for r175939. · 97821831

Michael Gottesman authored Mar 17, 2013

This test makes sure that the ObjCARC escape analysis looks at the uses of
instructions which copy the block pointer value by checking all four cases where
that can occur.

llvm-svn: 177232

97821831

Improve PPC VR (Altivec) register spilling · fcc51d4f

Hal Finkel authored Mar 17, 2013

This change cleans up two issues with Altivec register spilling:

  1. The spilling code was inefficient (using two instructions, and add and a
     load, when just one would do)

  2. The code assumed that r0 would always be available (true for now, but this
     will change)

The new code handles VR spilling just like GPR spills but forced into r+r mode.
As a result, when any VR spills are present, we must now always allocate the
register-scavenger spill slot.

llvm-svn: 177231

fcc51d4f

Remove FIXMEs in PPC test cases related to unaligned loads/stores · 57080382
Hal Finkel authored Mar 16, 2013
```
As pointed out by Bill in response to r177160, these two FIXMEs
can also be removed.

llvm-svn: 177229
```
57080382

Mar 16, 2013

Remove PPC avoidWriteAfterWrite callback · 8b047039

Hal Finkel authored Mar 16, 2013

As a follow-up to r158719, remove PPCRegisterInfo::avoidWriteAfterWrite.
Jakob pointed out in response to r158719 that this callback is currently unused
and so this has no effect (and the speedups that I thought that I had observed
as a result of implementing this function must have been noise).

llvm-svn: 177228

8b047039

Change the default latency for implicit defs. · 6057017c

Andrew Trick authored Mar 16, 2013

Implicit defs are not currently positional and not modeled by the
per-operand machine model. Unfortunately, we treat defs that are part
of the architectural instruction description, like flags, the same as
other implicit defs. Really, they should have a fixed MachineInstr
layout and probably shouldn't be "implicit" at all.

For now, we'll change the default latency to be the max operand
latency. That will give flag setting operands full latency for x86
folded loads. Other kinds of "fake" implicit defs don't occur prior to
regalloc anyway, and we would like them to go away postRegAlloc as
well.

llvm-svn: 177227

6057017c

Machine model. Allow mixed itinerary classes and SchedRW lists. · bf8a28dc

Andrew Trick authored Mar 16, 2013

We always supported a mixture of the old itinerary model and new
per-operand model, but it required a level of indirection to map
itinerary classes to SchedRW lists. This was done for ARM A9.

Now we want to define x86 SchedRW lists, with the goal of removing its
itinerary classes, but still support the itineraries in the mean
time. When I original developed the model, Atom did not have
itineraries, so there was no reason to expect this requirement.

llvm-svn: 177226

bf8a28dc

[docs] Discuss a potential bug to be aware of. · ca11d2c7
Sean Silva authored Mar 16, 2013
```
llvm-svn: 177224
```
ca11d2c7
Test case for graceful handling of long file names on Windows. Patch thanks to Paul Robinson! · fcdf9a82
Aaron Ballman authored Mar 16, 2013
```
llvm-svn: 177223
```
fcdf9a82
Add X86 code emitter support AVX encoded MRMDestReg instructions. · 612f7bfa
Craig Topper authored Mar 16, 2013
```
Previously we weren't skipping the VVVV encoded register. Based on patch by Michael Liao.

llvm-svn: 177221
```
612f7bfa

Define more SchedWrites for annotating X86 instructions. · 63bff2eb

Jakob Stoklund Olesen authored Mar 16, 2013

Since almost all X86 instructions can fold loads, use a multiclass to
define register/memory pairs of SchedWrites.

An X86FoldableSchedWrite represents the register version of an
instruction. It holds a reference to the SchedWrite to use when the
instruction folds a load.

This will be used inside multiclasses that define rr and rm instruction
versions together.

llvm-svn: 177210

63bff2eb

Mar 15, 2013

Add SchedRW as an Instruction field. · a4a361df

Jakob Stoklund Olesen authored Mar 15, 2013

Don't require instructions to inherit Sched<...>. Sometimes it is more
convenient to say:

  let SchedRW = ... in {
    ...
  }

Which is now possible.

llvm-svn: 177199

a4a361df

[ADT] Fix StringSet::insert() to not allocate on every lookup. · 3145eb8e

Daniel Dunbar authored Mar 15, 2013

 - The previous implementation always constructed the StringMap entry, even if
   the key was present in the set.

llvm-svn: 177178

3145eb8e

[Support][Path][Windows] Fix dangling else. Don't call CloseHandle when CloseFD is false. · d932d411
Michael J. Spencer authored Mar 15, 2013
```
llvm-svn: 177175
```
d932d411

ARM cost model: Fix costs for some vector selects · 9d7a3827

Arnold Schwaighofer authored Mar 15, 2013

I was too pessimistic in r177105. Vector selects that fit into a legal register
type lower just fine. I was mislead by the code fragment that I was using. The
stores/loads that I saw in those cases came from lowering the conditional off
an address.

Changing the code fragment to:

%T0_3 = type <8 x i18>
%T1_3 = type <8 x i1>

define void @func_blend3(%T0_3* %loadaddr, %T0_3* %loadaddr2,
                         %T1_3* %blend, %T0_3* %storeaddr) {
  %v0 = load %T0_3* %loadaddr
  %v1 = load %T0_3* %loadaddr2
==> FROM:
  ;%c = load %T1_3* %blend
==> TO:
  %c = icmp slt %T0_3 %v0, %v1
==> USE:
  %r = select %T1_3 %c, %T0_3 %v0, %T0_3 %v1

  store %T0_3 %r, %T0_3* %storeaddr
  ret void
}

revealed this mistake.

radar://13403975

llvm-svn: 177170

9d7a3827

Adding an A15 specific optimization pass for interactions between S/D/Q... · 82dd6ac3

Silviu Baranga authored Mar 15, 2013

Adding an A15 specific optimization pass for interactions between S/D/Q registers. The pass handles all the required transformations pre-regalloc.

llvm-svn: 177169

82dd6ac3

ARM: Fix an old refacto. · 2f545714
Benjamin Kramer authored Mar 15, 2013
```
Fixes PR15520.

llvm-svn: 177167
```
2f545714

Enable unaligned memory access on PPC for scalar types · 8d7fbc9d

Hal Finkel authored Mar 15, 2013

Unaligned access is supported on PPC for non-vector types, and is generally
more efficient than manually expanding the loads and stores.

A few of the existing test cases were using expanded unaligned loads and stores
to test other features (like load/store with update), and for these test cases,
unaligned access remains disabled.

llvm-svn: 177160

8d7fbc9d

ARM cost model: Fix cost of fptrunc and fpext instructions · f5284ff6
Arnold Schwaighofer authored Mar 15, 2013
```
A vector fptrunc and fpext simply gets split into scalar instructions.

radar://13192358

llvm-svn: 177159
```
f5284ff6

Protect PPC Altivec patterns with a predicate · b0fac429

Hal Finkel authored Mar 15, 2013

In preparation for the addition of other SIMD ISA extensions (such as QPX) we
need to make sure that all Altivec patterns are properly predicated on having
Altivec support.

No functionality change intended (one test case needed to be updated b/c it
assumed that Altivec intrinsics would be supported without enabling Altivec
support).

llvm-svn: 177152

b0fac429

Fixup for r176933: more careful setup of path to llvm-symbolizer · cd27b98d
Alexey Samsonov authored Mar 15, 2013
```
llvm-svn: 177144
```
cd27b98d