Commits · 09c8a3dde5a184487413b52bb09c53b92de9fbc8 · Roger Ferrer / llvm-epi-0.8

Sep 05, 2012

Remove unused typedefs gcc4.8 warns about. · 09c8a3dd
Roman Divacky authored Sep 05, 2012
```
llvm-svn: 163225
```
09c8a3dd

MCJIT: getPointerToFunction() references target address space. · dc1123fc

Jim Grosbach authored Sep 05, 2012

Make sure to return a pointer into the target memory, not the local memory.
Often they are the same, but we can't assume that.

llvm-svn: 163217

dc1123fc

MCJIT: Add faux remote target execution to lli for the MCJIT. · 0f435d08

Jim Grosbach authored Sep 05, 2012

Simulate a remote target address space by allocating a seperate chunk of
memory for the target and re-mapping section addresses to that prior to
execution. Later we'll want to have a truly remote process, but for now
this gets us closer to being able to test the remote target
functionality outside LLDB.

rdar://12157052

llvm-svn: 163216

0f435d08

Switch BasicAliasAnalysis' cache to SmallDenseMap. · 6c2649ca

Benjamin Kramer authored Sep 05, 2012

It relies on clear() being fast and the cache rarely has more than 1 or 2
elements, so give it an inline capacity and always shrink it back down in case
it grows. DenseMap will grow to 64 buckets which makes clear() a lot slower.

llvm-svn: 163215

6c2649ca

LLVM Bug Fix 13709: Remove needless lsr(Rp, #32) instruction access the · 823f9eba

Pranav Bhandarkar authored Sep 05, 2012

subreg_hireg of register pair Rp.

	* lib/Target/Hexagon/HexagonPeephole.cpp(PeepholeDoubleRegsMap): New
	 DenseMap similar to PeepholeMap that additionally records subreg info
	 too.
        (runOnMachineFunction): Record information in PeepholeDoubleRegsMap
        and copy propagate the high sub-reg of Rp0 in Rp1 = lsr(Rp0, #32) to
	the instruction Rx = COPY Rp1:logreg_subreg.
	* test/CodeGen/Hexagon/remove_lsr.ll: New test.
	

llvm-svn: 163214

823f9eba

[asan] fix lint · 5f5973df
Kostya Serebryany authored Sep 05, 2012
```
llvm-svn: 163205
```
5f5973df

Fixed the DAG combiner to better handle the folding of AND nodes for vector... · 3f40d872

Silviu Baranga authored Sep 05, 2012

Fixed the DAG combiner to better handle the folding of AND nodes for vector types. The previous code was making the assumption that the length of the bitmask returned by isConstantSplat was equal to the size of the vector type. Now we first make sure that the splat value has at least the length of the vector lane type, then we only use as many fields as we have available in the splat value.

llvm-svn: 163203

3f40d872

[asan] extend the blacklist functionality to handle global-init. Patch by Reid Watson · 2fa38f8c
Kostya Serebryany authored Sep 05, 2012
```
llvm-svn: 163199
```
2fa38f8c

Remove some of the patterns added in r163196. Increasing the complexity on... · 81f06df6

Craig Topper authored Sep 05, 2012

Remove some of the patterns added in r163196. Increasing the complexity on insert_subvector into undef accomplishes the same thing.

llvm-svn: 163198

81f06df6

Add patterns for integer forms of VINSERTF128/VINSERTI128 folded with loads.... · f7c87d6e

Craig Topper authored Sep 05, 2012

Add patterns for integer forms of VINSERTF128/VINSERTI128 folded with loads. Also add patterns to turn subvector inserts with loads to index 0 of an undef into VMOVAPS.

llvm-svn: 163196

f7c87d6e

Add a FIXME that assumes we maintain backward compatibility until the next major release. · 5895edaf
Chad Rosier authored Sep 05, 2012
```
llvm-svn: 163195
```
5895edaf
Reorder the comments of EmitExceptionTable. · 1b170de7
Logan Chien authored Sep 05, 2012
```
llvm-svn: 163194
```
1b170de7
Fix UseInitArray option for MIPS target. · eeaaf65c
Logan Chien authored Sep 05, 2012
```
llvm-svn: 163193
```
eeaaf65c

Convert vextracti128/vextractf128 intrinsics to extract_subvector at DAG build... · 2db2353b

Craig Topper authored Sep 05, 2012

Convert vextracti128/vextractf128 intrinsics to extract_subvector at DAG build time. Similar was previously done for vinserti128/vinsertf128. Add patterns for folding these extract_subvectors with stores.

llvm-svn: 163192

2db2353b

Removed Trie.h; unused in a long time · 4a18731f
Marshall Clow authored Sep 05, 2012
```
llvm-svn: 163191
```
4a18731f
Remove redundant semicolons to fix -pedantic-errors build. · 398bd481
Richard Smith authored Sep 05, 2012
```
llvm-svn: 163190
```
398bd481
Fix function name per coding standard. · a05ea0f3
Chad Rosier authored Sep 05, 2012
```
llvm-svn: 163187
```
a05ea0f3
Fix function name per coding standard. · ba284b9b
Chad Rosier authored Sep 05, 2012
```
llvm-svn: 163186
```
ba284b9b
[ms-inline asm] Add support for the nsdialect keyword in the Bitcode · 18fcdcfb
Chad Rosier authored Sep 05, 2012
```
Reader/Writer.

llvm-svn: 163185
```
18fcdcfb
[ms-inline asm] Add the nsdialect keyword to the lexer. · 9772d82d
Chad Rosier authored Sep 05, 2012
```
llvm-svn: 163184
```
9772d82d
[ms-inline asm] Emit the (new) inline asm Non-Standard Dialect attribute. · f42fad62
Chad Rosier authored Sep 05, 2012
```
llvm-svn: 163181
```
f42fad62

Make provenance checking conservative in cases when · df476e5e

Dan Gohman authored Sep 04, 2012

pointers-to-strong-pointers may be in play. These can lead to retains and
releases happening in unstructured ways, foiling the optimizer. This fixes
rdar://12150909.

llvm-svn: 163180

df476e5e

BypassSlowDivision: Assign to reference, don't copy the object. · e535c1a1
Jakub Staszak authored Sep 04, 2012
```
llvm-svn: 163179
```
e535c1a1

Search the whole instruction for tied operands. · ade363e8

Jakob Stoklund Olesen authored Sep 04, 2012

Implicit uses can be dynamically tied to defs. This will soon be used
for predicated instructions on ARM.

llvm-svn: 163177

ade363e8

[ms-inline asm] Add the inline assembly dialect, AsmDialect, to the InlineAsm · 8b3014ea
Chad Rosier authored Sep 04, 2012
```
class.

llvm-svn: 163175
```
8b3014ea

[ms-inline asm] Remove the Inline Asm Non-Standard Dialect attribute. This · 38d24e67

Chad Rosier authored Sep 04, 2012

implementation does not co-exist well with how the sideeffect and alignstack
attributes are handled.  The reverts r161641.

llvm-svn: 163174

38d24e67

[LIT] Add a clang_tools_extra_site_cfg to match the various other site_cfg. · f1a9a565

David Blaikie authored Sep 04, 2012

This doesn't seem ideal, perhaps we could just keep the llvm_site_cfg and have
other config (clang and clang-tools-extra) derive their site_cfg from that.

Suggestions/complaints/ideas welcome.

llvm-svn: 163171

f1a9a565

Sep 04, 2012

Fix my previous patch (r163164). It does now what it is supposed to do: · 85a77875
Jakub Staszak authored Sep 04, 2012
```
Doesn't set MadeChange to TRUE if BypassSlowDivision doesn't change anything.

llvm-svn: 163165
```
85a77875

Return false if BypassSlowDivision doesn't change anything. · 46beca63

Jakub Staszak authored Sep 04, 2012

Also a few minor changes:
- use pre-inc instead of post-inc
- use isa instead of dyn_cast
- 80 col
- trailing spaces

llvm-svn: 163164

46beca63

Remove unneeded code. · ee2b3259
Jakub Staszak authored Sep 04, 2012
```
llvm-svn: 163160
```
ee2b3259
Typo. · d92e2bc2
Jakob Stoklund Olesen authored Sep 04, 2012
```
llvm-svn: 163154
```
d92e2bc2

Actually use the MachineOperand field for isRegTiedToDefOperand(). · 9fceda74

Jakob Stoklund Olesen authored Sep 04, 2012

The MachineOperand::TiedTo field was maintained, but not used.

This patch enables it in isRegTiedToDefOperand() and
isRegTiedToUseOperand() which are the actual functions use by the
register allocator.

llvm-svn: 163153

9fceda74

Move tie checks into MachineVerifier::visitMachineOperand. · c7579cdd
Jakob Stoklund Olesen authored Sep 04, 2012
```
llvm-svn: 163152
```
c7579cdd

Allow tied uses and defs in different orders. · 0a09da83

Jakob Stoklund Olesen authored Sep 04, 2012

After much agonizing, use a full 4 bits of precious MachineOperand space
to encode this. This uses existing padding, and doesn't grow
MachineOperand beyond its current 32 bytes.

This allows tied defs among the first 15 operands on a normal
instruction, just like the current MCInstrDesc constraint encoding.
Inline assembly needs to be able to tie more than the first 15 operands,
and gets special treatment.

Tied uses can appear beyond 15 operands, as long as they are tied to a
def that's in range.

llvm-svn: 163151

0a09da83

Generic Bypass Slow Div · cdf540d5

Preston Gurd authored Sep 04, 2012

- CodeGenPrepare pass for identifying div/rem ops
- Backend specifies the type mapping using addBypassSlowDivType
- Enabled only for Intel Atom with O2 32-bit -> 8-bit
- Replace IDIV with instructions which test its value and use DIVB if the value
is positive and less than 256.
- In the case when the quotient and remainder of a divide are used a DIV
and a REM instruction will be present in the IR. In the non-Atom case
they are both lowered to IDIVs and CSE removes the redundant IDIV instruction,
using the quotient and remainder from the first IDIV. However,
due to this optimization CSE is not able to eliminate redundant
IDIV instructions because they are located in different basic blocks.
This is overcome by calculating both the quotient (DIV) and remainder (REM)
in each basic block that is inserted by the optimization and reusing the result
values when a subsequent DIV or REM instruction uses the same operands.
- Test cases check for the presents of the optimization when calculating
either the quotient, remainder,  or both.

Patch by Tyler Nowicki!

llvm-svn: 163150

cdf540d5

Make sure macros in the include subdirectory are not used without being defined. · d43a50d3

Bob Wilson authored Sep 04, 2012

Rationale: For each preprocessor macro, either the definedness is what's
meaningful, or the value is what's meaningful, or both. If definedness is
meaningful, we should use #ifdef. If the value is meaningful, we should use
and #ifdef interchangeably for the same macro, seems ugly to me, even if
undefined macros are zero if used.

This also has the benefit that including an LLVM header doesn't prevent
you from compiling with -Wundef -Werror.

Patch by John Garvin!
<rdar://problem/12189979>

llvm-svn: 163148

d43a50d3

Porting Hexagon MI Scheduler to the new API. · 4d8986af

Sergei Larin authored Sep 04, 2012

Change current Hexagon MI scheduler to use new converging
scheduler. Integrates DFA resource model into it.

llvm-svn: 163137

4d8986af

Patch to implement UMLAL/SMLAL instructions for the ARM architecture · f00fb1c5

Arnold Schwaighofer authored Sep 04, 2012

This patch corrects the definition of umlal/smlal instructions and adds support
for matching them to the ARM dag combiner.

Bug 12213

Patch by Yin Ma!

llvm-svn: 163136

f00fb1c5

This patch optimizes shuffle instruction - generates 2 instructions instead of 4. · cbe99bbb

Elena Demikhovsky authored Sep 04, 2012

Since this specific shuffle is widely used in many workloads we have ~10% performance on them.

shufflevector <8 x float> %A, <8 x float> %B, <8 x i32> <i32 0, i32 8, i32 2, i32 10, i32 4, i32 12, i32 6, i32 14>

vmovaps (%rdx), %ymm0
vshufps $8, %ymm0, %ymm0, %ymm0
vmovaps (%rcx), %ymm1
vshufps $8, %ymm0, %ymm1, %ymm1
vunpcklps       %ymm0, %ymm1, %ymm0

vmovaps (%rcx), %ymm0
vmovsldup       (%rdx), %ymm1
vblendps        $85, %ymm0, %ymm1, %ymm0

llvm-svn: 163134

cbe99bbb

LICM may hoist an instruction with undefined behavior above a trap. · 03dcd85b

Nadav Rotem authored Sep 04, 2012

Scan the body of the loop and find instructions that may trap.
Use this information when deciding if it is safe to hoist or sink instructions.
Notice that we can optimize the search of instructions that may throw in the case of nested loops.

rdar://11518836

llvm-svn: 163132

03dcd85b