Commits · 2a592dcc466dfba0a87c2b96c17c3f57c5dea8d8 · Roger Ferrer / llvm-epi-0.8

Apr 11, 2012

llvm-stress: don't make vectors of x86_mmx type · 2a592dcc

Dylan Noblesmith authored Apr 10, 2012

LangRef.html says:
"There are no arrays, vectors or constants of this type."

This was hitting assertions when passing the -generate-x86-mmx
option.

PR12452.

llvm-svn: 154445

2a592dcc

[tsan] two more compile-time optimizations: · 5ba61ac6

Kostya Serebryany authored Apr 10, 2012

- don't isntrument reads from constant globals.
Saves ~1.5% of instrumented instructions on CPU2006
(counting static instructions, not their execution).
- don't insrument reads from vtable (which is a global constant too).
Saves ~5%.

I did not measure the run-time impact of this,
but it is certainly non-negative.

llvm-svn: 154444

5ba61ac6

Apr 10, 2012

Handle llvm.fma.* intrinsics. rdar://10914096 · d0007f3c
Evan Cheng authored Apr 10, 2012
```
llvm-svn: 154439
```
d0007f3c
Add a comment noting that the fdiv -> fmul conversion won't generate · 4f53074c
Duncan Sands authored Apr 10, 2012
```
multiplication by a denormal, and some tests checking that.

llvm-svn: 154431
```
4f53074c

The MDString class stored a StringRef to the string which was already in a · c4c568b2

Bill Wendling authored Apr 10, 2012

StringMap. This was redundant and unnecessarily bloated the MDString class.

Because the MDString class is a "Value" and will never have a "name", and
because the Name field in the Value class is a pointer to a StringMap entry, we
repurpose the Name field for an MDString. It stores the StringMap entry in the
Name field, and uses the normal methods to get the string (name) back.

PR12474

llvm-svn: 154429

c4c568b2

Whitespace. · f7345b02
Chad Rosier authored Apr 10, 2012
```
llvm-svn: 154427
```
f7345b02
Revert r154396, which looks to be the real culprit behind the bot failures. · 235a7a17
Chad Rosier authored Apr 10, 2012
```
llvm-svn: 154426
```
235a7a17
Temporarily revert this patch to see if it brings the buildbots back. · 65ada95b
Eric Christopher authored Apr 10, 2012
```
llvm-svn: 154425
```
65ada95b

[tsan] compile-time instrumentation: do not instrument a read if · bf2de80b

Kostya Serebryany authored Apr 10, 2012

a write to the same temp follows in the same BB.
Also add stats printing.

On Spec CPU2006 this optimization saves roughly 4% of instrumented reads
(which is 3% of all instrumented accesses):
Writes            : 161216
Reads             : 446458
Reads-before-write: 18295

llvm-svn: 154418

bf2de80b

To ensure that we have more accurate line information for a block · e9abba71

Eric Christopher authored Apr 10, 2012

don't elide the branch instruction if it's the only one in the block,
otherwise it's ok.

PR9796 and rdar://11215207

llvm-svn: 154417

e9abba71

Revert r154397, which was causing make check failures on the buildbots. · 3efc8f22
Owen Anderson authored Apr 10, 2012
```
llvm-svn: 154414
```
3efc8f22

ARM fix cc_out operand handling for t2SUBrr instructions. · df5a2447

Jim Grosbach authored Apr 10, 2012

We were incorrectly conflating some add variants which don't have a
cc_out operand with the mirroring sub encodings, which do. Part of the
awesome non-orthogonality legacy of thumb1. Similarly, handling of
add/sub of an immediate was sometimes incorrectly removing the cc_out
operand for add/sub register variants.

rdar://11216577

llvm-svn: 154411

df5a2447

Remove unused variable. · 27351366
David Blaikie authored Apr 10, 2012
```
llvm-svn: 154398
```
27351366
Fix a dagcombine optimization which assumes that the vsetcc result type is always · 065564d8
Nadav Rotem authored Apr 10, 2012
```
of the same size as the compared values. This is ture for SSE/AVX/NEON but not
for all targets.

llvm-svn: 154397
```
065564d8

Modify the code that lowers shuffles to blends from using blendvXX to vblendXX. · f934f917

Nadav Rotem authored Apr 10, 2012

blendv uses a register for the selection while vblend uses an immediate.
On sandybridge they still have the same latency and execute on the same execution ports.

llvm-svn: 154396

f934f917

Make a somewhat subtle change in the logic of block placement. Sometimes · 68062617

Chandler Carruth authored Apr 10, 2012

the loop header has a non-loop predecessor which has been pre-fused into
its chain due to unanalyzable branches. In this case, rotating the
header into the body of the loop in order to place a loop exit at the
bottom of the loop is a Very Bad Idea as it makes the loop
non-contiguous.

I'm working on a good test case for this, but it's a bit annoynig to
craft. I should get one shortly, but I'm submitting this now so I can
begin the (lengthy) performance analysis process. An initial run of LNT
looks really, really good, but there is too much noise there for me to
trust it much.

llvm-svn: 154395

68062617

Transform div to mul with reciprocal only when fp imm is legal. · 4d1220de
Anton Korobeynikov authored Apr 10, 2012
```
This fixes PR12516 and uncovers one weird problem in legalize (workarounded)

llvm-svn: 154394
```
4d1220de
Use the correct section types on Solaris for unwind data on both x86 and x86-64. · bbec8720
David Chisnall authored Apr 10, 2012
```
Patch by Dmitri Shubin!

llvm-svn: 154391
```
bbec8720
Express the number of ULPs in fpaccuracy metadata as a real rather than a · af06b26c
Duncan Sands authored Apr 10, 2012
```
rational number, eg as 2.5 rather than 5, 2.  OK'd by Peter Collingbourne.

llvm-svn: 154387
```
af06b26c

Fix 12513: Loop unrolling breaks with indirect branches. · 4442bfe5

Andrew Trick authored Apr 10, 2012

Take this opportunity to generalize the indirectbr bailout logic for
loop transformations. CFG transformations will never get indirectbr
right, and there's no point trying.

llvm-svn: 154386

4442bfe5

whitespace · 4104ed9c
Andrew Trick authored Apr 10, 2012
```
llvm-svn: 154385
```
4104ed9c
Fix for register pressure tables. · 7d52db98
Andrew Trick authored Apr 10, 2012
```
Recent refactoring introduced a bug. Fix: added buildRegUnitSets.

llvm-svn: 154382
```
7d52db98
Add proper checks. · 07526249
Evan Cheng authored Apr 10, 2012
```
llvm-svn: 154379
```
07526249
Make the code slightly more palatable. · 136861d9
Evan Cheng authored Apr 10, 2012
```
llvm-svn: 154378
```
136861d9
Use std::includes instead of my own implementation. · 9002c315
Andrew Trick authored Apr 10, 2012
```
Jakob's review.

llvm-svn: 154377
```
9002c315
Added a TargetRegisterInfo interface for accessing register pressure sets. · 31f64875
Andrew Trick authored Apr 10, 2012
```
llvm-svn: 154375
```
31f64875

Added register unit sets to the target description. · 739a0038

Andrew Trick authored Apr 10, 2012

This is a new algorithm that finds sets of register units that can be
used to model registers pressure. This handles arbitrary, overlapping
register classes. Each register class is associated with a (small)
list of pressure sets. These are the dimensions of pressure affected
by the register class's liveness.

llvm-svn: 154374

739a0038

Added register unit weights to the target description. · 1d7a2c57

Andrew Trick authored Apr 10, 2012

This is a new algorithm that associates registers with weighted
register units to accuretely model their effect on register
pressure. This handles registers with multiple overlapping
subregisters. It is possible, but almost inconceivable that the
algorithm fails to find an exact solution for a target description. If
an exact solution cannot be found, an inexact, but reasonable solution
will be chosen.

llvm-svn: 154373

1d7a2c57

Fix header comment · 3a6e88dc
Andrew Trick authored Apr 10, 2012
```
llvm-svn: 154372
```
3a6e88dc
Add a constructor for DataRefImpl and remove excess initialization. · 549515e1
Danil Malyshev authored Apr 10, 2012
```
llvm-svn: 154371
```
549515e1

Fix a long standing tail call optimization bug. When a libcall is emitted · f8bad080

Evan Cheng authored Apr 10, 2012

legalizer always use the DAG entry node. This is wrong when the libcall is
emitted as a tail call since it effectively folds the return node. If
the return node's input chain is not the entry (i.e. call, load, or store)
use that as the tail call input chain.

PR12419
rdar://9770785
rdar://11195178

llvm-svn: 154370

f8bad080

Don't try to zExt just to check if an integer constant is zero, it might · 1d9672bd
Rafael Espindola authored Apr 10, 2012
```
not fit in a i64.

llvm-svn: 154364
```
1d9672bd
ARM LDR/LDRT has the same encoding collision as STR/STRT. · 8f99bc3a
Jim Grosbach authored Apr 10, 2012
```
Generalized logic of r154141.

llvm-svn: 154362
```
8f99bc3a
Test case for PR12495. · ec96cd06
Lang Hames authored Apr 09, 2012
```
llvm-svn: 154359
```
ec96cd06
Revert the 'EnableInitializing' flag. There is debate on whether we should run... · b5cedde6
Bill Wendling authored Apr 09, 2012
```
Revert the 'EnableInitializing' flag. There is debate on whether we should run that pass by default in LTO.

llvm-svn: 154356
```
b5cedde6

Apply the scope restrictions after parsing the command line options. There may... · 383fda29

Bill Wendling authored Apr 09, 2012

Apply the scope restrictions after parsing the command line options. There may be some which are used in that function.

llvm-svn: 154348

383fda29

Apr 09, 2012

Have TargetLowering::getPICJumpTableRelocBase return a node that points to the · 8483a6c4
Akira Hatanaka authored Apr 09, 2012
```
GOT if jump table uses 64-bit gp-relative relocation.

llvm-svn: 154341
```
8483a6c4

When performing a truncating store, it's possible to rearrange the data · e0e38f61

Chad Rosier authored Apr 09, 2012

in-register, such that we can use a single vector store rather then a 
series of scalar stores.

For func_4_8 the generated code

	vldr	d16, LCPI0_0
	vmov	d17, r0, r1
	vadd.i16	d16, d17, d16
	vmov.u16	r0, d16[3]
	strb	r0, [r2, #3]
	vmov.u16	r0, d16[2]
	strb	r0, [r2, #2]
	vmov.u16	r0, d16[1]
	strb	r0, [r2, #1]
	vmov.u16	r0, d16[0]
	strb	r0, [r2]
	bx	lr

becomes

	vldr	d16, LCPI0_0
	vmov	d17, r0, r1
	vadd.i16	d16, d17, d16
	vuzp.8	d16, d17
	vst1.32	{d16[0]}, [r2, :32]
	bx	lr

I'm not fond of how this combine pessimizes 2012-03-13-DAGCombineBug.ll,
but I couldn't think of a way to judiciously apply this combine.

This

	ldrh	r0, [r0, #4]
	strh	r0, [r1]

becomes

	vldr	d16, [r0]
	vmov.u16	r0, d16[2]
	vmov.32	d16[0], r0
	vuzp.16	d16, d17
	vst1.32	{d16[0]}, [r1, :32]

PR11158
rdar://10703339

llvm-svn: 154340

e0e38f61

Patch r153892 for PR11861 apparently broke an external project (see PR12493). · 3ad11ff9

Lang Hames authored Apr 09, 2012

This patch restores TwoAddressInstructionPass's pre-r153892 behaviour when
rescheduling instructions in TryInstructionTransform. Hopefully this will fix
PR12493. To refix PR11861, lowering of INSERT_SUBREGS is deferred until after
the copy that unties the operands is emitted (this seems to be a more
appropriate fix for that issue anyway).

llvm-svn: 154338

3ad11ff9

Update comments and remove unnecessary isVolatile() check. · 99cbde9e
Chad Rosier authored Apr 09, 2012
```
llvm-svn: 154336
```
99cbde9e