Commits · 88564f3cf7dc3f9d925cdcc181e885e893b8bb6f · Roger Ferrer / llvm-epi-0.8

Apr 19, 2013

PR14606: debug info imported_module support · 88564f3c

David Blaikie authored Apr 19, 2013

Adding another CU-wide list, in this case of imported_modules (since they
should be relatively rare, it seemed better to add a list where each element
had a "context" value, rather than add a (usually empty) list to every scope).
This takes care of DW_TAG_imported_module, but to fully address PR14606 we'll
need to expand this to cover DW_TAG_imported_declaration too.

llvm-svn: 179836

88564f3c

Add some more stats for fast isel vs. SelectionDAG, w.r.t lowering function · 6084f45f
Eli Bendersky authored Apr 19, 2013
```
arguments in entry BBs.

llvm-svn: 179824
```
6084f45f

Apr 17, 2013
- Add support for subsections to the ELF assembler. Fixes PR8717. · 2f495b93
  Peter Collingbourne authored Apr 17, 2013
```
Differential Revision: http://llvm-reviews.chandlerc.com/D598

llvm-svn: 179725
```
  2f495b93
Apr 15, 2013

Replace uses of the deprecated std::auto_ptr with OwningPtr. · b23ea72e

Andy Gibbs authored Apr 15, 2013

This is a rework of the broken parts in r179373 which were subsequently reverted in r179374 due to incompatibility with C++98 compilers.  This version should be ok under C++98.

llvm-svn: 179520

b23ea72e

Apr 14, 2013
- Document the decision to assume that the cost of floats is twice as much as integers. · 0db0690a
  Nadav Rotem authored Apr 14, 2013
```
llvm-svn: 179478
```
  0db0690a
Apr 13, 2013

MI-Sched: DEBUG formatting. · 1f0bb69b
Andrew Trick authored Apr 13, 2013
```
llvm-svn: 179452
```
1f0bb69b
MI-Sched cleanup. If an instruction has no valid sched class, do not attempt... · be2bccbc
Andrew Trick authored Apr 13, 2013
```
MI-Sched cleanup. If an instruction has no valid sched class, do not attempt to check for a variant.

llvm-svn: 179451
```
be2bccbc

MI-Sched: schedule physreg copies. · e833e1cd

Andrew Trick authored Apr 13, 2013

The register allocator expects minimal physreg live ranges. Schedule
physreg copies accordingly. This is slightly tricky when they occur in
the middle of the scheduling region. For now, this is handled by
rescheduling the copy when its associated instruction is
scheduled. Eventually we may instead bundle them, but only if we can
preserve the bundles as parallel copies during regalloc.

llvm-svn: 179449

e833e1cd

Apr 12, 2013

CostModel: increase the default cost of supported floating point operations... · 87a0af6e

Nadav Rotem authored Apr 12, 2013

CostModel: increase the default cost of supported floating point operations from 1 to two. Fixed a few tests that changes because now the cost of one insert + a vector operation on two doubles is lower than two scalar operations on doubles.

llvm-svn: 179413

87a0af6e

Revert broken pieces of r179373. · dae08512

Benjamin Kramer authored Apr 12, 2013

You can't copy an OwningPtr, and move semantics aren't available in C++98.

llvm-svn: 179374

dae08512

Replace uses of the deprecated std::auto_ptr with OwningPtr. · 95777550
Andy Gibbs authored Apr 12, 2013
```
llvm-svn: 179373
```
95777550
Don't disable block layout when forcing block alignment. · c0adc9fd
Nadav Rotem authored Apr 12, 2013
```
llvm-svn: 179355
```
c0adc9fd

Add a flag to align all basic blocks in the function. · c3b0f50a

Nadav Rotem authored Apr 12, 2013

When debugging performance regressions we often ask ourselves if the regression
that we see is due to poor isel/sched/ra or due to some micro-architetural
problem. When comparing two code sequences one good way to rule out front-end
bottlenecks (and other the issues) is to force code alignment. This pass adds
a flag that forces the alignment of all of the basic blocks in the program.

llvm-svn: 179353

c3b0f50a

Apr 11, 2013

Add braces around || in && to pacify GCC. · e7c45bc6
Benjamin Kramer authored Apr 11, 2013
```
llvm-svn: 179275
```
e7c45bc6

Manually remove successors in if conversion when CopyAndPredicateBlock is used · 95081bff

Hal Finkel authored Apr 10, 2013

In the simple and triangle if-conversion cases, when CopyAndPredicateBlock is
used because the to-be-predicated block has other predecessors, we need to
explicitly remove the old copied block from the successors list. Normally if
conversion relies on TII->AnalyzeBranch combined with BB->CorrectExtraCFGEdges
to cleanup the successors list, but if the predicated block contained an
un-analyzable branch (such as a now-predicated return), then this will fail.

These extra successors were causing a problem on PPC because it was causing
later passes (such as PPCEarlyReturm) to leave dead return-only basic blocks in
the code.

llvm-svn: 179227

95081bff

Apr 10, 2013

Generalize the PassConfig API and remove addFinalizeRegAlloc(). · e220323c

Andrew Trick authored Apr 10, 2013

The target hooks are getting out of hand. What does it mean to run
before or after regalloc anyway? Allowing either Pass* or AnalysisID
pass identification should make it much easier for targets to use the
substitutePass and insertPass APIs, and create less need for badly
named target hooks.

llvm-svn: 179140

e220323c

Apr 09, 2013

The .dwo section shouldn't contain the unrelocated values (and · 52ce7189

Eric Christopher authored Apr 09, 2013

therefore not at all) of the pc or statement list. We also don't
need to emit the compilation dir so save so space and time
and don't bother.

Fix up the testcase accordingly and verify that we don't emit
the attributes or the items that they use.

llvm-svn: 179114

52ce7189

DAGCombiner: Fold a shuffle on CONCAT_VECTORS into a new CONCAT_VECTORS if possible. · bbae991d

Benjamin Kramer authored Apr 09, 2013

This pattern occurs in SROA output due to the way vector arguments are lowered
on ARM.

The testcase from PR15525 now compiles into this, which is better than the code
we got with the old scalarrepl:
_Store:
	ldr.w	r9, [sp]
	vmov	d17, r3, r9
	vmov	d16, r1, r2
	vst1.8	{d16, d17}, [r0]
	bx	lr

Differential Revision: http://llvm-reviews.chandlerc.com/D647

llvm-svn: 179106

bbae991d

Apr 07, 2013

DW_FORM_sec_offset should be a relocation on platforms that use · 55863bef

Eric Christopher authored Apr 07, 2013

a relocation across sections. Do this for DW_AT_stmt list in the
skeleton CU and check the relocations in the debug_info section.

Add a FIXME for multiple CUs.

llvm-svn: 178969

55863bef

Apr 06, 2013

typo · c4bd84c1
Nadav Rotem authored Apr 06, 2013
```
llvm-svn: 178949
```
c4bd84c1

Dwarf: use utostr on CUID to append to SmallString. · 5b22f9fe

Manman Ren authored Apr 06, 2013

We used to do "SmallString += CUID", which is incorrect, since CUID will
be truncated to a char.

rdar://problem/13573833

llvm-svn: 178941

5b22f9fe

Reapply r178845 with fix - Fix bug in PEI's virtual-register scavenging · 3005c299

Hal Finkel authored Apr 05, 2013

This fixes PEI as previously described, but correctly handles the case where
the instruction defining the virtual register to be scavenged is the first in
the block. Arnold provided me with a bugpoint-reduced test case, but even that
seems too large to use as a regression test. If I'm successful in cleaning it
up then I'll commit that as well.

Original commit message:

This change fixes a bug that I introduced in r178058. After a register is
scavenged using one of the available spills slots the instruction defining the
virtual register needs to be moved to after the spill code. The scavenger has
already processed the defining instruction so that registers killed by that
instruction are available for definition in that same instruction. Unfortunately,
after this, the scavenger needs to iterate through the spill code and then
visit, again, the instruction that defines the now-scavenged register. In order
to avoid confusion, the register scavenger needs the ability to 'back up'
through the spill code so that it can again process the instructions in the
appropriate order. Prior to this fix, once the scavenger reached the
just-moved instruction, it would assert if it killed any registers because,
having already processed the instruction, it believed they were undefined.

Unfortunately, I don't yet have a small test case. Thanks to Pranav Bhandarkar
for diagnosing the problem and testing this fix.

llvm-svn: 178919

3005c299

Apr 05, 2013

Use the target options specified on a function to reset the back-end. · eb108bad

Bill Wendling authored Apr 05, 2013

During LTO, the target options on functions within the same Module may
change. This would necessitate resetting some of the back-end. Do this for X86,
because it's a Friday afternoon.

llvm-svn: 178917

eb108bad

Revert r178845 - Fix bug in PEI's virtual-register scavenging · 81c46d08

Hal Finkel authored Apr 05, 2013

Reverting because this breaks one of the LTO builders. Original commit message:

Unfortunately, I don't yet have a small test case. Thanks to Pranav Bhandarkar
for diagnosing the problem and testing this fix.

llvm-svn: 178916

81c46d08

Fix bug in PEI's virtual-register scavenging · e6f48e4e

Hal Finkel authored Apr 05, 2013

Unfortunately, I don't yet have a small test case. Thanks to Pranav Bhandarkar
for diagnosing the problem and testing this fix.

llvm-svn: 178845

e6f48e4e

RegisterPressure heuristics currently require signed comparisons. · 80e66ce0
Andrew Trick authored Apr 05, 2013
```
llvm-svn: 178823
```
80e66ce0

Disable DFSResult for ConvergingScheduler. · 96ce3848

Andrew Trick authored Apr 05, 2013

For now, just save the compile time since the ConvergingScheduler
heuristics don't use this analysis. We'll probably enable it later
after compile-time investigation.

llvm-svn: 178822

96ce3848

MachineScheduler: format DEBUG output. · 419d4917

Andrew Trick authored Apr 05, 2013

I'm getting more serious about tuning and enabling on x86/ARM. Start
by making the trace readable.

llvm-svn: 178821

419d4917

CostModel: Add parameter to instruction cost to further classify operand values · b9773871

Arnold Schwaighofer authored Apr 04, 2013

On certain architectures we can support efficient vectorized version of
instructions if the operand value is uniform (splat) or a constant scalar.
An example of this is a vector shift on x86.

We can efficiently support

for (i = 0 ; i < ; i += 4)
  w[0:3] = v[0:3] << <2, 2, 2, 2>

but not

for (i = 0; i < ; i += 4)
  w[0:3] = v[0:3] << x[0:3]

This patch adds a parameter to getArithmeticInstrCost to further qualify operand
values as uniform or uniform constant.

Targets can then choose to return a different cost for instructions with such
operand values.

A follow-up commit will test this feature on x86.

radar://13576547

llvm-svn: 178807

b9773871

Debug Info: revert 178722 for now. · bdcb4464

Manman Ren authored Apr 04, 2013

There is a difference for FORM_ref_addr between DWARF 2 and DWARF 3+.
Since Eric is against guarding DWARF 2 ref_addr with DarwinGDBCompat, we are
still in discussion on how to handle this.

The correct solution is to update our header to say version 4 instead of version
2 and update tool chains as well.

rdar://problem/13559431

llvm-svn: 178806

bdcb4464

typo · 322f41d0
Adrian Prantl authored Apr 04, 2013
```
llvm-svn: 178804
```
322f41d0

Apr 04, 2013

Formatting · fc186358
Eli Bendersky authored Apr 04, 2013
```
llvm-svn: 178771
```
fc186358

Debug Info: according to DWARF 2, FORM_ref_addr the same size as an address on · 5a15c9ed

Manman Ren authored Apr 04, 2013

the target system.

It was hard-coded to 4 bytes before. I can't get llvm to generate a
ref_addr on a reasonably sized testing case.

rdar://problem/13559431

llvm-svn: 178722

5a15c9ed

Apr 03, 2013
- Fix PR15632: No support for ppcf128 floating-point remainder on PowerPC. · 92e26646
  Bill Schmidt authored Apr 03, 2013
```
For this we need to use a libcall.  Previously LLVM didn't implement
libcall support for frem, so I've added it in the usual
straightforward manner.  A test case from the bug report is included.

llvm-svn: 178639
```
  92e26646
- Fix grammar. · 14c2067c
  Eric Christopher authored Apr 03, 2013
```
llvm-svn: 178624
```
  14c2067c
- Remove ZeroOrMore from the option description. We don't need it here. · 5590949f
  Eric Christopher authored Apr 03, 2013
```
llvm-svn: 178623
```
  5590949f
- Allow MachineTraceMetrics to be used when the model has no resources. · aeb69a54
  Jakob Stoklund Olesen authored Apr 02, 2013
```
It it still possible to extract information from itineraries, for
example.

llvm-svn: 178582
```
  aeb69a54
Apr 02, 2013

Don't attempt MTM heuristics without a scheduling model present. · 8fbfc591
Jakob Stoklund Olesen authored Apr 02, 2013
```
This should fix the PPC buildbots.

llvm-svn: 178558
```
8fbfc591

Count processor resources individually in MachineTraceMetrics. · 3ca14772

Jakob Stoklund Olesen authored Apr 02, 2013

The new instruction scheduling models provide information about the
number of cycles consumed on each processor resource. This makes it
possible to estimate ILP more accurately than simply counting
instructions / issue width.

The functions getResourceDepth() and getResourceLength() now identify
the limiting processor resource, and return a cycle count based on that.

This gives more precise resource information, particularly in traces
that use one resource a lot more than others.

llvm-svn: 178553

3ca14772

DAGCombiner: Merge store/loads when we have extload/truncstores · d6c6e868

Arnold Schwaighofer authored Apr 02, 2013

This is helps on architectures where i8,i16 are not legal but we have byte, and
short loads/stores. Allowing us to merge copies like the one below on ARM.

copy(char *a, char *b, int n) {
 do {
   int t0 = a[0];
   int t1 = a[1];
   b[0] = t0;
   b[1] = t1;

radar://13536387

llvm-svn: 178546

d6c6e868