Commits · 3e3cdecf98e5a5a2ef1296f96c52da4b435dd950 · Roger Ferrer / llvm-epi-0.8

Jun 06, 2012

Andrew Trick authored Jun 06, 2012

Allow targets to access this API. It's required for RegisterPressure.

llvm-svn: 158102

05ff4667

Remove unused private fields found by clang's new -Wunused-private-field. · 628a39fa

Benjamin Kramer authored Jun 06, 2012

There are some that I didn't remove this round because they looked like
obvious stubs. There are dead variables in gtest too, they should be
fixed upstream.

llvm-svn: 158090

628a39fa

Jun 02, 2012

Switch all register list clients to the new MC*Iterator interface. · 54038d79

Jakob Stoklund Olesen authored Jun 01, 2012

No functional change intended.

Sorry for the churn. The iterator classes are supposed to help avoid
giant commits like this one in the future. The TableGen-produced
register lists are getting quite large, and it may be necessary to
change the table representation.

This makes it possible to do so without changing all clients (again).

llvm-svn: 157854

54038d79

Apr 23, 2012

This patch fixes a problem which arose when using the Post-RA scheduler · 9a091475

Preston Gurd authored Apr 23, 2012

on X86 Atom. Some of our tests failed because the tail merging part of
the BranchFolding pass was creating new basic blocks which did not
contain live-in information. When the anti-dependency code in the Post-RA
scheduler ran, it would sometimes rename the register containing
the function return value because the fact that the return value was
live-in to the subsequent block had been lost. To fix this, it is necessary
to run the RegisterScavenging code in the BranchFolding pass.

This patch makes sure that the register scavenging code is invoked
in the X86 subtarget only when post-RA scheduling is being done.
Post RA scheduling in the X86 subtarget is only done for Atom.

This patch adds a new function to the TargetRegisterClass to control
whether or not live-ins should be preserved during branch folding.
This is necessary in order for the anti-dependency optimizations done
during the PostRASchedulerList pass to work properly when doing
Post-RA scheduling for the X86 in general and for the Intel Atom in particular.

The patch adds and invokes the new function trackLivenessAfterRegAlloc()
instead of using the existing requiresRegisterScavenging().
It changes BranchFolding.cpp to call trackLivenessAfterRegAlloc() instead of
requiresRegisterScavenging(). It changes the all the targets that
implemented requiresRegisterScavenging() to also implement
trackLivenessAfterRegAlloc().  

It adds an assertion in the Post RA scheduler to make sure that post RA
liveness information is available when it is needed.

It changes the X86 break-anti-dependencies test to use –mcpu=atom, in order
to avoid running into the added assertion.

Finally, this patch restores the use of anti-dependency checking
(which was turned off temporarily for the 3.1 release) for
Intel Atom in the Post RA scheduler.

Patch by Andy Zhang!

Thanks to Jakob and Anton for their reviews.

llvm-svn: 155395

9a091475

Mar 09, 2012
- misched interface: rename Begin/End to RegionBegin/RegionEnd since they are not private. · 8c207e47
  Andrew Trick authored Mar 09, 2012
```
llvm-svn: 152382
```
  8c207e47
Mar 08, 2012
- misched prep: Expose the ScheduleDAGInstrs interface so targets may · 9a0c5839
  Andrew Trick authored Mar 07, 2012
```
implement their own MachineScheduler.

llvm-svn: 152261
```
  9a0c5839
- misched prep: rename InsertPos to End. · a316faab
  Andrew Trick authored Mar 07, 2012
```
ScheduleDAGInstrs knows nothing about how instructions will be moved or inserted.

llvm-svn: 152256
```
  a316faab
- misched preparation: rename core scheduler methods for consistency. · 52226d40
  Andrew Trick authored Mar 07, 2012
```
We had half the API with one convention, half with another. Now was a
good time to clean it up.

llvm-svn: 152255
```
  52226d40
Mar 07, 2012

misched preparation: clarify ScheduleDAG and ScheduleDAGInstrs roles. · 60cf03e7

Andrew Trick authored Mar 07, 2012

ScheduleDAG is responsible for the DAG: SUnits and SDeps. It provides target hooks for latency computation.

ScheduleDAGInstrs extends ScheduleDAG and defines the current scheduling region in terms of MachineInstr iterators. It has access to the target's scheduling itinerary data. ScheduleDAGInstrs provides the logic for building the ScheduleDAG for the sequence of MachineInstrs in the current region. Target's can implement highly custom schedulers by extending this class.

ScheduleDAGPostRATDList provides the driver and diagnostics for current postRA scheduling. It maintains a current Sequence of scheduled machine instructions and logic for splicing them into the block. During scheduling, it uses the ScheduleHazardRecognizer provided by the target.

Specific changes:
- Removed driver code from ScheduleDAG. clearDAG is the only interface needed.

- Added enterRegion/exitRegion hooks to ScheduleDAGInstrs to delimit the scope of each scheduling region and associated DAG. They should be used to setup and cleanup any region-specific state in addition to the DAG itself. This is necessary because we reuse the same ScheduleDAG object for the entire function. The target may extend these hooks to do things at regions boundaries, like bundle terminators. The hooks are called even if we decide not to schedule the region. So all instructions in a block are "covered" by these calls.

- Added ScheduleDAGInstrs::begin()/end() public API.

- Moved Sequence into the driver layer, which is specific to the scheduling algorithm.

llvm-svn: 152208

60cf03e7

misched preparation: modularize schedule emission. · e932bb77
Andrew Trick authored Mar 07, 2012
```
ScheduleDAG has nothing to do with how the instructions are scheduled.

llvm-svn: 152206
```
e932bb77
misched preparation: modularize schedule printing. · edee68ce
Andrew Trick authored Mar 07, 2012
```
ScheduleDAG will not refer to the scheduled instruction sequence.

llvm-svn: 152205
```
edee68ce
misched preparation: modularize schedule verification. · 46a58664
Andrew Trick authored Mar 07, 2012
```
ScheduleDAG will not refer to the scheduled instruction sequence.

llvm-svn: 152204
```
46a58664

Mar 05, 2012
- Convert more GenRegisterInfo tables from unsigned to uint16_t to reduce static data size. · 4b02a29e
  Craig Topper authored Mar 05, 2012
```
llvm-svn: 152016
```
  4b02a29e
Feb 23, 2012

BitVectorize loop. · ef8bf395
Benjamin Kramer authored Feb 23, 2012
```
llvm-svn: 151274
```
ef8bf395
post-ra-sched: Turn the KillIndices vector into a bitvector, it only stored two meaningful states. · 796fd469
Benjamin Kramer authored Feb 23, 2012
```
Rename it to LiveRegs to make it more clear what's stored inside.

llvm-svn: 151273
```
796fd469

post-ra-sched: Replace a std::set of regs with a bitvector. · 21974b1f

Benjamin Kramer authored Feb 23, 2012

Assuming that a single std::set node adds 3 control words, a bitvector
can store (3*8+4)*8=224 registers in the allocated memory of a single
element in the std::set (x86_64). Also we don't have to call malloc
for every register added.

llvm-svn: 151269

21974b1f

Make calls scheduling boundaries post-ra. · a793a59f

Jakob Stoklund Olesen authored Feb 23, 2012

Before register allocation, instructions can be moved across calls in
order to reduce register pressure. After register allocation, we don't
gain a lot by moving callee-saved defs across calls. In fact, since the
scheduler doesn't have a good idea how registers are used in the callee,
it can't really make good scheduling decisions.

This changes the schedule in two ways: 1. Latencies to call uses and
defs are no longer accounted for, causing some random shuffling around
calls. This isn't really a problem since those uses and defs are
inaccurate proxies for what happens inside the callee. They don't
represent registers used by the call instruction itself.

2. Instructions are no longer moved across calls. This didn't happen
very often, and the scheduling decision was made on dubious information
anyway.

As with any scheduling change, benchmark numbers shift around a bit,
but there is no positive or negative trend from this change.

This makes the post-ra scheduler 5% faster for ARM targets.

The secret motivation for this patch is the introduction of register
mask operands representing call clobbers. The most efficient way of
handling regmasks in ScheduleDAGInstrs is to model them as barriers for
physreg live ranges, but not for virtreg live ranges. That's fine
pre-ra, but post-ra it would have the same effect as this patch.

llvm-svn: 151265

a793a59f

Handle regmasks in FixupKills. · 28d4803a
Jakob Stoklund Olesen authored Feb 23, 2012
```
llvm-svn: 151226
```
28d4803a

Feb 22, 2012

Make all pointers to TargetRegisterClass const since they are all pointers to... · 760b134f

Craig Topper authored Feb 22, 2012

Make all pointers to TargetRegisterClass const since they are all pointers to static data that should not be modified.

llvm-svn: 151134

760b134f

Feb 08, 2012

Codegen pass definition cleanup. No functionality. · 1fa5bcbe

Andrew Trick authored Feb 08, 2012

Moving toward a uniform style of pass definition to allow easier target configuration.
Globally declare Pass ID.
Globally declare pass initializer.
Use INITIALIZE_PASS consistently.
Add a call to the initializer from CodeGen.cpp.
Remove redundant "createPass" functions and "getPassName" methods.

While cleaning up declarations, cleaned up comments (sorry for large diff).

llvm-svn: 150100

1fa5bcbe

Move pass configuration out of pass constructors: PostRAScheduler. · df7e3769
Andrew Trick authored Feb 08, 2012
```
llvm-svn: 150096
```
df7e3769

Jan 14, 2012
- misched: Added ScheduleDAGInstrs::IsPostRA · 1d028a36
  Andrew Trick authored Jan 14, 2012
```
llvm-svn: 148172
```
  1d028a36
Dec 14, 2011

- Add MachineInstrBundle.h and MachineInstrBundle.cpp. This includes a function · 7fae11b2

Evan Cheng authored Dec 14, 2011

  to finalize MI bundles (i.e. add BUNDLE instruction and computing register def
  and use lists of the BUNDLE instruction) and a pass to unpack bundles.
- Teach more of MachineBasic and MachineInstr methods to be bundle aware.
- Switch Thumb2 IT block to MI bundles and delete the hazard recognizer hack to
  prevent IT blocks from being broken apart.

llvm-svn: 146542

7fae11b2

Dec 07, 2011

Add bundle aware API for querying instruction properties and switch the code · 7f8e563a

Evan Cheng authored Dec 07, 2011

generator to it. For non-bundle instructions, these behave exactly the same
as the MC layer API.

For properties like mayLoad / mayStore, look into the bundle and if any of the
bundled instructions has the property it would return true.
For properties like isPredicable, only return true if *all* of the bundled
instructions have the property.
For properties like canFoldAsLoad, isCompare, conservatively return false for
bundles.

llvm-svn: 146026

7f8e563a

Nov 15, 2011
- Remove all remaining uses of Value::getNameStr(). · 1f97a5a6
  Benjamin Kramer authored Nov 15, 2011
```
llvm-svn: 144648
```
  1f97a5a6
Jul 01, 2011
- Rename TargetSubtarget to TargetSubtargetInfo for consistency. · 0d639a28
  Evan Cheng authored Jul 01, 2011
```
llvm-svn: 134259
```
  0d639a28
Jun 16, 2011
- Teach antidependency breakers to use RegisterClassInfo. · 4f5f84c7
  Jakob Stoklund Olesen authored Jun 16, 2011
```
No functional change was intended.

llvm-svn: 133202
```
  4f5f84c7
Jun 02, 2011
- Update DBG_VALUEs while breaking anti dependencies. · f02a376f
  Devang Patel authored Jun 02, 2011
```
llvm-svn: 132487
```
  f02a376f
Jun 01, 2011

Add an issue width check to the postRA scheduler. Patch by Max Kazakov! · 18c9b37a

Andrew Trick authored Jun 01, 2011

For targets with no itinerary (x86) it is a nop by default. For
targets with issue width already expressed in the itinerary (ARM) it
bypasses a scoreboard check but otherwise does not affect the
schedule. It does make the code more consistent and complete and
allows new targets to specify their issue width in an arbitrary way.

llvm-svn: 132385

18c9b37a

May 06, 2011

Typo: Reviewed by Alistair. · 84f9ad9b
Andrew Trick authored May 06, 2011
```
llvm-svn: 131001
```
84f9ad9b

Post-RA scheduler compile time fix. Quadratic computation of DAG node depth. · aab77fe5

Andrew Trick authored May 06, 2011

The post-ra scheduler was explicitly updating the depth of a node's
successors after scheduling it, regardless of whether the successor
was ready. This is quadratic for DAGs with transitively redundant
edges. I simply removed the useless update of depth, which is lazilly
computed later.
Fixes <rdar://problem/9044332> compiler takes way too long to build TextInput.

llvm-svn: 130992

aab77fe5

Dec 24, 2010

Various bits of framework needed for precise machine-level selection · 10ffc2b6

Andrew Trick authored Dec 24, 2010

DAG scheduling during isel. Most new functionality is currently
guarded by -enable-sched-cycles and -enable-sched-hazard.

Added InstrItineraryData::IssueWidth field, currently derived from
ARM itineraries, but could be initialized differently on other targets.

Added ScheduleHazardRecognizer::MaxLookAhead to indicate whether it is
active, and if so how many cycles of state it holds.

Added SchedulingPriorityQueue::HasReadyFilter to allowing gating entry
into the scheduler's available queue.

ScoreboardHazardRecognizer now accesses the ScheduleDAG in order to
get information about it's SUnits, provides RecedeCycle for bottom-up
scheduling, correctly computes scoreboard depth, tracks IssueCount, and
considers potential stall cycles when checking for hazards.

ScheduleDAGRRList now models machine cycles and hazards (under
flags). It tracks MinAvailableCycle, drives the hazard recognizer and
priority queue's ready filter, manages a new PendingQueue, properly
accounts for stall cycles, etc.

llvm-svn: 122541

10ffc2b6

Sep 10, 2010

Teach if-converter to be more careful with predicating instructions that would · bf407075

Evan Cheng authored Sep 10, 2010

take multiple cycles to decode.
For the current if-converter clients (actually only ARM), the instructions that
are predicated on false are not nops. They would still take machine cycles to
decode. Micro-coded instructions such as LDM / STM can potentially take multiple
cycles to decode. If-converter should take treat them as non-micro-coded
simple instructions.

llvm-svn: 113570

bf407075

Aug 06, 2010
- Reapply r110396, with fixes to appease the Linux buildbot gods. · a7aed186
  Owen Anderson authored Aug 06, 2010
```
llvm-svn: 110460
```
  a7aed186
- Revert r110396 to fix buildbots. · bda59bd2
  Owen Anderson authored Aug 06, 2010
```
llvm-svn: 110410
```
  bda59bd2
- Don't use PassInfo* as a type identifier for passes. Instead, use the address of the static · 755aceb5
  Owen Anderson authored Aug 05, 2010
```
ID member as the sole unique type identifier.  Clean up APIs related to this change.

llvm-svn: 110396
```
  755aceb5
Jul 15, 2010
- Use std::vector instead of TargetRegisterInfo::FirstVirtualRegister. · dd5e9d8f
  Bill Wendling authored Jul 15, 2010
```
llvm-svn: 108450
```
  dd5e9d8f
Jun 19, 2010

Allow ARM if-converter to be run after post allocation scheduling. · 2d51c7c5

Evan Cheng authored Jun 18, 2010

- This fixed a number of bugs in if-converter, tail merging, and post-allocation
  scheduler. If-converter now runs branch folding / tail merging first to
  maximize if-conversion opportunities.
- Also changed the t2IT instruction slightly. It now defines the ITSTATE
  register which is read by instructions in the IT block.
- Added Thumb2 specific hazard recognizer to ensure the scheduler doesn't
  change the instruction ordering in the IT block (since IT mask has been
  finalized). It also ensures no other instructions can be scheduled between
  instructions in the IT block.

This is not yet enabled.

llvm-svn: 106344

2d51c7c5

Jun 14, 2010

- Do away with SimpleHazardRecognizer.h. It's not used and offers little value. · 078f4cec

Evan Cheng authored Jun 14, 2010

- Rename ExactHazardRecognizer to PostRAHazardRecognizer and move its header to include to allow targets to extend it.

llvm-svn: 105959

078f4cec

Jun 12, 2010
- Allow target to provide its own hazard recognizer to post-ra scheduler. · e60273fd
  Evan Cheng authored Jun 12, 2010
```
llvm-svn: 105862
```
  e60273fd