Commits · be8dc6499ac00b2e5a608dbcec3ecb17348299e0 · Lorenzo Albano / LLVM bpEVL

Feb 12, 2013

[NVPTX] Disable vector registers · be8dc649

Justin Holewinski authored Feb 12, 2013

Vectors were being manually scalarized by the backend.  Instead,
let the target-independent code do all of the work.  The manual
scalarization was from a time before good target-independent support
for scalarization in LLVM. However, this forces us to specially-handle
vector loads and stores, which we can turn into PTX instructions that
produce/consume multiple operands.

llvm-svn: 174968

be8dc649

Feb 09, 2013
- [NVPTX] Make address space errors more explicit (llvm_unreachable -> report_fatal_error) · 36a50991
  Justin Holewinski authored Feb 09, 2013
  
  llvm-svn: 174808
  36a50991
Jan 31, 2013
- [PEI] Pass the frame index operand number to the eliminateFrameIndex function. · df782d22
  Chad Rosier authored Jan 31, 2013
  
  Each target implementation was needlessly recomputing the index. Part of rdar://13076458 llvm-svn: 174083
  df782d22
Jan 23, 2013

Clean up assignment of CalleeSaveStackSlotSize: get rid of the default and... · 32aab221

Eli Bendersky authored Jan 23, 2013

Clean up assignment of CalleeSaveStackSlotSize: get rid of the default and explicitly set this in every target that needs to change it from the default.

llvm-svn: 173270

32aab221

NVPTX: Stop leaking memory by using a managed constant instead of a new Argument. · c4231cc9

Benjamin Kramer authored Jan 23, 2013

This is still an egregious hack since we don't have a nice interface for this
kind of thing but should help the valgrind leak check buildbot to become green.

llvm-svn: 173267

c4231cc9

Jan 07, 2013

Switch TargetTransformInfo from an immutable analysis pass that requires · 664e354d

Chandler Carruth authored Jan 07, 2013

a TargetMachine to construct (and thus isn't always available), to an
analysis group that supports layered implementations much like
AliasAnalysis does. This is a pretty massive change, with a few parts
that I was unable to easily separate (sorry), so I'll walk through it.

The first step of this conversion was to make TargetTransformInfo an
analysis group, and to sink the nonce implementations in
ScalarTargetTransformInfo and VectorTargetTranformInfo into
a NoTargetTransformInfo pass. This allows other passes to add a hard
requirement on TTI, and assume they will always get at least on
implementation.

The TargetTransformInfo analysis group leverages the delegation chaining
trick that AliasAnalysis uses, where the base class for the analysis
group delegates to the previous analysis *pass*, allowing all but tho
NoFoo analysis passes to only implement the parts of the interfaces they
support. It also introduces a new trick where each pass in the group
retains a pointer to the top-most pass that has been initialized. This
allows passes to implement one API in terms of another API and benefit
when some other pass above them in the stack has more precise results
for the second API.

The second step of this conversion is to create a pass that implements
the TargetTransformInfo analysis using the target-independent
abstractions in the code generator. This replaces the
ScalarTargetTransformImpl and VectorTargetTransformImpl classes in
lib/Target with a single pass in lib/CodeGen called
BasicTargetTransformInfo. This class actually provides most of the TTI
functionality, basing it upon the TargetLowering abstraction and other
information in the target independent code generator.

The third step of the conversion adds support to all TargetMachines to
register custom analysis passes. This allows building those passes with
access to TargetLowering or other target-specific classes, and it also
allows each target to customize the set of analysis passes desired in
the pass manager. The baseline LLVMTargetMachine implements this
interface to add the BasicTTI pass to the pass manager, and all of the
tools that want to support target-aware TTI passes call this routine on
whatever target machine they end up with to add the appropriate passes.

The fourth step of the conversion created target-specific TTI analysis
passes for the X86 and ARM backends. These passes contain the custom
logic that was previously in their extensions of the
ScalarTargetTransformInfo and VectorTargetTransformInfo interfaces.
I separated them into their own file, as now all of the interface bits
are private and they just expose a function to create the pass itself.
Then I extended these target machines to set up a custom set of analysis
passes, first adding BasicTTI as a fallback, and then adding their
customized TTI implementations.

The fourth step required logic that was shared between the target
independent layer and the specific targets to move to a different
interface, as they no longer derive from each other. As a consequence,
a helper functions were added to TargetLowering representing the common
logic needed both in the target implementation and the codegen
implementation of the TTI pass. While technically this is the only
change that could have been committed separately, it would have been
a nightmare to extract.

The final step of the conversion was just to delete all the old
boilerplate. This got rid of the ScalarTargetTransformInfo and
VectorTargetTransformInfo classes, all of the support in all of the
targets for producing instances of them, and all of the support in the
tools for manually constructing a pass based around them.

Now that TTI is a relatively normal analysis group, two things become
straightforward. First, we can sink it into lib/Analysis which is a more
natural layer for it to live. Second, clients of this interface can
depend on it *always* being available which will simplify their code and
behavior. These (and other) simplifications will follow in subsequent
commits, this one is clearly big enough.

Finally, I'm very aware that much of the comments and documentation
needs to be updated. As soon as I had this working, and plausibly well
commented, I wanted to get it committed and in front of the build bots.
I'll be doing a few passes over documentation later if it sticks.

Commits to update DragonEgg and Clang will be made presently.

llvm-svn: 171681

664e354d

Jan 02, 2013

Move all of the header files which are involved in modelling the LLVM IR · 9fb823bb

Chandler Carruth authored Jan 02, 2013

into their new header subdirectory: include/llvm/IR. This matches the
directory structure of lib, and begins to correct a long standing point
of file layout clutter in LLVM.

There are still more header files to move here, but I wanted to handle
them in separate commits to make tracking what files make sense at each
layer easier.

The only really questionable files here are the target intrinsic
tablegen files. But that's a battle I'd rather not fight today.

I've updated both CMake and Makefile build systems (I think, and my
tests think, but I may have missed something).

I've also re-sorted the includes throughout the project. I'll be
committing updates to Clang, DragonEgg, and Polly momentarily.

llvm-svn: 171366

9fb823bb

Dec 30, 2012
- convert a bunch of callers from DataLayout::getIndexedOffset() to GEP::accumulateConstantOffset(). · b6ad9822
  Nuno Lopes authored Dec 30, 2012
  
  The later API is nicer than the former, and is correct regarding wrap-around offsets (if anyone cares). There are a few more places left with duplicated code, which I'll remove soon. llvm-svn: 171259
  b6ad9822
- Use the predicate methods off of AttributeSet instead of Attribute. · 749a43d8
  Bill Wendling authored Dec 30, 2012
  
  llvm-svn: 171257
  749a43d8
Dec 21, 2012
- Remove duplicate includes. · a229186a
  Roman Divacky authored Dec 21, 2012
  
  llvm-svn: 170902
  a229186a
Dec 20, 2012
- MachineInstrBuilderize NVPTX. · 4255c96a
  Jakob Stoklund Olesen authored Dec 20, 2012
  
  llvm-svn: 170794
  4255c96a
Dec 19, 2012
- Rename the 'Attributes' class to 'Attribute'. It's going to represent a single... · 3d7b0b8a
  Bill Wendling authored Dec 19, 2012
  
  Rename the 'Attributes' class to 'Attribute'. It's going to represent a single attribute in the future. llvm-svn: 170502
  3d7b0b8a
Dec 13, 2012
- Add a way of printing out an arbitrary label name for a section · 80882db8
  Eric Christopher authored Dec 13, 2012
  
  given the section. llvm-svn: 170087
  80882db8
Dec 08, 2012
- s/AttrListPtr/AttributeSet/g to better label what this class is going to be in the near future. · e94d843e
  Bill Wendling authored Dec 07, 2012
  
  llvm-svn: 169651
  e94d843e
Dec 05, 2012
- [NVPTX] Fix crash with unnamed struct arguments · fb711156
  Justin Holewinski authored Dec 05, 2012
  
  Patch by Eric Holk llvm-svn: 169418
  fb711156
Dec 04, 2012

Sort includes for all of the .h files under the 'lib' tree. These were · 802d7555

Chandler Carruth authored Dec 04, 2012

missed in the first pass because the script didn't yet handle include
guards.

Note that the script is now able to handle all of these headers without
manual edits. =]

llvm-svn: 169224

802d7555

Dec 03, 2012

Use the new script to sort the includes of every file under lib. · ed0881b2

Chandler Carruth authored Dec 03, 2012

Sooooo many of these had incorrect or strange main module includes.
I have manually inspected all of these, and fixed the main module
include to be the nearest plausible thing I could find. If you own or
care about any of these source files, I encourage you to take some time
and check that these edits were sensible. I can't have broken anything
(I strictly added headers, and reordered them, never removed), but they
may not be the headers you'd really like to identify as containing the
API being implemented.

Many forward declarations and missing includes were added to a header
files to allow them to parse cleanly when included first. The main
module rule does in fact have its merits. =]

llvm-svn: 169131

ed0881b2

Nov 29, 2012

Allow targets to prefer TypeSplitVector over TypePromoteInteger when computing... · bc45119b

Justin Holewinski authored Nov 29, 2012

Allow targets to prefer TypeSplitVector over TypePromoteInteger when computing the legalization method for vectors

For some targets, it is desirable to prefer scalarizing <N x i1> instead of promoting to a larger legal type, such as <N x i32>.

llvm-svn: 168882

bc45119b

Nov 16, 2012
- [NVPTX] Order global variables in def-use order before emiting them in the final assembly · 2c5ac70d
  Justin Holewinski authored Nov 16, 2012
  
  llvm-svn: 168198
  2c5ac70d
Nov 15, 2012
- NVPTXISelLowering.cpp: Fix warnings. [-Wunused-variable] · 5bbe0e18
  NAKAMURA Takumi authored Nov 14, 2012
  
  llvm-svn: 168001
  5bbe0e18
Nov 14, 2012
- Fix invalid asserts, use llvm_unreachable instead. · d17df318
  Jakub Staszak authored Nov 14, 2012
  
  llvm-svn: 167976
  d17df318
- [NVPTX] Implement custom lowering of loads/stores for i1 · c6462aac
  Justin Holewinski authored Nov 14, 2012
  
  Loads from i1 become loads from i8 followed by trunc Stores to i1 become zext to i8 followed by store to i8 Fixes PR13291 llvm-svn: 167948
  c6462aac
Nov 12, 2012

Remove unused field. · 16631130
Eric Christopher authored Nov 12, 2012
```
llvm-svn: 167719
```
16631130

[NVPTX] Add more precise PTX/SM target attributes · 1812ee9a

Justin Holewinski authored Nov 12, 2012

Each SM and PTX version is modeled as a subtarget feature/CPU. Additionally,
PTX 3.1 is added as the default PTX version to be out-of-the-box compatible
with CUDA 5.0.

Available CPUs for this target:

  sm_10 - Select the sm_10 processor.
  sm_11 - Select the sm_11 processor.
  sm_12 - Select the sm_12 processor.
  sm_13 - Select the sm_13 processor.
  sm_20 - Select the sm_20 processor.
  sm_21 - Select the sm_21 processor.
  sm_30 - Select the sm_30 processor.
  sm_35 - Select the sm_35 processor.

Available features for this target:

  ptx30 - Use PTX version 3.0.
  ptx31 - Use PTX version 3.1.
  sm_10 - Target SM 1.0.
  sm_11 - Target SM 1.1.
  sm_12 - Target SM 1.2.
  sm_13 - Target SM 1.3.
  sm_20 - Target SM 2.0.
  sm_21 - Target SM 2.1.
  sm_30 - Target SM 3.0.
  sm_35 - Target SM 3.5.

llvm-svn: 167699

1812ee9a

Nov 10, 2012
- [NVPTX] Use ABI alignment for parameters when alignment is not specified. · 2dc9d072
  Justin Holewinski authored Nov 09, 2012
  
  Affects SM 2.0+. Fixes bug 13324. llvm-svn: 167646
  2dc9d072
Nov 01, 2012

Revert the majority of the next patch in the address space series: · 5da3f051

Chandler Carruth authored Nov 01, 2012

r165941: Resubmit the changes to llvm core to update the functions to
         support different pointer sizes on a per address space basis.

Despite this commit log, this change primarily changed stuff outside of
VMCore, and those changes do not carry any tests for correctness (or
even plausibility), and we have consistently found questionable or flat
out incorrect cases in these changes. Most of them are probably correct,
but we need to devise a system that makes it more clear when we have
handled the address space concerns correctly, and ideally each pass that
gets updated would receive an accompanying test case that exercises that
pass specificaly w.r.t. alternate address spaces.

However, from this commit, I have retained the new C API entry points.
Those were an orthogonal change that probably should have been split
apart, but they seem entirely good.

In several places the changes were very obvious cleanups with no actual
multiple address space code added; these I have not reverted when
I spotted them.

In a few other places there were merge conflicts due to a cleaner
solution being implemented later, often not using address spaces at all.
In those cases, I've preserved the new code which isn't address space
dependent.

This is part of my ongoing effort to clean out the partial address space
code which carries high risk and low test coverage, and not likely to be
finished before the 3.2 release looms closer. Duncan and I would both
like to see the above issues addressed before we return to these
changes.

llvm-svn: 167222

5da3f051

Revert the series of commits starting with r166578 which introduced the · 7ec5085e

Chandler Carruth authored Nov 01, 2012

getIntPtrType support for multiple address spaces via a pointer type,
and also introduced a crasher bug in the constant folder reported in
PR14233.

These commits also contained several problems that should really be
addressed before they are re-committed. I have avoided reverting various
cleanups to the DataLayout APIs that are reasonable to have moving
forward in order to reduce the amount of churn, and minimize the number
of commits that were reverted. I've also manually updated merge
conflicts and manually arranged for the getIntPtrType function to stay
in DataLayout and to be defined in a plausible way after this revert.

Thanks to Duncan for working through this exact strategy with me, and
Nick Lewycky for tracking down the really annoying crasher this
triggered. (Test case to follow in its own commit.)

After discussing with Duncan extensively, and based on a note from
Micah, I'm going to continue to back out some more of the more
problematic patches in this series in order to ensure we go into the
LLVM 3.2 branch with a reasonable story here. I'll send a note to
llvmdev explaining what's going on and why.

Summary of reverted revisions:

r166634: Fix a compiler warning with an unused variable.
r166607: Add some cleanup to the DataLayout changes requested by
         Chandler.
r166596: Revert "Back out r166591, not sure why this made it through
         since I cancelled the command. Bleh, sorry about this!
r166591: Delete a directory that wasn't supposed to be checked in yet.
r166578: Add in support for getIntPtrType to get the pointer type based
         on the address space.
llvm-svn: 167221

7ec5085e

Oct 24, 2012
- Add some cleanup to the DataLayout changes requested by Chandler. · bf3eeb2d
  Micah Villmow authored Oct 24, 2012
  
  llvm-svn: 166607
  bf3eeb2d
- Implement a basic VectorTargetTransformInfo interface to be used by the loop... · 2289f2c9
  Nadav Rotem authored Oct 24, 2012
  
  Implement a basic VectorTargetTransformInfo interface to be used by the loop and bb vectorizers for modeling the cost of instructions. llvm-svn: 166593
  2289f2c9
- Add in support for getIntPtrType to get the pointer type based on the address space. · 12d91278
  Micah Villmow authored Oct 24, 2012
  
  This checkin also adds in some tests that utilize these paths and updates some of the clients. llvm-svn: 166578
  12d91278
Oct 19, 2012
- Reapply the TargerTransformInfo changes, minus the changes to LSR and Lowerinvoke. · 5dc203e8
  Nadav Rotem authored Oct 18, 2012
  
  llvm-svn: 166248
  5dc203e8
Oct 18, 2012

Temporarily revert the TargetTransform changes. · d6d9ccca

Bob Wilson authored Oct 18, 2012

The TargetTransform changes are breaking LTO bootstraps of clang.  I am
working with Nadav to figure out the problem, but I am reverting it for now
to get our buildbots working.

This reverts svn commits: 165665 165669 165670 165786 165787 165997
and I have also reverted clang svn 165741

llvm-svn: 166168

d6d9ccca

Oct 15, 2012

Resubmit the changes to llvm core to update the functions to support different... · 4bb926d9

Micah Villmow authored Oct 15, 2012

Resubmit the changes to llvm core to update the functions to support different pointer sizes on a per address space basis.

llvm-svn: 165941

4bb926d9

Oct 11, 2012

Revert 165732 for further review. · 0c61134d
Micah Villmow authored Oct 11, 2012
```
llvm-svn: 165747
```
0c61134d

Add in the first iteration of support for llvm/clang/lldb to allow variable... · 08318973

Micah Villmow authored Oct 11, 2012

Add in the first iteration of support for llvm/clang/lldb to allow variable per address space pointer sizes to be optimized correctly.

llvm-svn: 165726

08318973

· e1032873

Nadav Rotem authored Oct 10, 2012

Add a new interface to allow IR-level passes to access codegen-specific information.

llvm-svn: 165665

e1032873

Oct 09, 2012

Create enums for the different attributes. · c9b22d73

Bill Wendling authored Oct 09, 2012

We use the enums to query whether an Attributes object has that attribute. The
opaque layer is responsible for knowing where that specific attribute is stored.

llvm-svn: 165488

c9b22d73

Oct 08, 2012
- Move TargetData to DataLayout. · cdfe20b9
  Micah Villmow authored Oct 08, 2012
  
  llvm-svn: 165402
  cdfe20b9
Oct 04, 2012
- Use new accessor methods to query for attributes. · b0a290ef
  Bill Wendling authored Oct 04, 2012
  
  llvm-svn: 165205
  b0a290ef
Jul 02, 2012

Extend TargetPassConfig to allow running only a subset of the normal passes. · cac3b906

Bob Wilson authored Jul 02, 2012

This is still a work in progress but I believe it is currently good enough
to fix PR13122 "Need unit test driver for codegen IR passes". For example,
you can run llc with -stop-after=loop-reduce to have it dump out the IR after
running LSR. Serializing machine-level IR is not yet supported but we have
some patches in progress for that.

The plan is to serialize the IR to a YAML file, containing separate sections
for the LLVM IR, machine-level IR, and whatever other info is needed. Chad
suggested that we stash the stop-after pass in the YAML file and use that
instead of the start-after option to figure out where to restart the
compilation. I think that's a great idea, but since it's not implemented yet
I put the -start-after option into this patch for testing purposes.

llvm-svn: 159570

cac3b906