- Apr 18, 2017
-
-
Matt Arsenault authored
llvm-svn: 300596
-
- Apr 17, 2017
-
-
Matt Arsenault authored
Avoid looping through program to determine register counts. This avoids needing to look at regmask operands. Also fixes some counting errors with flat_scr when there are no stack objects. llvm-svn: 300482
-
- Apr 12, 2017
-
-
Matt Arsenault authored
Split into smaller functions and prepare for handling non-entry functions. llvm-svn: 299998
-
- Mar 31, 2017
-
-
Matt Arsenault authored
What we really want to do is distinguish functions that may be called by other functions, and graphics shaders are not called kernels. llvm-svn: 299140
-
- Mar 27, 2017
-
-
Yaxun Liu authored
As we introduced target triple environment amdgiz and amdgizcl, the address space values are no longer enums. We have to decide the value by target triple. The basic idea is to use struct AMDGPUAS to represent address space values. For address space values which are not depend on target triple, use static const members, so that they don't occupy extra memory space and is equivalent to a compile time constant. Since the struct is lightweight and cheap, it can be created on the fly at the point of usage. Or it can be added as member to a pass and created at the beginning of the run* function. Differential Revision: https://reviews.llvm.org/D31284 llvm-svn: 298846
-
- Mar 23, 2017
-
-
Konstantin Zhuravlyov authored
- It was decided to expose this information through other means (rocr) Differential Revision: https://reviews.llvm.org/D30970 llvm-svn: 298560
-
- Mar 22, 2017
-
-
Konstantin Zhuravlyov authored
- These are not required for low level runtime Differential Revision: https://reviews.llvm.org/D29949 llvm-svn: 298556
-
Konstantin Zhuravlyov authored
- Rename runtime metadata -> code object metadata - Make metadata not flow - Switch enums to use ScalarEnumerationTraits - Cleanup and move AMDGPUCodeObjectMetadata.h to AMDGPU/MCTargetDesc - Introduce in-memory representation for attributes - Code object metadata streamer - Create metadata for isa and printf during EmitStartOfAsmFile - Create metadata for kernel during EmitFunctionBodyStart - Finalize and emit metadata to .note during EmitEndOfAsmFile - Other minor improvements/bug fixes Differential Revision: https://reviews.llvm.org/D29948 llvm-svn: 298552
-
- Feb 22, 2017
-
-
Matt Arsenault authored
Change implementation to use max instead of add. min/max/med3 do not flush denormals regardless of the mode, so it is OK to use it whether or not they are enabled. Also allow using clamp with f16, and use knowledge of dx10_clamp. llvm-svn: 295788
-
- Feb 18, 2017
-
-
Matt Arsenault authored
llvm-svn: 295554
-
- Feb 10, 2017
-
-
Wei Ding authored
Differential Revision: http://reviews.llvm.org/D26010 llvm-svn: 294692
-
- Feb 08, 2017
-
-
Konstantin Zhuravlyov authored
Differential Revision: https://reviews.llvm.org/D28760#fb670e28 llvm-svn: 294449
-
Konstantin Zhuravlyov authored
Differential Revision: https://reviews.llvm.org/D29633 llvm-svn: 294441
-
Konstantin Zhuravlyov authored
Differential Revision: https://reviews.llvm.org/D29318 llvm-svn: 294440
-
- Jan 27, 2017
-
-
Konstantin Zhuravlyov authored
Differential Revision: https://reviews.llvm.org/D29224 llvm-svn: 293318
-
- Jan 25, 2017
-
-
Tom Stellard authored
Summary: This lets you select which sort of spilling you want, either s[0:1] or 64-bit loads from s[0:1]. Patch By: Dave Airlie Reviewers: nhaehnle, arsenm, tstellarAMD Reviewed By: arsenm Subscribers: mareko, llvm-commits, kzhuravl, wdng, yaxunl, tony-tye Differential Revision: https://reviews.llvm.org/D25428 llvm-svn: 293000
-
- Jan 06, 2017
-
-
Konstantin Zhuravlyov authored
Differential Revision: https://reviews.llvm.org/D27732 llvm-svn: 291245
-
- Dec 19, 2016
-
-
Sam Kolton authored
Summary: Added pair of directives .hsa_code_object_metadata/.end_hsa_code_object_metadata. Between them user can put YAML string that would be directly put to the generated note. E.g.: ''' .hsa_code_object_metadata { amd.MDVersion: [ 2, 0 ] } .end_hsa_code_object_metadata ''' Based on D25046 Reviewers: vpykhtin, nhaustov, yaxunl, tstellarAMD Subscribers: arsenm, kzhuravl, wdng, nhaehnle, mgorny, tony-tye Differential Revision: https://reviews.llvm.org/D27619 llvm-svn: 290097
-
- Dec 14, 2016
-
-
Yaxun Liu authored
Differential Revision: https://reviews.llvm.org/D25046 llvm-svn: 289674
-
- Dec 09, 2016
-
-
Marek Olsak authored
Summary: This frees 2 scalar registers. Reviewers: tstellarAMD Subscribers: qcolombet, arsenm, kzhuravl, wdng, nhaehnle, yaxunl, tony-tye Differential Revision: https://reviews.llvm.org/D27150 llvm-svn: 289261
-
Marek Olsak authored
Summary: There is no point in setting SGPRS=104, because VI allocates SGPRs in multiples of 16, so 104 -> 112. That enables us to use all 102 SGPRs for general purposes. Reviewers: tstellarAMD Subscribers: qcolombet, arsenm, kzhuravl, wdng, nhaehnle, yaxunl, tony-tye Differential Revision: https://reviews.llvm.org/D27149 llvm-svn: 289260
-
- Dec 06, 2016
-
-
Tom Stellard authored
Reviewers: arsenm Subscribers: kzhuravl, wdng, nhaehnle, yaxunl, llvm-commits, tony-tye Differential Revision: https://reviews.llvm.org/D27416 llvm-svn: 288852
-
- Nov 10, 2016
-
-
Yaxun Liu authored
Currently runtime metadata is emitted as an ELF section with name .AMDGPU.runtime_metadata. However there is a standard way to convey vendor specific information about how to run an ELF binary, which is called vendor-specific note element (http://www.netbsd.org/docs/kernel/elf-notes.html). This patch lets AMDGPU backend emits runtime metadata as a note element in .note section. Differential Revision: https://reviews.llvm.org/D25781 llvm-svn: 286502
-
- Oct 28, 2016
-
-
Matt Arsenault authored
This is possible when using inline asm. llvm-svn: 285447
-
- Oct 20, 2016
-
-
Wei Ding authored
respectively. Differential Revision: http://reviews.llvm.org/D25789 llvm-svn: 284655
-
- Oct 10, 2016
-
-
Mehdi Amini authored
This avoids "static initialization order fiasco" Differential Revision: https://reviews.llvm.org/D25412 llvm-svn: 283702
-
- Oct 06, 2016
-
-
Matt Arsenault authored
Fix bad merge llvm-svn: 283470
-
Matt Arsenault authored
llvm-svn: 283469
-
Matt Arsenault authored
Make the necessary refactorings to make use of PseudoInstExpansion llvm-svn: 283467
-
Matt Arsenault authored
AMDGPU needs to expand unconditional branches in a new block with an indirect branch. llvm-svn: 283464
-
Matt Arsenault authored
These ones need to have the size on the pseudo instruction set for getInstSizeInBytes to work correctly. These also have a statically known size. llvm-svn: 283437
-
- Oct 01, 2016
-
-
Mehdi Amini authored
llvm-svn: 283004
-
- Sep 26, 2016
-
-
Tom Stellard authored
Summary: We need to call AsmPrinter::getNameWithPrefix() in order to handle anonymous GlobalValues (e.g. @0, @1). Reviewers: arsenm, b-sumner Subscribers: arsenm, kzhuravl, wdng, nhaehnle, yaxunl, tony-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D24865 llvm-svn: 282420
-
- Sep 23, 2016
-
-
Tom Stellard authored
Reviewers: arsenm Subscribers: arsenm, kzhuravl, wdng, nhaehnle, yaxunl, llvm-commits, tony-tye Differential Revision: https://reviews.llvm.org/D24835 llvm-svn: 282223
-
- Sep 16, 2016
-
-
Tom Stellard authored
Summary: mesa3d will use the same kernel calling convention as amdhsa, but it will handle everything else like the default 'unknown' OS type. Reviewers: arsenm Subscribers: arsenm, llvm-commits, kzhuravl Differential Revision: https://reviews.llvm.org/D22783 llvm-svn: 281779
-
- Sep 07, 2016
-
-
Yaxun Liu authored
llvm-svn: 280841
-
Yaxun Liu authored
OpenCL kernels have hidden kernel arguments for global offset and printf buffer. For consistency, these hidden argument should be included in the runtime metadata. Also updated kernel argument kind metadata. Differential Revision: https://reviews.llvm.org/D23424 llvm-svn: 280829
-
- Sep 06, 2016
-
-
Konstantin Zhuravlyov authored
- Implemented amdgpu-flat-work-group-size attribute - Implemented amdgpu-num-active-waves-per-eu attribute - Implemented amdgpu-num-sgpr attribute - Implemented amdgpu-num-vgpr attribute - Dynamic LDS constraints are in a separate patch Patch by Tom Stellard and Konstantin Zhuravlyov Differential Revision: https://reviews.llvm.org/D21562 llvm-svn: 280747
-
- Sep 01, 2016
-
-
Yaxun Liu authored
Add runtime metdata for pointee alignment of pointer type kernel argument. The key is KeyArgPointeeAlign and the value is a 32 bit unsigned integer. Differential Revision: https://reviews.llvm.org/D24145 llvm-svn: 280399
-
- Aug 17, 2016
-
-
Justin Bogner authored
Follow up to r278902. I had missed "fall through", with a space. llvm-svn: 278970
-