Commits · 4dd14fb5eb6cd5d8ac822e7d37b6c3ca0475e762 · Roger Ferrer / llvm-epi-0.8

Dec 12, 2012

Add ARM NONE and PREL31 relocation types. · 4dd14fb5

Logan Chien authored Dec 12, 2012

Add R_ARM_NONE and R_ARM_PREL31 relocation types
to MCExpr.  Both of them will be used while
generating .ARM.extab and .ARM.exidx sections.

llvm-svn: 169965

4dd14fb5

Dec 11, 2012

This patch implements the general dynamic TLS model for 64-bit PowerPC. · c56f1d34

Bill Schmidt authored Dec 11, 2012

Given a thread-local symbol x with global-dynamic access, the generated
code to obtain x's address is:

     Instruction                            Relocation            Symbol
  addis ra,r2,x@got@tlsgd@ha           R_PPC64_GOT_TLSGD16_HA       x
  addi  r3,ra,x@got@tlsgd@l            R_PPC64_GOT_TLSGD16_L        x
  bl __tls_get_addr(x@tlsgd)           R_PPC64_TLSGD                x
                                       R_PPC64_REL24           __tls_get_addr
  nop
  <use address in r3>

The implementation borrows from the medium code model work for introducing
special forms of ADDIS and ADDI into the DAG representation.  This is made
slightly more complicated by having to introduce a call to the external
function __tls_get_addr.  Using the full call machinery is overkill and,
more importantly, makes it difficult to add a special relocation.  So I've
introduced another opcode GET_TLS_ADDR to represent the function call, and
surrounded it with register copies to set up the parameter and return value.

Most of the code is pretty straightforward.  I ran into one peculiarity
when I introduced a new PPC opcode BL8_NOP_ELF_TLSGD, which is just like
BL8_NOP_ELF except that it takes another parameter to represent the symbol
("x" above) that requires a relocation on the call.  Something in the 
TblGen machinery causes BL8_NOP_ELF and BL8_NOP_ELF_TLSGD to be treated
identically during the emit phase, so this second operand was never
visited to generate relocations.  This is the reason for the slightly
messy workaround in PPCMCCodeEmitter.cpp:getDirectBrEncoding().

Two new tests are included to demonstrate correct external assembly and
correct generation of relocations using the integrated assembler.

Comments welcome!

Thanks,
Bill

llvm-svn: 169910

c56f1d34

Dec 04, 2012

This patch introduces initial-exec model support for thread-local storage · ca4a0c9d

Bill Schmidt authored Dec 04, 2012

on 64-bit PowerPC ELF.

The patch includes code to handle external assembly and MC output with the
integrated assembler.  It intentionally does not support the "old" JIT.

For the initial-exec TLS model, the ABI requires the following to calculate
the address of external thread-local variable x:

 Code sequence            Relocation                  Symbol
  ld 9,x@got@tprel(2)      R_PPC64_GOT_TPREL16_DS      x
  add 9,9,x@tls            R_PPC64_TLS                 x

The register 9 is arbitrary here.  The linker will replace x@got@tprel
with the offset relative to the thread pointer to the generated GOT
entry for symbol x.  It will replace x@tls with the thread-pointer
register (13).

The two test cases verify correct assembly output and relocation output
as just described.

PowerPC-specific selection node variants are added for the two
instructions above:  LD_GOT_TPREL and ADD_TLS.  These are inserted
when an initial-exec global variable is encountered by
PPCTargetLowering::LowerGlobalTLSAddress(), and later lowered to
machine instructions LDgotTPREL and ADD8TLS.  LDgotTPREL is a pseudo
that uses the same LDrs support added for medium code model's LDtocL,
with a different relocation type.

The rest of the processing is straightforward.

llvm-svn: 169281

ca4a0c9d

Nov 27, 2012

This patch implements medium code model support for 64-bit PowerPC. · 34627e34

Bill Schmidt authored Nov 27, 2012

The default for 64-bit PowerPC is small code model, in which TOC entries
must be addressable using a 16-bit offset from the TOC pointer.  Additionally,
only TOC entries are addressed via the TOC pointer.

With medium code model, TOC entries and data sections can all be addressed
via the TOC pointer using a 32-bit offset.  Cooperation with the linker
allows 16-bit offsets to be used when these are sufficient, reducing the
number of extra instructions that need to be executed.  Medium code model
also does not generate explicit TOC entries in ".section toc" for variables
that are wholly internal to the compilation unit.

Consider a load of an external 4-byte integer.  With small code model, the
compiler generates:

	ld 3, .LC1@toc(2)
	lwz 4, 0(3)

	.section	.toc,"aw",@progbits
.LC1:
	.tc ei[TC],ei

With medium model, it instead generates:

	addis 3, 2, .LC1@toc@ha
	ld 3, .LC1@toc@l(3)
	lwz 4, 0(3)

	.section	.toc,"aw",@progbits
.LC1:
	.tc ei[TC],ei

Here .LC1@toc@ha is a relocation requesting the upper 16 bits of the
32-bit offset of ei's TOC entry from the TOC base pointer.  Similarly,
.LC1@toc@l is a relocation requesting the lower 16 bits.  Note that if
the linker determines that ei's TOC entry is within a 16-bit offset of
the TOC base pointer, it will replace the "addis" with a "nop", and
replace the "ld" with the identical "ld" instruction from the small
code model example.

Consider next a load of a function-scope static integer.  For small code
model, the compiler generates:

	ld 3, .LC1@toc(2)
	lwz 4, 0(3)

	.section	.toc,"aw",@progbits
.LC1:
	.tc test_fn_static.si[TC],test_fn_static.si
	.type	test_fn_static.si,@object
	.local	test_fn_static.si
	.comm	test_fn_static.si,4,4

For medium code model, the compiler generates:

	addis 3, 2, test_fn_static.si@toc@ha
	addi 3, 3, test_fn_static.si@toc@l
	lwz 4, 0(3)

	.type	test_fn_static.si,@object
	.local	test_fn_static.si
	.comm	test_fn_static.si,4,4

Again, the linker may replace the "addis" with a "nop", calculating only
a 16-bit offset when this is sufficient.

Note that it would be more efficient for the compiler to generate:

	addis 3, 2, test_fn_static.si@toc@ha
        lwz 4, test_fn_static.si@toc@l(3)

The current patch does not perform this optimization yet.  This will be
addressed as a peephole optimization in a later patch.

For the moment, the default code model for 64-bit PowerPC will remain the
small code model.  We plan to eventually change the default to medium code
model, which matches current upstream GCC behavior.  Note that the different
code models are ABI-compatible, so code compiled with different models will
be linked and execute correctly.

I've tested the regression suite and the application/benchmark test suite in
two ways:  Once with the patch as submitted here, and once with additional
logic to force medium code model as the default.  The tests all compile
cleanly, with one exception.  The mandel-2 application test fails due to an
unrelated ABI compatibility with passing complex numbers.  It just so happens
that small code model was incredibly lucky, in that temporary values in 
floating-point registers held the expected values needed by the external
library routine that was called incorrectly.  My current thought is to correct
the ABI problems with _Complex before making medium code model the default,
to avoid introducing this "regression."

Here are a few comments on how the patch works, since the selection code
can be difficult to follow:

The existing logic for small code model defines three pseudo-instructions:
LDtoc for most uses, LDtocJTI for jump table addresses, and LDtocCPT for
constant pool addresses.  These are expanded by SelectCodeCommon().  The
pseudo-instruction approach doesn't work for medium code model, because
we need to generate two instructions when we match the same pattern.
Instead, new logic in PPCDAGToDAGISel::Select() intercepts the TOC_ENTRY
node for medium code model, and generates an ADDIStocHA followed by either
a LDtocL or an ADDItocL.  These new node types correspond naturally to
the sequences described above.

The addis/ld sequence is generated for the following cases:
 * Jump table addresses
 * Function addresses
 * External global variables
 * Tentative definitions of global variables (common linkage)

The addis/addi sequence is generated for the following cases:
 * Constant pool entries
 * File-scope static global variables
 * Function-scope static variables

Expanding to the two-instruction sequences at select time exposes the
instructions to subsequent optimization, particularly scheduling.

The rest of the processing occurs at assembly time, in
PPCAsmPrinter::EmitInstruction.  Each of the instructions is converted to
a "real" PowerPC instruction.  When a TOC entry needs to be created, this
is done here in the same manner as for the existing LDtoc, LDtocJTI, and
LDtocCPT pseudo-instructions (I factored out a new routine to handle this).

I had originally thought that if a TOC entry was needed for LDtocL or
ADDItocL, it would already have been generated for the previous ADDIStocHA.
However, at higher optimization levels, the ADDIStocHA may appear in a 
different block, which may be assembled textually following the block
containing the LDtocL or ADDItocL.  So it is necessary to include the
possibility of creating a new TOC entry for those two instructions.

Note that for LDtocL, we generate a new form of LD called LDrs.  This
allows specifying the @toc@l relocation for the offset field of the LD
instruction (i.e., the offset is replaced by a SymbolLo relocation).
When the peephole optimization described above is added, we will need
to do similar things for all immediate-form load and store operations.

The seven "mcm-n.ll" test cases are kept separate because otherwise the
intermingling of various TOC entries and so forth makes the tests fragile
and hard to understand.

The above assumes use of an external assembler.  For use of the
integrated assembler, new relocations are added and used by
PPCELFObjectWriter.  Testing is done with "mcm-obj.ll", which tests for
proper generation of the various relocations for the same sequences
tested with the external assembler.

llvm-svn: 168708

34627e34

Nov 21, 2012
- Add relocations used for mips big GOT. · 64b52d84
  Akira Hatanaka authored Nov 21, 2012
```
llvm-svn: 168448
```
  64b52d84
Nov 09, 2012
- Add ARM TARGET2 relocation. The testcase will follow with actualy use-case. · a305ea55
  Anton Korobeynikov authored Nov 09, 2012
```
Based on the patch by Logan Chien!

llvm-svn: 167633
```
  a305ea55
Sep 26, 2012
- Rename virtual table anchors from Anchor() to anchor() for consistency with the rest of the tree. · 2a6a08b1
  Craig Topper authored Sep 26, 2012
```
llvm-svn: 164666
```
  2a6a08b1
Sep 12, 2012

Release build: guard dump functions with · 49d684e1

Manman Ren authored Sep 12, 2012

"#if !defined(NDEBUG) || defined(LLVM_ENABLE_DUMP)"

No functional change. Update r163344.

llvm-svn: 163679

49d684e1

Sep 06, 2012
- Release build: guard dump functions with "ifndef NDEBUG" · c3366cce
  Manman Ren authored Sep 06, 2012
```
No functional change.

llvm-svn: 163344
```
  c3366cce
Aug 24, 2012
- Lower constant pools and jump tables via TOC on PPC64/SVR4. · ace4707e
  Roman Divacky authored Aug 24, 2012
```
In collaboration with Adhemerval Zanella.

llvm-svn: 162562
```
  ace4707e
Jul 21, 2012
- Add VK_Mips_HIGHER and VK_Mips_HIGHEST to MCSymbolRefExpr::VariantKind. · f73e3627
  Akira Hatanaka authored Jul 21, 2012
```
Test case will be added later when long branch patch is checked in.

llvm-svn: 160597
```
  f73e3627
Jun 04, 2012
- Implement local-exec TLS on PowerPC. · e3f15c98
  Roman Divacky authored Jun 04, 2012
```
llvm-svn: 157935
```
  e3f15c98
Mar 26, 2012
- Prune some includes and forward declarations. · 6e80c280
  Craig Topper authored Mar 26, 2012
```
llvm-svn: 153429
```
  6e80c280
Feb 24, 2012

ARM Thumb symbol references in assembly need the low bit set. · 213039a5

Jim Grosbach authored Feb 24, 2012

Add support for a missed case when the symbols in a difference
expression are in the same section but not the same fragment.

rdar://10924681

llvm-svn: 151345

213039a5

Feb 11, 2012
- Add support for implicit TLS model used with MS VC runtime. · c6b4017c
  Anton Korobeynikov authored Feb 11, 2012
```
Patch by Kai Nacke!

llvm-svn: 150307
```
  c6b4017c
Feb 07, 2012
- Convert assert(0) to llvm_unreachable · a2886c21
  Craig Topper authored Feb 07, 2012
```
llvm-svn: 149967
```
  a2886c21
Jan 26, 2012

Add support for the R_ARM_TARGET1 relocation, which should be given to... · 6685c08e

James Molloy authored Jan 26, 2012

Add support for the R_ARM_TARGET1 relocation, which should be given to relocations applied to all C++ constructors and destructors.

This enables the linker to match concrete relocation types (absolute or relative) with whatever library or C++ support code is being linked against.

llvm-svn: 149057

6685c08e

Jan 10, 2012
- Add 'llvm_unreachable' to passify GCC's understanding of the constraints · f3e8502c
  Chandler Carruth authored Jan 10, 2012
```
of several newly un-defaulted switches. This also helps optimizers
(including LLVM's) recognize that every case is covered, and we should
assume as much.

llvm-svn: 147861
```
  f3e8502c
- Remove unnecessary default cases in switches that cover all enum values. · edbb58c5
  David Blaikie authored Jan 10, 2012
```
llvm-svn: 147855
```
  edbb58c5
Dec 22, 2011
- Local dynamic TLS model for direct object output. Create the correct TLS MIPS · e2eed964
  Akira Hatanaka authored Dec 22, 2011
```
ELF relocations.

Patch by Jack Carter.

llvm-svn: 147118
```
  e2eed964
Nov 15, 2011
- Remove function printMipsSymbolRef. · d519d8ca
  Akira Hatanaka authored Nov 15, 2011
```
llvm-svn: 144663
```
  d519d8ca
Oct 25, 2011
- This is the first of several patches for Mips direct object generation. · 82b077ec
  Bruno Cardoso Lopes authored Oct 25, 2011
```
This first patch is for expression variable kinds.

Patch by Jack Carter!

llvm-svn: 142934
```
  82b077ec
Jul 26, 2011
- Rename TargetAsmBackend to MCAsmBackend; rename createAsmBackend to createMCAsmBackend. · 5928e69d
  Evan Cheng authored Jul 25, 2011
```
llvm-svn: 136010
```
  5928e69d
Jul 23, 2011
- Move TargetAsmParser.h TargetAsmBackend.h and TargetAsmLexer.h to MC where they belong. · f2596bc6
  Evan Cheng authored Jul 23, 2011
```
llvm-svn: 135833
```
  f2596bc6
Jun 09, 2011

Fix emission of PPC64 assembler on non-darwin platforms by splitting · 4b5665a1

Roman Divacky authored Jun 09, 2011

VK_PPC_{HA,LO}16 into darwin and gas variants.

Darwin wants {ha,lo}16(symbol) while gnu as wants symbol@{ha,l}.

llvm-svn: 132802

4b5665a1

Apr 29, 2011
- MCExpr: Add FindAssociatedSection, which attempts to mirror the 'as' semantics · dc3e4cc5
  Daniel Dunbar authored Apr 29, 2011
```
that associate sections with expressions.

llvm-svn: 130517
```
  dc3e4cc5
Apr 15, 2011
- Fix a ton of comment typos found by codespell. Patch by · 0ab5e2cd
  Chris Lattner authored Apr 15, 2011
```
Luis Felipe Strano Moraes!

llvm-svn: 129558
```
  0ab5e2cd
Mar 22, 2011

Add support for Thumb interworking addresses for symbol offsets that get... · 9746286b

Owen Anderson authored Mar 21, 2011

Add support for Thumb interworking addresses for symbol offsets that get constant folded very early.
This fixes SPASS with -integrated-as.  <rdar://problem/9165399>

llvm-svn: 128037

9746286b

Jan 23, 2011
- Add support for lowercase variants. · 8bac423d
  Rafael Espindola authored Jan 23, 2011
```
llvm-svn: 124071
```
  8bac423d
Jan 13, 2011

Model :upper16: and :lower16: as ARM specific MCTargetExpr. This is a step · 965b3c73

Evan Cheng authored Jan 13, 2011

in the right direction. It eliminated some hacks and will unblock codegen
work. But it's far from being done. It doesn't reject illegal expressions,
e.g. (FOO - :lower16:BAR). It also doesn't work in Thumb2 mode at all.

llvm-svn: 123369

965b3c73

Dec 22, 2010
- Add r122359 back now that the bug in MCDwarfLineAddrFragment fragment has been · 4124ab12
  Rafael Espindola authored Dec 22, 2010
```
fixed.

llvm-svn: 122448
```
  4124ab12
- Revert r122359 while I debug PR8845. · 0e14b61c
  Rafael Espindola authored Dec 22, 2010
```
llvm-svn: 122427
```
  0e14b61c
- Use references and simplify. · 50ce2f06
  Rafael Espindola authored Dec 22, 2010
```
llvm-svn: 122405
```
  50ce2f06
Dec 21, 2010
- Simplify EvaluateAsAbsolute now that EvaluateAsRelocatableImpl does all · a468ae02
  Rafael Espindola authored Dec 21, 2010
```
the folding it can.

llvm-svn: 122359
```
  a468ae02
Dec 19, 2010
- Fixed version of 122160 (the previous one would fold undefined symbols). · ee54636f
  Rafael Espindola authored Dec 19, 2010
```
llvm-svn: 122167
```
  ee54636f
- Revert 122160 while I debug it. · 9a2d4e04
  Rafael Espindola authored Dec 19, 2010
```
llvm-svn: 122165
```
  9a2d4e04
- Move all folding to AttemptToFoldSymbolOffsetDifference. · 6fd80a55
  Rafael Espindola authored Dec 19, 2010
```
llvm-svn: 122160
```
  6fd80a55
Dec 18, 2010
- Merge isAbsolute into IsSymbolRefDifferenceFullyResolved. · b403e098
  Rafael Espindola authored Dec 18, 2010
```
llvm-svn: 122148
```
  b403e098
- Remove the MCObjectFormat class. · 8396dd08
  Rafael Espindola authored Dec 18, 2010
```
llvm-svn: 122147
```
  8396dd08
- Add a FIXME and explain a hack. · 293a7c18
  Rafael Espindola authored Dec 18, 2010
```
llvm-svn: 122144
```
  293a7c18