Commits · 69f5402b263d5bfcccda8cdb6db669eb34f82f35 · Lorenzo Albano / LLVM bpEVL

Jun 05, 2016
- Use adjustRelaxExpr for tls relaxations too. · 69f5402b
  Rafael Espindola authored Jun 04, 2016
```
This remove some EM_386 specific code from InputSection.cpp and opens
the way for more relaxations.

llvm-svn: 271814
```
  69f5402b
- Rename TlsGdToLeSkip. · f807d471
  Rafael Espindola authored Jun 04, 2016
```
It will also be used for GT_TO_IE relaxations.

llvm-svn: 271813
```
  f807d471
- Rename adjustRelaxGotExpr. · 5c66b826
  Rafael Espindola authored Jun 04, 2016
```
It will be used for more than just gots.

llvm-svn: 271812
```
  5c66b826
Jun 04, 2016
- Add missing REQUIRES. · d167a25e
  Rafael Espindola authored Jun 04, 2016
```
llvm-svn: 271799
```
  d167a25e
- Fix implicit plt creation on aarch64. · 12dc4469
  Rafael Espindola authored Jun 04, 2016
```
We were not handling page relative relocations.

llvm-svn: 271798
```
  12dc4469
Jun 03, 2016
- Apply clang-tidy's misc-move-constructor-init to lld. · bd521201
  Benjamin Kramer authored Jun 03, 2016
```
No functionality change intended.

llvm-svn: 271686
```
  bd521201
- [LTO] Add --lto-aa-pipeline. · df24d5b8
  Davide Italiano authored Jun 02, 2016
```
Differential Revision:  http://reviews.llvm.org/D20888

llvm-svn: 271605
```
  df24d5b8
Jun 02, 2016

Start adding tlsdesc support for aarch64. · e37d13b9

Rafael Espindola authored Jun 02, 2016

This is mostly extracted from http://reviews.llvm.org/D18960.

The general idea for tlsdesc is that the two GD got entries are used
for a function pointer and its argument. The dynamic linker sets
both. In the non-dlopen case the dynamic linker sets the function to
the identity and the argument to the offset in the tls block.

All that the static linker has to do in the non-dlopen case is
relocate the code to point to the got entries and create a dynamic
relocation.

The dlopen case is more complicated, but can be implemented in another patch.

llvm-svn: 271569

e37d13b9

Simplify mask computation. · 1c0eb972
Rafael Espindola authored Jun 02, 2016
```
llvm-svn: 271525
```
1c0eb972
Simplify. NFC. · 1016f192
Rafael Espindola authored Jun 02, 2016
```
updateAArch64Add takes care of masking.

llvm-svn: 271524
```
1016f192
Stort lines. NFC. · 53d0a9fe
Rafael Espindola authored Jun 02, 2016
```
llvm-svn: 271523
```
53d0a9fe
Delete dead code. · 0f1401a8
Rafael Espindola authored Jun 02, 2016
```
AArch64 uses TLSDESC, so these are dead.

llvm-svn: 271517
```
0f1401a8
[ELF] Split too long X86_64TargetInfo::relaxGot method. NFC. · b720430b
George Rimar authored Jun 02, 2016
```
Patch adds relaxGotNoPic() method to handle no-PIC path.

llvm-svn: 271506
```
b720430b

Jun 01, 2016

[ELF] - Implemented support for test/binop relaxations from latest ABI. · f10c8290

George Rimar authored Jun 01, 2016

Patch implements next relaxation from latest ABI:

"Convert memory operand of test and binop into immediate operand, where binop is one of adc, add, and, cmp, or,
sbb, sub, xor instructions, when position-independent code is disabled."

It is described in System V Application Binary Interface AMD64 Architecture Processor
Supplement Draft Version 0.99.8 (https://github.com/hjl-tools/x86-psABI/wiki/x86-64-psABI-r249.pdf,
B.2 "B.2 Optimize GOTPCRELX Relocations").

Differential revision: http://reviews.llvm.org/D20793

llvm-svn: 271405

f10c8290

[LTO] Fix (incorrect) TLS attribute mismatch. · 64ebf32e

Davide Italiano authored Jun 01, 2016

When we undefine, we also preserve type of symbol so that we get
it right in the combined LTO object.

Differential Revision:  http://reviews.llvm.org/D20851

llvm-svn: 271403

64ebf32e

Handle the -T option. · 3d6d4c39

Rafael Espindola authored Jun 01, 2016

We were not reading it or including in the --reproduce archive.

llvm-svn: 271367

3d6d4c39

Revert "bar" · a8433c1d

Rafael Espindola authored Jun 01, 2016

This reverts commit r271365.
Sorry, wrong branch.

llvm-svn: 271366

a8433c1d

bar · 74540516
Rafael Espindola authored Jun 01, 2016
```
llvm-svn: 271365
```
74540516

May 29, 2016
- [ELF] Unbreak build with GCC. · e6c8fa45
  Davide Italiano authored May 28, 2016
```
Differential Revision:  http://reviews.llvm.org/D20777

llvm-svn: 271148
```
  e6c8fa45
May 28, 2016

Simplify. NFC. · 8b972d22
Rui Ueyama authored May 28, 2016
```
llvm-svn: 271133
```
8b972d22

Make test more realistic. · 3b1ecb56

Rafael Espindola authored May 28, 2016

It doesn't make mach sense to fetch less than 64 bits from a got
entry.

llvm-svn: 271116

3b1ecb56

[ELF][MIPS] Always resolve MIPS GP-relative relocations to 'local' definitions · 9a9a3169

Simon Atanasyan authored May 28, 2016

In case of MIPS, GP-relative relocations always resolve to a definition
in a regular input file, ignoring the one-definition rule. Such
relocations are used to setup GP relative offsets in a function's
prologue. So we, for example, should not attempt to create a dynamic
relocation even if the target symbol is preemptible.

Fixes bug 27880.

Differential Revision: http://reviews.llvm.org/D20664

llvm-svn: 271100

9a9a3169

May 27, 2016

Avoid doing binary search. · 406b469d

Rui Ueyama authored May 27, 2016

MergedInputSection::getOffset is the busiest function in LLD if string
merging is enabled and input files have lots of mergeable sections.
It is usually the case when creating executable with debug info,
so it is pretty common.

The reason why it is slow is because it has to do faily complex
computations. For non-mergeable sections, section contents are
contiguous in output, so in order to compute an output offset,
we only have to add the output section's base address to an input
offset. But for mergeable strings, section contents are split for
merging, so they are not contigous. We've got to do some lookups.

We used to do binary search on the list of section pieces.
It is slow because I think it's hostile to branch prediction.

This patch replaces it with hash table lookup. Seems it's working
pretty well. Below is "perf stat -r10" output when linking clang
with debug info. In this case this patch speeds up about 4%.

Before:

       6584.153205 task-clock (msec)         #    1.001 CPUs utilized            ( +-  0.09% )
               238 context-switches          #    0.036 K/sec                    ( +-  6.59% )
                 0 cpu-migrations            #    0.000 K/sec                    ( +- 50.92% )
         1,067,675 page-faults               #    0.162 M/sec                    ( +-  0.15% )
    18,369,931,470 cycles                    #    2.790 GHz                      ( +-  0.09% )
     9,640,680,143 stalled-cycles-frontend   #   52.48% frontend cycles idle     ( +-  0.18% )
   <not supported> stalled-cycles-backend
    21,206,747,787 instructions              #    1.15  insns per cycle
                                             #    0.45  stalled cycles per insn  ( +-  0.04% )
     3,817,398,032 branches                  #  579.786 M/sec                    ( +-  0.04% )
       132,787,249 branch-misses             #    3.48% of all branches          ( +-  0.02% )

       6.579106511 seconds time elapsed                                          ( +-  0.09% )

After:

       6312.317533 task-clock (msec)         #    1.001 CPUs utilized            ( +-  0.19% )
               221 context-switches          #    0.035 K/sec                    ( +-  4.11% )
                 1 cpu-migrations            #    0.000 K/sec                    ( +- 45.21% )
         1,280,775 page-faults               #    0.203 M/sec                    ( +-  0.37% )
    17,611,539,150 cycles                    #    2.790 GHz                      ( +-  0.19% )
    10,285,148,569 stalled-cycles-frontend   #   58.40% frontend cycles idle     ( +-  0.30% )
   <not supported> stalled-cycles-backend
    18,794,779,900 instructions              #    1.07  insns per cycle
                                             #    0.55  stalled cycles per insn  ( +-  0.03% )
     3,287,450,865 branches                  #  520.799 M/sec                    ( +-  0.03% )
        72,259,605 branch-misses             #    2.20% of all branches          ( +-  0.01% )

       6.307411828 seconds time elapsed                                          ( +-  0.19% )

Differential Revision: http://reviews.llvm.org/D20645

llvm-svn: 270999

406b469d

Avoid having to check in a binary. · 6af54618
Rafael Espindola authored May 27, 2016
```
llvm-svn: 270986
```
6af54618
Update LLD for D20550. · 5079f3b7
Peter Collingbourne authored May 27, 2016
```
Differential Revision: http://reviews.llvm.org/D20704

llvm-svn: 270968
```
5079f3b7
Make -L description a bit more precise. · 8ef190c7
Sean Silva authored May 27, 2016
```
llvm-svn: 270966
```
8ef190c7
Explain a bit better what --start-lib and --end-lib do. · 3b536d09
Sean Silva authored May 27, 2016
```
llvm-svn: 270965
```
3b536d09
Add a help description for --threads to avoid confusion. · 688fade4
Sean Silva authored May 27, 2016
```
llvm-svn: 270964
```
688fade4

--threads is a flag, not a number · 2c1a9da8

Sean Silva authored May 27, 2016

We would previously accept `--threads=4`, but this option just turns on
threading and does not specify a number of threads.

I ran into this by accident because I was passing `--threads=<n>` but
the number didn't seem to affect anything.

llvm-svn: 270963

2c1a9da8

May 26, 2016

[ELF][MIPS] Handle section symbol points to the .MIPS.options / .reginfo section · 84bb355c

Simon Atanasyan authored May 26, 2016

MIPS .reginfo and .MIPS.options sections are consumed by the linker, and
the linker produces a single output section. But it is possible that
input files contain section symbol points to the corresponding input
section. In case of generation a relocatable output we need to write
such symbols to the output file.

Fixes bug 27878.

Differential Revision: http://reviews.llvm.org/D20688

llvm-svn: 270910

84bb355c

Update for llvm change. · a5cefffc
Rafael Espindola authored May 26, 2016
```
llvm-svn: 270907
```
a5cefffc
Removed redundant argument. NFC. · a8f9cf18
George Rimar authored May 26, 2016
```
llvm-svn: 270847
```
a8f9cf18

May 25, 2016

[ELF] - Added support for jmp/call relaxations when... · 95433df1

George Rimar authored May 25, 2016

[ELF] - Added support for jmp/call relaxations when R_X86_64_GOTPCRELX/R_X86_64_REX_GOTPCRELX are used.

D15779 introduced basic approach to support new relaxations.
This patch implements relaxations for jmp and call instructions,
described in System V Application Binary Interface AMD64 Architecture Processor 
Supplement Draft Version 0.99.8 (https://github.com/hjl-tools/x86-psABI/wiki/x86-64-psABI-r249.pdf, 
B.2 "B.2 Optimize GOTPCRELX Relocations")

Differential revision: http://reviews.llvm.org/D20622

llvm-svn: 270721

95433df1

Make SectionPiece 8 bytes smaller on LP64. · d8849274

Rui Ueyama authored May 25, 2016

This patch makes SectionPiece class 8 bytes smaller on platforms
on which pointer size is 8 bytes. Sean suggested in a post commit
review for r270340 that this could make a differentce, and it
actually is. Time to link clang (with debug info) improved from
6.725 seconds to 6.589 seconds or by about 2%.

Differential Revision: http://reviews.llvm.org/D20613

llvm-svn: 270717

d8849274

Do not ignore --no_ctors_in_init_array flag. · 1795f782
Rui Ueyama authored May 25, 2016
```
That flag is probably too dangerous to ignore silently.

llvm-svn: 270711
```
1795f782

ELF: improve CIE no-augmentation test · 2e04361a

Ed Maste authored May 25, 2016

Add another possible error that may be reported for the same case. The
original reproduction case that prompted r270706 produced the error
"corrupted CIE" instead of "corrupted or unsupported CIE information".
The specific error depends on arbitrary data later in the file so
check that neither is emitted in case the input is ever changed.

Document the process used to create the input .o and rename the test
file to .s, as requested by Rafael.

llvm-svn: 270709

2e04361a

ELF: Handle empty CIE augmentation string · 594e06b8

Ed Maste authored May 25, 2016

"A zero length string indicates that no augmentation data is present."

The FreeBSD/mips toolchain (GCC 4.2.1) generates .debug_frame sections
containing CIE records that have an empty augmentation string.

Differential Revision: http://reviews.llvm.org/D19928

llvm-svn: 270706

594e06b8

[ELF] - Implemented optimization for R_X86_64_GOTPCREL relocation. · 5c33b91b

George Rimar authored May 25, 2016

System V Application Binary Interface AMD64 Architecture Processor Supplement Draft Version 0.99.8 
(https://github.com/hjl-tools/x86-psABI/wiki/x86-64-psABI-r249.pdf, B.2 "B.2 Optimize GOTPCRELX Relocations")
introduces possible relaxations for R_X86_64_GOTPCRELX and R_X86_64_REX_GOTPCRELX.

That patch implements the next relaxation: 
mov foo@GOTPCREL(%rip), %reg => lea foo(%rip), %reg
and also opens door for implementing all other ones.

Implementation was suggested by Rafael Ávila de Espíndola with few additions and testcases by myself.

Differential revision: http://reviews.llvm.org/D15779

llvm-svn: 270705

5c33b91b

Really define --export-dynamic-symbol= as an alias to --export-dynamic-symbol. · c789b631
Rui Ueyama authored May 25, 2016
```
Thanks to Sean for pointing it out.

llvm-svn: 270660
```
c789b631
Fix comment. · 02fcf11a
Rui Ueyama authored May 25, 2016
```
llvm-svn: 270659
```
02fcf11a