Skip to content
  1. Jun 08, 2018
    • Simon Pilgrim's avatar
      [X86][SSE] Add SSE2/AVX2 vector rotate tests · eab9d204
      Simon Pilgrim authored
      Now that we're custom lowering vector rotates for SSE in general we should be testing the combines with them as well.
      
      llvm-svn: 334290
      eab9d204
    • Simon Pilgrim's avatar
      [X86][SSE] Simplify combineVectorTruncationWithPACKUS to reduce code duplication · a6afa310
      Simon Pilgrim authored
      Simplify combineVectorTruncationWithPACKUS to mask the upper bits followed by calling truncateVectorWithPACK instead of duplicating with similar code.
      
      This results in the codegen using (V)PACKUSDW on SSE41+ targets for vXi64/vXi32 inputs where before it always used PACKUSWB (along with a lot more bitcasting).
      
      I've raised PR37749 as until we avoid unnecessary concats back to 256-bit for bitwise ops, we can't avoid splitting the input value into 128-bit subvectors for masking.
      
      llvm-svn: 334289
      a6afa310
    • Sanjay Patel's avatar
      [x86] restore test comment; NFC · ab4ca060
      Sanjay Patel authored
      The description got deleted along with the FIXME note in
      rL334268.
      
      llvm-svn: 334288
      ab4ca060
    • Sam McCall's avatar
      [clangd] Require case-insensitive prefix match for macro completions. · 80ad7072
      Sam McCall authored
      Summary: Macros are terribly spammy at the moment and this offers some relief.
      
      Reviewers: ioeric
      
      Subscribers: ilya-biryukov, MaskRay, jkorous, cfe-commits
      
      Differential Revision: https://reviews.llvm.org/D47936
      
      llvm-svn: 334287
      80ad7072
    • Artur Pilipenko's avatar
      [BPI] Apply invoke heuristic before loop branch heuristic · 4d063e7b
      Artur Pilipenko authored
      Currently the loop branch heuristic is applied before the invoke heuristic which makes us overestimate the probability of the unwind destination of invokes inside loops. This in turn makes us grossly underestimate the frequencies of loops with invokes.
      
      Reviewed By: skatkov, vsk
      
      Differential Revision: https://reviews.llvm.org/D47371
      
      llvm-svn: 334285
      4d063e7b
    • Florian Hahn's avatar
      [VPlan] Move recipe based VPlan generation to separate function. · b3c6f07d
      Florian Hahn authored
      This first step separates VPInstruction-based and VPRecipe-based
      VPlan creation, which should make it easier to migrate to VPInstruction
      based code-gen step by step.
      
      Reviewers: Ayal, rengolin, dcaballe, hsaito, mkuper, mzolotukhin
      
      Reviewed By: dcaballe
      
      Subscribers: bollu, tschuett, rkruppe, llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D47477
      
      llvm-svn: 334284
      b3c6f07d
    • Henry Wong's avatar
      [ADT] Add `StringRef::rsplit(StringRef Separator)`. · 945c481a
      Henry Wong authored
      Summary: Add `StringRef::rsplit(StringRef Separator)` to achieve the function of getting the tail substring according to the separator. A typical usage is to get `data` in `std::basic_string::data`.
      
      Reviewers: mehdi_amini, zturner, beanz, xbolva00, vsk
      
      Reviewed By: zturner, xbolva00, vsk
      
      Subscribers: vsk, xbolva00, llvm-commits, MTC
      
      Differential Revision: https://reviews.llvm.org/D47406
      
      llvm-svn: 334283
      945c481a
    • Tatyana Krasnukha's avatar
      Fix DynamicRegisterInfo copying/moving issue. · 74fd1a65
      Tatyana Krasnukha authored
      Summary:
      Default copy/move constructors and assignment operators leave wrong m_sets[i].registers pointers.
      
      Made the class movable and non-copyable (it's difficult to imagine when it needs to be copied).
      
      Reviewers: clayborg
      
      Reviewed By: clayborg
      
      Differential Revision: https://reviews.llvm.org/D47728
      
      llvm-svn: 334282
      74fd1a65
    • Jonas Hahnfeld's avatar
      [CUDA] Fix emission of constant strings in sections · 3b9cbba9
      Jonas Hahnfeld authored
      CGM.GetAddrOfConstantCString() sets the adress of the created GlobalValue
      to unnamed. When emitting the object file LLVM will mark the surrounding
      section as SHF_MERGE iff the string is nul-terminated and contains no
      other nuls (see IsNullTerminatedString). This results in problems when
      saving temporaries because LLVM doesn't set an EntrySize, so reading in
      the serialized assembly file fails.
      This never happened for the GPU binaries because they usually contain
      a nul-character somewhere. Instead this only affected the module ID
      when compiling relocatable device code.
      
      However, this points to a potentially larger problem: If we put a
      constant string into a named section, we really want the data to end
      up in that section in the object file. To avoid LLVM merging sections
      this patch unmarks the GlobalVariable's address as unnamed which also
      fixes the problem of invalid serialized assembly files when saving
      temporaries.
      
      Differential Revision: https://reviews.llvm.org/D47902
      
      llvm-svn: 334281
      3b9cbba9
    • Simon Dardis's avatar
      [mips] Correct the predicates for a number of codegen only instructions · 1d6254f7
      Simon Dardis authored
      Reviewers: smaksimovic, atanasyan, abeserminji
      
      Differential Revision: https://reviews.llvm.org/D47638
      
      llvm-svn: 334280
      1d6254f7
    • Pavel Labath's avatar
      Fix TestMiExec.py · 98224109
      Pavel Labath authored
      r334215 changed the error message the tool prints for invalid thread
      arguments to -exec-next command. This adjust the test to match that.
      
      llvm-svn: 334279
      98224109
    • Alex Bradbury's avatar
      [RISCV] Implement MC layer support for the fence.tso instruction · ed53ca73
      Alex Bradbury authored
      The instruction makes use of a previously ignored field in the fence
      instruction. It is introduced in the version 2.3 draft of the RISC-V
      specification after much work by the Memory Model Task Group.
      
      As clarified here <https://github.com/riscv/riscv-isa-manual/issues/186>,
      the fence.tso assembler mnemonic does not have operands.
      
      llvm-svn: 334278
      ed53ca73
    • Pavel Labath's avatar
      DebugNamesDWARFIndex: Implement regex version of the GetFunctions method · a3ee1e7f
      Pavel Labath authored
      This also fixes a bug where SymbolFileDWARF was returning the same
      function multiple times - this can happen if both mangled and demangled
      names match the regex. Other lookup lookup functions had code to handle
      this case, but it was forgotten here.
      
      llvm-svn: 334277
      a3ee1e7f
    • Simon Pilgrim's avatar
      [X86][SSE] Consistently prefer lowering to PACKUS over PACKSS · ad45efc4
      Simon Pilgrim authored
      We have some combines/lowerings that attempt to use PACKSS-then-PACKUS and others that use PACKUS-then-PACKSS.
      
      PACKUS is much easier to combine with if we know the upper bits are zero as ComputeKnownBits can easily see through BITCASTs etc. especially now that rL333995 and rL334007 have landed. It also effectively works at byte level which further simplifies shuffle combines.
      
      The only (minor) annoyances are that ComputeKnownBits can sometimes take longer as it doesn't fail as quickly as ComputeNumSignBits (but I'm not seeing any actual regressions in tests) and PACKUSDW only became available after SSE41 so we have more codegen diffs between targets.
      
      llvm-svn: 334276
      ad45efc4
    • Florian Hahn's avatar
      [TableGen] Make DAGInstruction own Pattern to avoid leaking it. · 84e6ef00
      Florian Hahn authored
      Reviewers: dsanders, craig.topper, stoklund, nhaehnle
      
      Reviewed By: craig.topper
      
      Differential Revision: https://reviews.llvm.org/D47525
      
      llvm-svn: 334275
      84e6ef00
    • Sam McCall's avatar
      [clangd] Downrank symbols with reserved names (score *= 0.1) · e018b36c
      Sam McCall authored
      Reviewers: ilya-biryukov
      
      Subscribers: klimek, ioeric, MaskRay, jkorous, cfe-commits
      
      Differential Revision: https://reviews.llvm.org/D47707
      
      llvm-svn: 334274
      e018b36c
    • Pavel Labath's avatar
      DebugNamesDWARFIndex: Implement GetFunctions method · 257ff339
      Pavel Labath authored
      Summary:
      This patch implements the non-regex variant of GetFunctions. To share
      more code with the Apple implementation, I've extracted the common
      filtering code from that class into a utility function on the DWARFIndex
      base class.
      
      The new implementation also searching the accelerator table multiple
      times -- previously it could happen that the apple table would return
      the same die more than once if one specified multiple search flags in
      name_type_mask. This way, I separate table iteration from filtering, and
      so we can be sure each die is inserted at most once.
      
      Reviewers: clayborg, JDevlieghere
      
      Subscribers: aprantl, lldb-commits
      
      Differential Revision: https://reviews.llvm.org/D47881
      
      llvm-svn: 334273
      257ff339
    • David Carlier's avatar
      [Xray] logging forgotten header · 6dedf2b5
      David Carlier authored
      pthread.h missing for pthread_key* functions.
      
      Reviewers: dberris
      
      Reviewed By: dberris
      
      Differential Revison: https://reviews.llvm.org/D47933
      
      llvm-svn: 334272
      6dedf2b5
    • Roman Shirokiy's avatar
      [LV] Fix PR36983. For a given recurrence, fix all phis in exit block · 9ba0aa2d
      Roman Shirokiy authored
      There could be more than one PHIs in exit block using same loop recurrence.
      Don't assume there is only one and fix each user.
      
      Differential Revision: https://reviews.llvm.org/D47788
      
      llvm-svn: 334271
      9ba0aa2d
    • Haojian Wu's avatar
      [clang-tidy] Improve string type matcher for abseil-string-find-starts-with check. · 07996a54
      Haojian Wu authored
      Summary:
      This patch improves the check to match the desugared "string" type (so that it
      can handle custom-implemented string classes), see the newly-added test.
      
      Reviewers: alexfh
      
      Subscribers: klimek, xazax.hun, cfe-commits
      
      Differential Revision: https://reviews.llvm.org/D47704
      
      llvm-svn: 334270
      07996a54
    • Matt Arsenault's avatar
      AMDGPU: Error on LDS global address in functions · 6fc37598
      Matt Arsenault authored
      These won't work as expected now, so error on them to avoid
      wasting time debugging this in the future.
      
      llvm-svn: 334269
      6fc37598
    • Sam Parker's avatar
      [DAGCombine] Fix for PR37667 · 16f963ba
      Sam Parker authored
      While trying to propagate AND masks back to loads, we currently allow
      one non-load node to be included as a leaf in chain. This fix now
      limits that node to produce only a single data value.
      
      Differential Revision: https://reviews.llvm.org/D47878
      
      llvm-svn: 334268
      16f963ba
    • Dean Michael Berris's avatar
      [XRay][compiler-rt] Cleanup some internal XRay utilities · b81372b5
      Dean Michael Berris authored
      This change uses 'const' for the retryingWriteAll(...) API and removes
      unnecessary 'static' local variables in getting the temporary filename.
      
      llvm-svn: 334267
      b81372b5
    • Craig Topper's avatar
    • Craig Topper's avatar
      [X86] Add builtins for pshufd, pshuflw, and pshufhw to enable target feature... · 03de166c
      Craig Topper authored
      [X86] Add builtins for pshufd, pshuflw, and pshufhw to enable target feature and immediate range checking.
      
      llvm-svn: 334265
      03de166c
    • Craig Topper's avatar
      [X86] Fix some typecasts in intrinsic headers that I messed up in r334261. · 573dab15
      Craig Topper authored
      This was caught by the Header tests, but not the CodeGen tests.
      
      llvm-svn: 334264
      573dab15
    • Hiroshi Inoue's avatar
      [NFC] fix formatting · 863fb7a4
      Hiroshi Inoue authored
      llvm-svn: 334263
      863fb7a4
    • Dean Michael Berris's avatar
      [XRay][compiler-rt] Remove reliance on C++ ABI features · af4d583e
      Dean Michael Berris authored
      Summary:
      This fixes http://llvm.org/PR32274.
      
      This change adds a test to ensure that we're able to link XRay modes and
      the runtime to binaries that don't need to depend on the C++ standard
      library or a C++ ABI library. In particular, we ensure that this will work
      with C programs compiled+linked with XRay.
      
      To make the test pass, we need to change a few things in the XRay
      runtime implementations to remove the reliance on C++ ABI features. In
      particular, we change the thread-safe function-local-static
      initialisation to use pthread_* instead of the C++ features that ensure
      non-trivial thread-local/function-local-static initialisation.
      
      Depends on D47696.
      
      Reviewers: dblaikie, jfb, kpw, eizan
      
      Reviewed By: kpw
      
      Subscribers: echristo, eizan, kpw, mgorny, llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D46998
      
      llvm-svn: 334262
      af4d583e
    • Craig Topper's avatar
      [X86] Add subvector insert and extract builtins to enable target feature... · 3428beeb
      Craig Topper authored
      [X86] Add subvector insert and extract builtins to enable target feature checking and immediate range checking.
      
      Test changes are due to differences in how we generate undef elements now. We also changed the types used for extractf128_si256/insertf128_si256 to match the signature of the builtin that previously existed which this patch resurrects. This also matches gcc.
      
      llvm-svn: 334261
      3428beeb
    • Aaron Smith's avatar
      PDB support of function-level linking and splitted functions · 010edd37
      Aaron Smith authored
      Summary:
      The patch adds support of splitted functions (when MSVC is used with PGO) and function-level linking feature.
      
      SymbolFilePDB::ParseCompileUnitLineTable function relies on fact that ranges of compiled source files in the binary are continuous and don't intersect each other. The function creates LineSequence for each file and inserts it into LineTable, and implementation of last one relies on continuity of the sequence. But it's not always true when function-level linking enabled, e.g. in added input test file test-pdb-function-level-linking.exe there is xstring's std__basic_string_char_std__char_traits_char__std__allocator_char_____max_size (.00454820) between test-pdb-function-level-linking.cpp's foo (.00454770) and main (.004548F0).
      
      To fix the problem this patch renews the sequence on each address gap.
      
      Reviewers: asmith, zturner
      
      Reviewed By: asmith
      
      Subscribers: aleksandr.urakov, labath, mgorny, lldb-commits
      
      Differential Revision: https://reviews.llvm.org/D47708
      
      llvm-svn: 334260
      010edd37
    • Raphael Isemann's avatar
      Added missing include to LoadedModuleInfoList.h · e535c144
      Raphael Isemann authored
      Subscribers: lldb-commits
      
      Differential Revision: https://reviews.llvm.org/D47923
      
      llvm-svn: 334259
      e535c144
    • Craig Topper's avatar
      [X86] Improve some shuffle decoding code to remove a conditional from a loop... · 2ed43282
      Craig Topper authored
      [X86] Improve some shuffle decoding code to remove a conditional from a loop and reduce the number of temporary variables. NFCI
      
      The NumControlBits variable was definitely sketchy. I think that only worked because the expected value was 1 or 2 and the number of lanes was 2 or 4. Had their been 8 lanes the number of bits should have been 3 not 4 as the previous code would have given.
      
      llvm-svn: 334258
      2ed43282
    • Tony Tye's avatar
      [AMDGPU] Simplify memory legalizer (add missing virtual descructor) · 6db1f5da
      Tony Tye authored
      Differential Revision: https://reviews.llvm.org/D47504
      
      llvm-svn: 334257
      6db1f5da
    • Craig Topper's avatar
    • Alex Shlyapnikov's avatar
      [HWASan] Disable allocator_returns_null.cc test on Android. · a2670df6
      Alex Shlyapnikov authored
      Disable test added in D47798 on Android and investigate why "new"
      behaves differently there.
      
      llvm-svn: 334255
      a2670df6
    • Reid Kleckner's avatar
      Revert r334209 "[LSR] Check yet more intrinsic pointer operands" · a3609f75
      Reid Kleckner authored
      This causes cast failures when compiling harfbuzz in Chromium.
      Reproducer on the way.
      
      llvm-svn: 334254
      a3609f75
    • Shoaib Meenai's avatar
      [CodeGen] Always use MSVC personality for windows-msvc targets · a5fc6033
      Shoaib Meenai authored
      The windows-msvc target is meant to be ABI compatible with MSVC,
      including the exception handling. Ensure that a windows-msvc triple
      always equates to the MSVC personality being used.
      
      This mostly affects the GNUStep and ObjFW Obj-C runtimes. To the best of
      my knowledge, those are normally not used with windows-msvc triples. I
      believe WinObjC is based on GNUStep (or it at least uses libobjc2), but
      that also takes the approach of wrapping Obj-C exceptions in C++
      exceptions, so the MSVC personality function is the right one to use
      there as well.
      
      Differential Revision: https://reviews.llvm.org/D47862
      
      llvm-svn: 334253
      a5fc6033
    • Matt Morehouse's avatar
      [clang-fuzzer] Made loop_proto more "vectorizable". · 1dc1ff85
      Matt Morehouse authored
      Edited loop_proto and its converter to make more "vectorizable" code
      according to kcc's comment in D47666
        - Removed all while loops
        - Can only index into array with induction variable
      
      Patch By: emmettneyman
      
      Differential Revision: https://reviews.llvm.org/D47920
      
      llvm-svn: 334252
      1dc1ff85
    • Shoaib Meenai's avatar
      Reapply "[Parse] Use CapturedStmt for @finally on MSVC" · 5faf6d88
      Shoaib Meenai authored
      This reapplies r334224 and adds explicit triples to some tests to fix
      them on Windows (where otherwise they would have run with the default
      windows-msvc triple, which I'm changing the behavior for).
      
      Original commit message:
      The body of a `@finally` needs to be executed on both exceptional and
      non-exceptional paths. On landingpad platforms, this is straightforward:
      the `@finally` body is emitted as a normal (non-exceptional) cleanup,
      and then a catch-all is emitted which branches to that cleanup (the
      cleanup has code to conditionally re-throw based on a flag which is set
      by the catch-all).
      
      Unfortunately, we can't use the same approach for MSVC exceptions, where
      the catch-all will be emitted as a catchpad. We can't just branch to the
      cleanup from within the catchpad, since we can only exit it via a
      catchret, at which point the exception is destroyed and we can't
      rethrow. We could potentially emit the finally body inside the catchpad
      and have the normal cleanup path somehow branch into it, but that would
      require some new IR construct that could branch into a catchpad.
      
      Instead, after discussing it with Reid Kleckner, we decided that
      frontend outlining was the best approach, similar to how SEH `__finally`
      works today. We decided to use CapturedStmt (which was also suggested by
      Reid) rather than CaptureFinder (which is what `__finally` uses) since
      the latter doesn't handle a lot of cases we care about, e.g. self
      accesses, property accesses, block captures, etc. Extending
      CaptureFinder to handle those additional cases proved unwieldy, whereas
      CapturedStmt already took care of all of those.  In theory `__finally`
      could also be moved over to CapturedStmt, which would remove some
      existing limitations (e.g. the inability to capture this), although
      CaptureFinder would still be needed for SEH filters.
      
      The one case supported by `@finally` but not CapturedStmt (or
      CaptureFinder for that matter) is arbitrary control flow out of the
      `@finally`, e.g. having a return statement inside a `@finally`. We can
      add that support as a follow-up, but in practice we've found it to be
      used very rarely anyway.
      
      Differential Revision: https://reviews.llvm.org/D47564
      
      llvm-svn: 334251
      5faf6d88
    • Rui Ueyama's avatar
      Expand comments. · e53890f3
      Rui Ueyama authored
      llvm-svn: 334250
      e53890f3
Loading