Skip to content
  1. Jul 09, 2017
    • Craig Topper's avatar
      [InstCombine] Speculatively implement a fix for what might be the root cause... · e79b3e7d
      Craig Topper authored
      [InstCombine] Speculatively implement a fix for what might be the root cause of PR33721 by making sure that we have integer types before doing select C, -1, 0 -> sext C to int
      
      I recently changed m_One and m_AllOnes to use Constant::isOneValue/isAllOnesValue which work on floating point values too. The original implementation looked specifically for ConstantInt scalars and splats. So I'm guessing we are accidentally trying to issue sext/zexts on floating point types now.
      
      Hopefully I figure out how to reproduce the failure from the PR soon.
      
      llvm-svn: 307486
      e79b3e7d
  2. Jul 08, 2017
  3. Jul 07, 2017
    • Lei Huang's avatar
    • Dehao Chen's avatar
      Increase the import-threshold for crtical functions. · 64c46574
      Dehao Chen authored
      Summary: For interative sample-pgo, if a hot call site is inlined in the profiling binary, we should inline it in before profile annotation in the backend. Before that, the compile phase first collects all GUIDs that needs to be imported and creates virtual "hot" call edge in the summary. However, "hot" is not good enough to guarantee the callsites get inlined. This patch introduces "critical" call edge, and assign much higher importing threshold for those edges.
      
      Reviewers: tejohnson
      
      Reviewed By: tejohnson
      
      Subscribers: sanjoy, mehdi_amini, llvm-commits, eraman
      
      Differential Revision: https://reviews.llvm.org/D35096
      
      llvm-svn: 307439
      64c46574
    • Dehao Chen's avatar
      Add sample PGO support to ThinLTO new pass manager. · 3a986142
      Dehao Chen authored
      Summary:
      For SamplePGO + ThinLTO, because profile annotation is done twice at both PrepareForThinLTO pipeline and backend compiler, the following changes are needed at the PrepareForThinLTO phase to ensure the IR is not changed dramatically. Otherwise the profile annotation will be inaccurate in the backend compiler.
      
      * disable hot-caller heuristic
      * disable loop unrolling
      * disable indirect call promotion
      
      This will unblock the new PM testing for sample PGO (tools/clang/test/CodeGen/pgo-sample-thinlto-summary.c), which will be covered in another cfe patch.
      
      Reviewers: chandlerc, tejohnson, davidxl
      
      Reviewed By: tejohnson
      
      Subscribers: sanjoy, mehdi_amini, Prazek, inglorion, llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D34895
      
      llvm-svn: 307437
      3a986142
    • Zachary Turner's avatar
      [PDB] More changes to bring lld PDBs to parity with MSVC. · 3a11fdf8
      Zachary Turner authored
      1) Don't write a /src/headerblock stream.  This appears to be
         written conditionally by MSVC, but it's not clear what the
         condition is.  For now, just remove it since we dont' know
         what it is anyway and the particular pdb we've checked in
         for the test doesn't have one.
      2) Write a valid timestamp for the PDB file signature.  This
         leads to non-reproducible builds, but it matches the default
         behavior of link, so it should be out default as well.  If
         we need reproducibility, we should add a separate command
         line option for it that is off by default.
      3) Write an empty FPO stream.  MSVC seems to always write an
         FPO stream.  This change makes the stream directory match
         up, although we still need to make the contents of the FPO
         stream match.
      
      llvm-svn: 307436
      3a11fdf8
    • Anna Thomas's avatar
      [LoopUnrollRuntime] Support multiple exit blocks unrolling when prolog remainder generated · e3872003
      Anna Thomas authored
      With the NFC refactoring in rL307417 (git SHA 987dd01), all the logic
      is in place to support multiple exit/exiting blocks when prolog
      remainder is generated.
      This patch removed the assert that multiple exit blocks unrolling is only
      supported when epilog remainder is generated.
      
      Also, added test runs and checks with PROLOG prefix in
      runtime-loop-multiple-exits.ll test cases.
      
      llvm-svn: 307435
      e3872003
    • Sanjay Patel's avatar
      [DAGCombiner] use local variable to shorten code; NFCI · 4cea2ec2
      Sanjay Patel authored
      llvm-svn: 307429
      4cea2ec2
    • Quentin Colombet's avatar
      [RegAllocFast] Don't insert kill flags of super-register for partial kill · 868ef847
      Quentin Colombet authored
      When reusing a register for a new definition, the fast register allocator
      used to insert a kill flag at the previous last use of that register to
      inform later passes that this register is free between the redef and the
      last use. However, this may be wrong when subregisters are involved.
      Indeed, a partially redef would have trigger a kill of the full super
      register, potentially wrongly marking all the other subregisters as
      free. Given we don't track which lanes are still live, we cannot set the
      kill flag in such case.
      
      Note: This bug has been latent for about 7 years (r104056).
      
      llvmg.org/PR33677
      
      llvm-svn: 307428
      868ef847
    • Quentin Colombet's avatar
      81551148
    • Davide Italiano's avatar
      [Local] Update the comment for removeUnreachableBlocks. · 4eb210bd
      Davide Italiano authored
      It referenced a wrong function name, and didn't mention what the
      second argument did. This should be slightly more accurate now.
      
      llvm-svn: 307425
      4eb210bd
    • Matthias Braun's avatar
      FuzzerUtilDarwin.cpp: We need to pass modifiable strings to posix_spawn · af7442ab
      Matthias Braun authored
      This fixes a bug where unmodifiable strings where passed to posix_spawn.
      
      This is an attempt to unbreak the greendragon libFuzzer bot.
      
      llvm-svn: 307424
      af7442ab
    • Zachary Turner's avatar
      Fix some differences between lld and MSVC generated PDBs. · c1e93e5f
      Zachary Turner authored
      A couple of things were different about our generated PDBs.
      
      1) We were outputting the wrong Version on the PDB Stream.
         The version we were setting was newer than what MSVC is setting.
         It's not clear what the implications are, but we change LLD
         to use PdbImplVC70, as MSVC does.
      2) For the optional debug stream indices in the DBI Stream, we
         were outputting 0 to mean "the stream is not present".  MSVC
         outputs uint16_t(-1), which is the "correct" way to specify
         that a stream is not present.  So we fix that as well.
      3) We were setting the PDB Stream signature to 0.  This is supposed
         to be the result of calling time(nullptr).  Although this leads
         to non-deterministic builds, a better way to solve that is by
         having a command line option explicitly for generating a
         reproducible build, and have the default behavior of lld-link
         match the default behavior of link.
      
      To test this, I'm making use of the new and improved `pdb diff`
      sub command.  To make it suitable for writing tests against, I had
      to modify the diff subcommand slightly to print less verbose output.
      Previously it would always print | <column> | <value1> | <value2> |
      which is quite verbose, and the values are fragile.  All we really
      want to know is "did we produce the same value as link?"  So I added
      command line options to print a single character representing the
      result status (different, identical, equivalent), and another to
      hide the value display.  Note that just inspecting the diff output
      used to write the test, you can see some things that are obviously
      wrong.  That is just reflective of the fact that this is the state
      of affairs today, not that we're asserting that this is "correct".
      We can use this as a starting point to discover differences, fix
      them, and update the test.
      
      Differential Revision: https://reviews.llvm.org/D35086
      
      llvm-svn: 307422
      c1e93e5f
    • Zachary Turner's avatar
      [llvm-pdbutil] Improve diff mode. · f3b4b2d8
      Zachary Turner authored
      We're getting to the point that some MS tools (e.g. DIA) can recognize
      our PDBs but others (e.g. link.exe) cannot. I think the way forward is
      to improve our tooling to help us find differences more easily. For
      example, if we can compile the same program with clang-cl and cl and
      have a tool tell us all the places where the PDBs differ, this could
      tell us what we're doing wrong. It's tricky though, because there are a
      lot of "benign" differences in a PDB. For example, if the string table
      in one PDB consists of "foo" followed by "bar" and in the other PDB it
      consists of "bar" followed by "foo", this is not necessarily a critical
      difference, as long as the uses of these strings also refer to the
      correct location. On the other hand, if the second PDB doesn't even
      contain the string "foo" at all, this is a critical difference.
      
      diff mode has been in llvm-pdbutil for quite a while, but because of the
      above challenge along with some others, it's been hard to make it
      useful. I think this patch addresses that. It looks for all the same
      things, but it now prints the output in tabular format (carefully
      formatted and aligned into tables and fields), and it highlights
      critical differences in red, non-critical differences in yellow, and
      identical fields in green.  This makes it easy to spot the places we
      differ, and the general concept of outputting arbitrary fields in
      tabular format can be extended to provide analysis into many of the
      different types of information that show up in a PDB.
      
      Differential Revision: https://reviews.llvm.org/D35039
      
      llvm-svn: 307421
      f3b4b2d8
    • Gor Nishanov's avatar
      [cloning] Do not duplicate types when cloning functions · 8cdf6487
      Gor Nishanov authored
      Summary:
      This is an addon to the change rl304488 cloning fixes. (Originally rl304226 reverted rl304228 and reapplied rl304488 https://reviews.llvm.org/D33655)
      
      rl304488 works great when DILocalVariables that comes from the inlined function has a 'unique-ed' type, but,
      in the case when the variable type is distinct we will create a second DILocalVariable in the scope of the original function that was inlined.
      
      Consider cloning of the following function:
      ```
      define private void @f() !dbg !5 {
        %1 = alloca i32, !dbg !11
        call void @llvm.dbg.declare(metadata i32* %1, metadata !14, metadata !12), !dbg !18
        ret void, !dbg !18
      }
      
      !14 = !DILocalVariable(name: "inlined", scope: !15, file: !6, line: 5, type: !17) ; came from an inlined function
      !15 = distinct !DISubprogram(name: "inlined", linkageName: "inlined", scope: null, file: !6, line: 8, type: !7, isLocal: true, isDefinition: true, scopeLine: 9, isOptimized: false, unit: !0, variables: !16)
      !16 = !{!14}
      !17 = distinct !DICompositeType(tag: DW_TAG_structure_type, name: "some_struct", size: 32, align: 32)
      ```
      
      Without this fix, when function 'f' is cloned, we will create another DILocalVariable for "inlined", due to its type being distinct.
      
      ```
      define private void @f.1() !dbg !23 {
        %1 = alloca i32, !dbg !26
        call void @llvm.dbg.declare(metadata i32* %1, metadata !28, metadata !12), !dbg !30
        ret void, !dbg !30
      }
      
      !14 = !DILocalVariable(name: "inlined", scope: !15, file: !6, line: 5, type: !17)
      !15 = distinct !DISubprogram(name: "inlined", linkageName: "inlined", scope: null, file: !6, line: 8, type: !7, isLocal: true, isDefinition: true, scopeLine: 9, isOptimized: false, unit: !0, variables: !16)
      !16 = !{!14}
      !17 = distinct !DICompositeType(tag: DW_TAG_structure_type, name: "some_struct", size: 32, align: 32)
       ;
      !28 = !DILocalVariable(name: "inlined", scope: !15, file: !6, line: 5, type: !29) ; OOPS second DILocalVariable
      !29 = distinct !DICompositeType(tag: DW_TAG_structure_type, name: "some_struct", size: 32, align: 32)
      ```
      
      Now we have two DILocalVariable for "inlined" within the same scope. This result in assert in AsmPrinter/DwarfDebug.h:131: void llvm::DbgVariable::addMMIEntry(const llvm::DbgVariable &): Assertion `V.Var == Var && "conflicting variable"' failed.
      (Full example: See: https://bugs.llvm.org/show_bug.cgi?id=33492)
      
      In this change we prevent duplication of types so that when a metadata for DILocalVariable is cloned it will get uniqued to the same metadate node as an original variable.
      
      Reviewers: loladiro, dblaikie, aprantl, echristo
      
      Reviewed By: loladiro
      
      Subscribers: EricWF, llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D35106
      
      llvm-svn: 307418
      8cdf6487
    • Anna Thomas's avatar
      [LoopUnrollRuntime] NFC: use the precomputed loop exit in ConnectProlog · 734ab3f7
      Anna Thomas authored
      Minor refactoring to use the preexisting loop exit that's already
      calculated. We do not need to recompute the loop exit in ConnectProlog.
      Apart from avoiding redundant computation, this is required for
      supporting multiple loop exits when Prolog remainder loops are generated.
      
      llvm-svn: 307417
      734ab3f7
    • Tony Jiang's avatar
      c260e0eb
    • Simon Pilgrim's avatar
      Fix some more -Wimplicit-fallthrough warnings. NFCI. · cb07d67a
      Simon Pilgrim authored
      llvm-svn: 307411
      cb07d67a
    • Matthew Simpson's avatar
      [ARM] Implement interleaved access bug fix from r306334 · 12eaef75
      Matthew Simpson authored
      r306334 fixed a bug in AArch64 dealing with wide interleaved accesses having
      pointer types. The bug also exists in ARM, so this patch copies over the fix.
      
      llvm-svn: 307409
      12eaef75
    • Sam Kolton's avatar
      [AMDGPU] Assembler: refactor convert methods (VOP3 and MIMG) · 10ac2fd2
      Sam Kolton authored
      Summary: Simplified converter methods for VOP3 and MIMG.
      
      Reviewers: dp, artem.tamazov
      
      Subscribers: arsenm, kzhuravl, wdng, nhaehnle, yaxunl, dstuttard, tpr, vpykhtin, t-tye
      
      Differential Revision: https://reviews.llvm.org/D35047
      
      llvm-svn: 307407
      10ac2fd2
    • Rafael Espindola's avatar
      Fix variable names. NFC. · 7413496f
      Rafael Espindola authored
      llvm-svn: 307406
      7413496f
    • Sanjay Patel's avatar
      [x86] add SBB optimization for SETAE (uge) condition code · dd36f757
      Sanjay Patel authored
      x86 scalar select-of-constants (Cond ? C1 : C2) combining/lowering is a mess 
      with missing optimizations. We handle some patterns, but miss logical variants.
      
      To clean that up, we should convert all select-of-constants to logic/math and 
      enhance the combining for the expected patterns from that. DAGCombiner already 
      has the foundation to allow the transforms, so we just need to fill in the holes 
      for x86 math op lowering. Selecting 0 or -1 needs extra attention to produce the
      optimal code as shown here.
      
      Attempt to verify that all of these IR forms are logically equivalent:
      http://rise4fun.com/Alive/plxs
      
      Earlier steps in this series:
      rL306040
      rL306072
      
      Differential Revision: https://reviews.llvm.org/D34652
      
      llvm-svn: 307404
      dd36f757
    • Sanjay Patel's avatar
      [DemandedBits] fix formatting; NFC · 1bbdf4e1
      Sanjay Patel authored
      llvm-svn: 307403
      1bbdf4e1
    • Dmitry Preobrazhensky's avatar
      [AMDGPU][mc][gfx9] Added support of op_sel/op_sel_hi for V_MAD_MIX* · b2d24e23
      Dmitry Preobrazhensky authored
      See https://bugs.llvm.org//show_bug.cgi?id=33595
      
      Reviewers: vpykhtin, artem.tamazov, arsenm
      
      Differential Revision: https://reviews.llvm.org/D35021
      
      llvm-svn: 307402
      b2d24e23
Loading