Skip to content
  1. Jan 29, 2019
    • Bjorn Pettersson's avatar
      [IPCP] Don't crash due to arg count/type mismatch between caller/callee · d014d576
      Bjorn Pettersson authored
      Summary:
      This patch avoids an assert in IPConstantPropagation when
      there is a argument count/type mismatch between the caller and
      the callee.
      
      While this is actually UB on C-level (clang emits a warning),
      the IR verifier seems to accept it. I'm not sure what other
      frontends/languages might think about this, so simply bailing out
      to avoid hitting an assert (in CallSiteBase<>::getArgOperand or
      Value::doRAUW) seems like a simple solution.
      
      The problem is exposed by the fact that AbstractCallSites will look
      through a bitcast at the callee position of a call/invoke.
      
      Reviewers: jdoerfert, reames, efriedma
      
      Reviewed By: jdoerfert, efriedma
      
      Subscribers: eli.friedman, efriedma, llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D57052
      
      llvm-svn: 352469
      d014d576
  2. Jan 19, 2019
    • Johannes Doerfert's avatar
      Enable IPConstantPropagation to work with abstract call sites · 36872b5d
      Johannes Doerfert authored
      This modification of the currently unused inter-procedural constant
      propagation pass (IPConstantPropagation) shows how abstract call sites
      enable optimization of callback calls alongside direct and indirect
      calls. Through minimal changes, mostly dealing with the partial mapping
      of callbacks, inter-procedural constant propagation was enabled for
      callbacks, e.g., OpenMP runtime calls or pthreads_create.
      
      Differential Revision: https://reviews.llvm.org/D56447
      
      llvm-svn: 351628
      36872b5d
  3. Aug 23, 2018
    • Florian Hahn's avatar
      Recommit r333268: [IPSCCP] Use PredicateInfo to propagate facts from cmp instructions. · 3052290d
      Florian Hahn authored
      This version of the patch fixes cleaning up ssa_copy intrinsics, so it does not
      crash for instructions in blocks that have been marked unreachable.
      
      This patch updates IPSCCP to use PredicateInfo to propagate
      facts to true branches predicated by EQ and to false branches
      predicated by NE.
      
      As a follow up, we should be able to extend it to also propagate additional
      facts about nonnull.
      
      Reviewers: davide, mssimpso, dberlin, efriedma
      
      Reviewed By: davide, dberlin
      
      Differential Revision: https://reviews.llvm.org/D45330
      
      llvm-svn: 340525
      3052290d
  4. Jul 25, 2018
  5. Jul 17, 2018
    • Florian Hahn's avatar
      [IPSCCP] Run Solve each time we resolved an undef in a function. · d95761d9
      Florian Hahn authored
      Once we resolved an undef in a function we can run Solve, which could
      lead to finding a constant return value for the function, which in turn
      could turn undefs into constants in other functions that call it, before
      resolving undefs there.
      
      Computationally the amount of work we are doing stays the same, just the
      order we process things is slightly different and potentially there are
      a few less undefs to resolve.
      
      We are still relying on the order of functions in the IR, which means
      depending on the order, we are able to resolve the optimal undef first
      or not. For example, if @test1 comes before @testf, we find the constant
      return value of @testf too late and we cannot use it while solving
      @test1.
      
      This on its own does not lead to more constants removed in the
      test-suite, probably because currently we have to be very lucky to visit
      applicable functions in the right order.
      
      Maybe we manage to come up with a better way of resolving undefs in more
      'profitable' functions first.
      
      Reviewers: efriedma, mssimpso, davide
      
      Reviewed By: efriedma, davide
      
      Differential Revision: https://reviews.llvm.org/D49385
      
      llvm-svn: 337283
      d95761d9
  6. Jul 10, 2018
    • Manoj Gupta's avatar
      llvm: Add support for "-fno-delete-null-pointer-checks" · 77eeac3d
      Manoj Gupta authored
      Summary:
      Support for this option is needed for building Linux kernel.
      This is a very frequently requested feature by kernel developers.
      
      More details : https://lkml.org/lkml/2018/4/4/601
      
      GCC option description for -fdelete-null-pointer-checks:
      This Assume that programs cannot safely dereference null pointers,
      and that no code or data element resides at address zero.
      
      -fno-delete-null-pointer-checks is the inverse of this implying that
      null pointer dereferencing is not undefined.
      
      This feature is implemented in LLVM IR in this CL as the function attribute
      "null-pointer-is-valid"="true" in IR (Under review at D47894).
      The CL updates several passes that assumed null pointer dereferencing is
      undefined to not optimize when the "null-pointer-is-valid"="true"
      attribute is present.
      
      Reviewers: t.p.northover, efriedma, jyknight, chandlerc, rnk, srhines, void, george.burgess.iv
      
      Reviewed By: efriedma, george.burgess.iv
      
      Subscribers: eraman, haicheng, george.burgess.iv, drinkcat, theraven, reames, sanjoy, xbolva00, llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D47895
      
      llvm-svn: 336613
      77eeac3d
  7. Jun 21, 2018
  8. Jun 01, 2018
  9. May 26, 2018
    • Florian Hahn's avatar
      Revert r333268: [IPSCCP] Use PredicateInfo to propagate facts from... · 718af2f8
      Florian Hahn authored
      Reverting this to see if this is causing the failures of the
      clang-with-thin-lto-ubuntu bot.
      
      [IPSCCP] Use PredicateInfo to propagate facts from cmp instructions.
      
      This patch updates IPSCCP to use PredicateInfo to propagate
      facts to true branches predicated by EQ and to false branches
      predicated by NE.
      
      As a follow up, we should be able to extend it to also propagate additional
      facts about nonnull.
      
      Reviewers: davide, mssimpso, dberlin, efriedma
      
      Reviewed By: davide, dberlin
      
      Differential Revision: https://reviews.llvm.org/D45330
      
      llvm-svn: 333323
      718af2f8
  10. May 25, 2018
  11. Mar 01, 2018
    • Reid Kleckner's avatar
      [IPSCCP] do not break musttail invariant (PR36485) · 3762a089
      Reid Kleckner authored
      Do not replace results of `musttail` calls with a constant if the
      call itself can't be removed.
      
      Do not zap returns of `musttail` callees, if the call site can't be
      removed and replaced with a constant.
      
      Do not zap returns of `musttail`-calling blocks, this breaks
      invariant too.
      
      Patch by Fedor Indutny
      
      Differential Revision: https://reviews.llvm.org/D43695
      
      llvm-svn: 326404
      3762a089
  12. Jan 09, 2018
    • Chris Bieneman's avatar
      [IPSCCP] Remove calls without side effects · abdea268
      Chris Bieneman authored
      Summary:
      When performing constant propagation for call instructions we have historically replaced all uses of the return from a call, but not removed the call itself. This is required for correctness if the calls have side effects, however the compiler should be able to safely remove calls that don't have side effects.
      
      This allows the compiler to completely fold away calls to functions that have no side effects if the inputs are constant and the output can be determined at compile time.
      
      Reviewers: davide, sanjoy, bruno, dberlin
      
      Subscribers: llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D38856
      
      llvm-svn: 322125
      abdea268
  13. Feb 04, 2017
  14. Feb 02, 2017
  15. Feb 01, 2017
  16. May 05, 2016
  17. May 04, 2016
    • Hal Finkel's avatar
      [ConstantFold] Don't try to strip fp -> int bitcasts to simplify icmps · e2b89118
      Hal Finkel authored
      ConstantFold has logic to take icmp (bitcast x to y), null and strip the
      bitcast. This makes sense in general, but not if x has floating-point type. In
      this case, we'd need a fcmp, not an icmp, and the code will assert. We normally
      don't see this situation because we constant fold fp -> int bitcasts, however,
      we'll see it for bitcasts of ppc_fp128 -> i128. This is because that bitcast is
      Endian-dependent, and as a result, we don't simplify it in ConstantFold (we
      could, but no one has yet added the necessary logic). Regardless, ConstantFold
      should not depend on that canonicalization for correctness.
      
      llvm-svn: 268534
      e2b89118
  18. Apr 08, 2016
    • Sanjoy Das's avatar
      Don't IPO over functions that can be de-refined · 5ce32728
      Sanjoy Das authored
      Summary:
      Fixes PR26774.
      
      If you're aware of the issue, feel free to skip the "Motivation"
      section and jump directly to "This patch".
      
      Motivation:
      
      I define "refinement" as discarding behaviors from a program that the
      optimizer has license to discard.  So transforming:
      
      ```
      void f(unsigned x) {
        unsigned t = 5 / x;
        (void)t;
      }
      ```
      
      to
      
      ```
      void f(unsigned x) { }
      ```
      
      is refinement, since the behavior went from "if x == 0 then undefined
      else nothing" to "nothing" (the optimizer has license to discard
      undefined behavior).
      
      Refinement is a fundamental aspect of many mid-level optimizations done
      by LLVM.  For instance, transforming `x == (x + 1)` to `false` also
      involves refinement since the expression's value went from "if x is
      `undef` then { `true` or `false` } else { `false` }" to "`false`" (by
      definition, the optimizer has license to fold `undef` to any non-`undef`
      value).
      
      Unfortunately, refinement implies that the optimizer cannot assume
      that the implementation of a function it can see has all of the
      behavior an unoptimized or a differently optimized version of the same
      function can have.  This is a problem for functions with comdat
      linkage, where a function can be replaced by an unoptimized or a
      differently optimized version of the same source level function.
      
      For instance, FunctionAttrs cannot assume a comdat function is
      actually `readnone` even if it does not have any loads or stores in
      it; since there may have been loads and stores in the "original
      function" that were refined out in the currently visible variant, and
      at the link step the linker may in fact choose an implementation with
      a load or a store.  As an example, consider a function that does two
      atomic loads from the same memory location, and writes to memory only
      if the two values are not equal.  The optimizer is allowed to refine
      this function by first CSE'ing the two loads, and the folding the
      comparision to always report that the two values are equal.  Such a
      refined variant will look like it is `readonly`.  However, the
      unoptimized version of the function can still write to memory (since
      the two loads //can// result in different values), and selecting the
      unoptimized version at link time will retroactively invalidate
      transforms we may have done under the assumption that the function
      does not write to memory.
      
      Note: this is not just a problem with atomics or with linking
      differently optimized object files.  See PR26774 for more realistic
      examples that involved neither.
      
      This patch:
      
      This change introduces a new set of linkage types, predicated as
      `GlobalValue::mayBeDerefined` that returns true if the linkage type
      allows a function to be replaced by a differently optimized variant at
      link time.  It then changes a set of IPO passes to bail out if they see
      such a function.
      
      Reviewers: chandlerc, hfinkel, dexonsmith, joker.eph, rnk
      
      Subscribers: mcrosier, llvm-commits
      
      Differential Revision: http://reviews.llvm.org/D18634
      
      llvm-svn: 265762
      5ce32728
  19. Jan 07, 2016
    • David Majnemer's avatar
      [SCCP] Don't violate the lattice invariants · f1a9c9e1
      David Majnemer authored
      We marked values which are 'undef' as constant instead of undefined
      which violates SCCP's invariants.  If we can figure out that a
      computation results in 'undef', leave it in the undefined state.
      
      This fixes PR16052.
      
      llvm-svn: 257102
      f1a9c9e1
    • David Majnemer's avatar
      [SCCP] Can't go from overdefined to constant · bae94573
      David Majnemer authored
      The fix for PR23999 made us mark loads of null as producing the constant
      undef which upsets the lattice.  Instead, keep the load as "undefined".
      This fixes PR26044.
      
      llvm-svn: 257087
      bae94573
  20. Jun 17, 2015
    • David Majnemer's avatar
      Move the personality function from LandingPadInst to Function · 7fddeccb
      David Majnemer authored
      The personality routine currently lives in the LandingPadInst.
      
      This isn't desirable because:
      - All LandingPadInsts in the same function must have the same
        personality routine.  This means that each LandingPadInst beyond the
        first has an operand which produces no additional information.
      
      - There is ongoing work to introduce EH IR constructs other than
        LandingPadInst.  Moving the personality routine off of any one
        particular Instruction and onto the parent function seems a lot better
        than have N different places a personality function can sneak onto an
        exceptional function.
      
      Differential Revision: http://reviews.llvm.org/D10429
      
      llvm-svn: 239940
      7fddeccb
  21. Apr 24, 2015
  22. Mar 13, 2015
    • David Blaikie's avatar
      [opaque pointer type] Add textual IR support for explicit type parameter to gep operator · f72d05bc
      David Blaikie authored
      Similar to gep (r230786) and load (r230794) changes.
      
      Similar migration script can be used to update test cases, which
      successfully migrated all of LLVM and Polly, but about 4 test cases
      needed manually changes in Clang.
      
      (this script will read the contents of stdin and massage it into stdout
      - wrap it in the 'apply.sh' script shown in previous commits + xargs to
      apply it over a large set of test cases)
      
      import fileinput
      import sys
      import re
      
      rep = re.compile(r"(getelementptr(?:\s+inbounds)?\s*\()((<\d*\s+x\s+)?([^@]*?)(|\s*addrspace\(\d+\))\s*\*(?(3)>)\s*)(?=$|%|@|null|undef|blockaddress|getelementptr|addrspacecast|bitcast|inttoptr|zeroinitializer|<|\[\[[a-zA-Z]|\{\{)", re.MULTILINE | re.DOTALL)
      
      def conv(match):
        line = match.group(1)
        line += match.group(4)
        line += ", "
        line += match.group(2)
        return line
      
      line = sys.stdin.read()
      off = 0
      for match in re.finditer(rep, line):
        sys.stdout.write(line[off:match.start()])
        sys.stdout.write(conv(match))
        off = match.end()
      sys.stdout.write(line[off:])
      
      llvm-svn: 232184
      f72d05bc
  23. Feb 27, 2015
    • David Blaikie's avatar
      [opaque pointer type] Add textual IR support for explicit type parameter to load instruction · a79ac14f
      David Blaikie authored
      Essentially the same as the GEP change in r230786.
      
      A similar migration script can be used to update test cases, though a few more
      test case improvements/changes were required this time around: (r229269-r229278)
      
      import fileinput
      import sys
      import re
      
      pat = re.compile(r"((?:=|:|^)\s*load (?:atomic )?(?:volatile )?(.*?))(| addrspace\(\d+\) *)\*($| *(?:%|@|null|undef|blockaddress|getelementptr|addrspacecast|bitcast|inttoptr|\[\[[a-zA-Z]|\{\{).*$)")
      
      for line in sys.stdin:
        sys.stdout.write(re.sub(pat, r"\1, \2\3*\4", line))
      
      Reviewers: rafael, dexonsmith, grosser
      
      Differential Revision: http://reviews.llvm.org/D7649
      
      llvm-svn: 230794
      a79ac14f
    • David Blaikie's avatar
      [opaque pointer type] Add textual IR support for explicit type parameter to... · 79e6c749
      David Blaikie authored
      [opaque pointer type] Add textual IR support for explicit type parameter to getelementptr instruction
      
      One of several parallel first steps to remove the target type of pointers,
      replacing them with a single opaque pointer type.
      
      This adds an explicit type parameter to the gep instruction so that when the
      first parameter becomes an opaque pointer type, the type to gep through is
      still available to the instructions.
      
      * This doesn't modify gep operators, only instructions (operators will be
        handled separately)
      
      * Textual IR changes only. Bitcode (including upgrade) and changing the
        in-memory representation will be in separate changes.
      
      * geps of vectors are transformed as:
          getelementptr <4 x float*> %x, ...
        ->getelementptr float, <4 x float*> %x, ...
        Then, once the opaque pointer type is introduced, this will ultimately look
        like:
          getelementptr float, <4 x ptr> %x
        with the unambiguous interpretation that it is a vector of pointers to float.
      
      * address spaces remain on the pointer, not the type:
          getelementptr float addrspace(1)* %x
        ->getelementptr float, float addrspace(1)* %x
        Then, eventually:
          getelementptr float, ptr addrspace(1) %x
      
      Importantly, the massive amount of test case churn has been automated by
      same crappy python code. I had to manually update a few test cases that
      wouldn't fit the script's model (r228970,r229196,r229197,r229198). The
      python script just massages stdin and writes the result to stdout, I
      then wrapped that in a shell script to handle replacing files, then
      using the usual find+xargs to migrate all the files.
      
      update.py:
      import fileinput
      import sys
      import re
      
      ibrep = re.compile(r"(^.*?[^%\w]getelementptr inbounds )(((?:<\d* x )?)(.*?)(| addrspace\(\d\)) *\*(|>)(?:$| *(?:%|@|null|undef|blockaddress|getelementptr|addrspacecast|bitcast|inttoptr|\[\[[a-zA-Z]|\{\{).*$))")
      normrep = re.compile(       r"(^.*?[^%\w]getelementptr )(((?:<\d* x )?)(.*?)(| addrspace\(\d\)) *\*(|>)(?:$| *(?:%|@|null|undef|blockaddress|getelementptr|addrspacecast|bitcast|inttoptr|\[\[[a-zA-Z]|\{\{).*$))")
      
      def conv(match, line):
        if not match:
          return line
        line = match.groups()[0]
        if len(match.groups()[5]) == 0:
          line += match.groups()[2]
        line += match.groups()[3]
        line += ", "
        line += match.groups()[1]
        line += "\n"
        return line
      
      for line in sys.stdin:
        if line.find("getelementptr ") == line.find("getelementptr inbounds"):
          if line.find("getelementptr inbounds") != line.find("getelementptr inbounds ("):
            line = conv(re.match(ibrep, line), line)
        elif line.find("getelementptr ") != line.find("getelementptr ("):
          line = conv(re.match(normrep, line), line)
        sys.stdout.write(line)
      
      apply.sh:
      for name in "$@"
      do
        python3 `dirname "$0"`/update.py < "$name" > "$name.tmp" && mv "$name.tmp" "$name"
        rm -f "$name.tmp"
      done
      
      The actual commands:
      From llvm/src:
      find test/ -name *.ll | xargs ./apply.sh
      From llvm/src/tools/clang:
      find test/ -name *.mm -o -name *.m -o -name *.cpp -o -name *.c | xargs -I '{}' ../../apply.sh "{}"
      From llvm/src/tools/polly:
      find test/ -name *.ll | xargs ./apply.sh
      
      After that, check-all (with llvm, clang, clang-tools-extra, lld,
      compiler-rt, and polly all checked out).
      
      The extra 'rm' in the apply.sh script is due to a few files in clang's test
      suite using interesting unicode stuff that my python script was throwing
      exceptions on. None of those files needed to be migrated, so it seemed
      sufficient to ignore those cases.
      
      Reviewers: rafael, dexonsmith, grosser
      
      Differential Revision: http://reviews.llvm.org/D7636
      
      llvm-svn: 230786
      79e6c749
  24. May 28, 2014
  25. Aug 16, 2013
    • Daniel Dunbar's avatar
      [tests] Cleanup initialization of test suffixes. · 9efbedfd
      Daniel Dunbar authored
       - Instead of setting the suffixes in a bunch of places, just set one master
         list in the top-level config. We now only modify the suffix list in a few
         suites that have one particular unique suffix (.ml, .mc, .yaml, .td, .py).
      
       - Aside from removing the need for a bunch of lit.local.cfg files, this enables
         4 tests that were inadvertently being skipped (one in
         Transforms/BranchFolding, a .s file each in DebugInfo/AArch64 and
         CodeGen/PowerPC, and one in CodeGen/SI which is now failing and has been
         XFAILED).
      
       - This commit also fixes a bunch of config files to use config.root instead of
         older copy-pasted code.
      
      llvm-svn: 188513
      9efbedfd
  26. Feb 22, 2013
  27. Feb 20, 2013
  28. Jul 02, 2012
    • Chandler Carruth's avatar
      Convert all tests using TCL-style quoting to use shell-style quoting. · a5a29f97
      Chandler Carruth authored
      This was done through the aid of a terrible Perl creation. I will not
      paste any of the horrors here. Suffice to say, it require multiple
      staged rounds of replacements, state carried between, and a few
      nested-construct-parsing hacks that I'm not proud of. It happens, by
      luck, to be able to deal with all the TCL-quoting patterns in evidence
      in the LLVM test suite.
      
      If anyone is maintaining large out-of-tree test trees, feel free to poke
      me and I'll send you the steps I used to convert things, as well as
      answer any painful questions etc. IRC works best for this type of thing
      I find.
      
      Once converted, switch the LLVM lit config to use ShTests the same as
      Clang. In addition to being able to delete large amounts of Python code
      from 'lit', this will also simplify the entire test suite and some of
      lit's architecture.
      
      Finally, the test suite runs 33% faster on Linux now. ;]
      For my 16-hardware-thread (2x 4-core xeon e5520): 36s -> 24s
      
      llvm-svn: 159525
      a5a29f97
  29. Feb 16, 2012
  30. Nov 27, 2011
  31. Sep 01, 2011
  32. Aug 15, 2011
  33. Jun 17, 2011
Loading