Skip to content
  1. Aug 30, 2014
    • Hal Finkel's avatar
      Fix AddAliasScopeMetadata to not add scopes when deriving from unknown pointers · a3708df4
      Hal Finkel authored
      The previous implementation of AddAliasScopeMetadata, which adds noalias
      metadata to preserve noalias parameter attribute information when inlining had
      a flaw: it would add alias.scope metadata to accesses which might have been
      derived from pointers other than noalias function parameters. This was
      incorrect because even some access known not to alias with all noalias function
      parameters could easily alias with an access derived from some other pointer.
      Instead, when deriving from some unknown pointer, we cannot add alias.scope
      metadata at all. This fixes a miscompile of the test-suite's tramp3d-v4.
      Furthermore, we cannot add alias.scope to functions unless we know they
      access only argument-derived pointers (currently, we know this only for
      memory intrinsics).
      
      Also, we fix a theoretical problem with using the NoCapture attribute to skip
      the capture check. This is incorrect (as explained in the comment added), but
      would not matter in any code generated by Clang because we get only inferred
      nocapture attributes in Clang-generated IR.
      
      This functionality is not yet enabled by default.
      
      llvm-svn: 216818
      a3708df4
    • David Majnemer's avatar
      InstCombine: Respect recursion depth in visitUDivOperand · 492e612e
      David Majnemer authored
      llvm-svn: 216817
      492e612e
    • David Majnemer's avatar
      InstCombine: Try harder to combine icmp instructions · 5e96f1b4
      David Majnemer authored
      consider: (and (icmp X, Y), (and Z, (icmp A, B)))
      It may be possible to combine (icmp X, Y) with (icmp A, B).
      If we successfully combine, create an 'and' instruction with Z.
      
      This fixes PR20814.
      
      N.B. There is room for improvement after this change but I'm not
      convinced it's worth chasing yet.
      
      llvm-svn: 216814
      5e96f1b4
  2. Aug 29, 2014
  3. Aug 28, 2014
  4. Aug 27, 2014
  5. Aug 26, 2014
  6. Aug 25, 2014
  7. Aug 24, 2014
    • David Majnemer's avatar
      InstCombine: Properly optimize or'ing bittests together · 0ffccf7f
      David Majnemer authored
      CFE, with -03, would turn:
      bool f(unsigned x) {
        bool a = x & 1;
        bool b = x & 2;
        return a | b;
      }
      
      into:
        %1 = lshr i32 %x, 1
        %2 = or i32 %1, %x
        %3 = and i32 %2, 1
        %4 = icmp ne i32 %3, 0
      
      This sort of thing exposes a nasty pathology in GCC, ICC and LLVM.
      
      Instead, we would rather want:
        %1 = and i32 %x, 3
        %2 = icmp ne i32 %1, 0
      
      Things get a bit more interesting in the following case:
        %1 = lshr i32 %x, %y
        %2 = or i32 %1, %x
        %3 = and i32 %2, 1
        %4 = icmp ne i32 %3, 0
      
      Replacing it with the following sequence is better:
        %1 = shl nuw i32 1, %y
        %2 = or i32 %1, 1
        %3 = and i32 %2, %x
        %4 = icmp ne i32 %3, 0
      
      This sequence is preferable because %1 doesn't involve %x and could
      potentially be hoisted out of loops if it is invariant; only perform
      this transform in the non-constant case if we know we won't increase
      register pressure.
      
      llvm-svn: 216343
      0ffccf7f
  8. Aug 23, 2014
    • Jingyue Wu's avatar
      [SROA] Fold a PHI node if all its incoming values are the same · ec33fa9a
      Jingyue Wu authored
      Summary:
      Fixes PR20425.
      
      During slice building, if all of the incoming values of a PHI node are the same, replace the PHI node with the common value. This simplification makes alloca's used by PHI nodes easier to promote.
      
      Test Plan: Added three more tests in phi-and-select.ll
      
      Reviewers: nlewycky, eliben, meheff, chandlerc
      
      Reviewed By: chandlerc
      
      Subscribers: zinovy.nis, hfinkel, baldrick, llvm-commits
      
      Differential Revision: http://reviews.llvm.org/D4659
      
      llvm-svn: 216299
      ec33fa9a
  9. Aug 22, 2014
  10. Aug 21, 2014
Loading