Skip to content
  1. Aug 12, 2013
  2. Aug 11, 2013
    • Reed Kotler's avatar
      Don't generate floating point stubs for mips16 code if the function · d265e888
      Reed Kotler authored
      is actually an instrinsic that will not occur in libc. This list here
      is not exhaustive but fixes the one places in test-suite where this occurs.
      I have filed a bug against myself to research the full list and add them
      to the array of such cases. In the future, actual stub generation will occur
      in a later phase and we won't need this code because we will know at that time
      during the compilation that in fact no helper function was even needed.
      
      llvm-svn: 188149
      d265e888
    • Elena Demikhovsky's avatar
      AVX-512: Added more tests for BROADCAST · 5fed3b95
      Elena Demikhovsky authored
      llvm-svn: 188148
      5fed3b95
    • Elena Demikhovsky's avatar
      AVX-512: Added VPERM* instructons and MOV* zmm-to-zmm instructions. · cf5b1458
      Elena Demikhovsky authored
      Added a test for shuffles using VPERM.
      
      llvm-svn: 188147
      cf5b1458
    • Chandler Carruth's avatar
      Re-instate r187323 which fast-tracks promotable allocas as soon as the · d7cd7e36
      Chandler Carruth authored
      SROA-based analysis has enough information. This should work now that
      both mem2reg *and* the SSAUpdater-based AllocaPromoter have been updated
      to be able to promote the types of allocas that the SROA analysis
      detects.
      
      I've included tests for the AllocaPromoter that were only possible to
      write once we fast-tracked promotable allocas without rewriting them.
      This includes a test both for r187347 and r188145.
      
      Original commit log for r187323:
      """
      Now that mem2reg understands how to cope with a slightly wider set of uses of
      an alloca, we can pre-compute promotability while analyzing an alloca for
      splitting in SROA. That lets us short-circuit the common case of a bunch of
      trivially promotable allocas. This cuts 20% to 30% off the run time of SROA for
      typical frontend-generated IR sequneces I'm seeing. It gets the new SROA to
      within 20% of ScalarRepl for such code. My current benchmark for these numbers
      is PR15412, but it fits the general pattern of IR emitted by Clang so it should
      be widely applicable.
      """
      
      llvm-svn: 188146
      d7cd7e36
    • Chandler Carruth's avatar
      Finish fixing the SSAUpdater-based AllocaPromoter strategy in SROA to cope with · c17283b4
      Chandler Carruth authored
      the more general set of patterns that are now handled by mem2reg and that we
      can detect quickly while doing SROA's initial analysis. Notably, this allows it
      to promote through no-op bitcast and GEP sequences. A core part of the
      SSAUpdater approach is the ability to test whether a particular instruction is
      part of the set being promoted. Testing this becomes significantly more complex
      in the world where the operand to every load and store isn't the alloca itself.
      I ended up using the approach of walking up the def-chain until we find the
      alloca. I benchmarked this against keeping a set of pointer operands and
      keeping a set of the loads and stores we care about, and this one seemed faster
      although the difference was very small.
      
      No test case yet because currently the rewriting always "fixes" the inputs to
      not require this. The next patch which re-enables early promotion of easy cases
      in SROA will include a test case that specifically exercises this aspect of the
      alloca promoter.
      
      llvm-svn: 188145
      c17283b4
    • Chandler Carruth's avatar
      Reformat some bits of AllocaPromoter and simplify the name and type of · 45b136f4
      Chandler Carruth authored
      our visiting datastructures in the AllocaPromoter/SSAUpdater path of
      SROA. Also shift the order if clears around to be more consistent.
      
      No functionality changed here, this is just a cleanup.
      
      llvm-svn: 188144
      45b136f4
    • Reed Kotler's avatar
      Incorrect JAL instruction attributes caused the optimizer to make a wrong · 705c5951
      Reed Kotler authored
      instruction move. Just affects static relocation. -static works fine now
      with mips16 for the most part.
      
      llvm-svn: 188143
      705c5951
  3. Aug 10, 2013
  4. Aug 09, 2013
Loading