Skip to content
  1. Dec 10, 2017
  2. Dec 09, 2017
  3. Dec 08, 2017
    • Craig Topper's avatar
      [X86] Teach lowering to only let through (insert_subvector (vXi1 zeros),... · 7f0d456e
      Craig Topper authored
      [X86] Teach lowering to only let through (insert_subvector (vXi1 zeros), subvec, 0) for vector sizes that have native KSHIFT support.
      
      For narrow sizes we'll widen the zero vector and widen the insert. Then do an extract_subvector to get back down to correct size.
      
      This allows us to remove some patterns from the isel table that had to COPY_TO_REGCLASS to an oversized register, do the shift and then COPY_TO_REGCLASS back to the narrow register. Now this is represented explicitly in the DAG.
      
      This seems to have perturbed the register allocation in one of the tests, but the number of instructions didn't change.
      
      llvm-svn: 320190
      7f0d456e
    • Simon Pilgrim's avatar
      [X86][X87] Tag x87 float compare instructions scheduler classes · 6415f56c
      Simon Pilgrim authored
      llvm-svn: 320189
      6415f56c
    • Simon Pilgrim's avatar
      [X86][SHA] Tag SHA instructions scheduler classes · 19d460b0
      Simon Pilgrim authored
      Put these under VecIMul itinerary classes for now - seems to be a good average value
      
      llvm-svn: 320161
      19d460b0
    • Gadi Haber's avatar
      [X86][Haswell]: Updating the scheduling information for the Haswell subtarget. · 2cf601f2
      Gadi Haber authored
      Updated the scheduling information for the Haswell subtarget with the following changes:
      
      Regrouped the instructions after adding appropriate load + store latencies.
      Added scheduling for missing instructions such as the GATHER instrs.
      The changes were made after revisiting the latencies impact of all memory uOps.
      
      Reviewers: RKSimon, zvi, craig.topper, apilipenko
      Differential Revision: https://reviews.llvm.org/D40021
      
      Change-Id: Iaf6c1f5169add1552845a8a566af4e5a359217a7
      llvm-svn: 320137
      2cf601f2
    • Craig Topper's avatar
      [X86] Handle alls version of vXi1 insert_vector_elt with a constant index... · 323ba39f
      Craig Topper authored
      [X86] Handle alls version of vXi1 insert_vector_elt with a constant index without falling back to shuffles.
      
      We previously only supported inserting to the LSB or MSB where it was easy to zero to perform an OR to insert.
      
      This change effectively extracts the old value and the new value, xors them together and then xors that single bit with the correct location in the original vector. This will cancel out the old value in the first xor leaving the new value in the position.
      
      The way I've implemented this uses 3 shifts and two xors and uses an additional register. We can avoid the additional register at the cost of another shift.
      
      llvm-svn: 320120
      323ba39f
  4. Dec 07, 2017
Loading