- Dec 10, 2017
-
-
Simon Pilgrim authored
llvm-svn: 320334
-
Simon Pilgrim authored
llvm-svn: 320333
-
Simon Pilgrim authored
Include i686 (non-REX) variant tests as well llvm-svn: 320332
-
Simon Pilgrim authored
llvm-svn: 320331
-
Simon Pilgrim authored
llvm-svn: 320330
-
Simon Pilgrim authored
llvm-svn: 320329
-
Simon Pilgrim authored
llvm-svn: 320328
-
Craig Topper authored
This matches AVX512 version and is more consistent overall. And improves our scheduler models. In some cases this adds _Int to instructions that didn't have any Int_ before. It's a side effect of the adjustments made to some of the multiclasses. llvm-svn: 320325
-
Simon Pilgrim authored
llvm-svn: 320322
-
Simon Pilgrim authored
Split off some 'n' instruction versions to make it clearer when WAIT is being inserted llvm-svn: 320321
-
Craig Topper authored
[X86] Rename some instructions from 'rb' to 'rrb' to make 'b' a proper suffix. Fix the scheduling information for some of them. Some of the scheduling information was only present for the 'rb' version' and not the 'rr' version. Now we match 'rr(b?)' llvm-svn: 320320
-
Craig Topper authored
llvm-svn: 320319
-
Simon Pilgrim authored
Locally tag COPY as WriteMove, which has caused some reg-reg + reg-mem instruction tests to reorder. llvm-svn: 320308
-
Simon Pilgrim authored
llvm-svn: 320307
-
Simon Pilgrim authored
llvm-svn: 320305
-
Craig Topper authored
Based on the fact that the 'Y' version of the instruction is next to this, I assume Z256 is the intended value. llvm-svn: 320295
-
Craig Topper authored
The VEX versions were present but not the legacy SSE versions. llvm-svn: 320294
-
Craig Topper authored
[X86] Correct the _Int part of more scheduler model instrexes. Put _b in the correct order relative to _Int llvm-svn: 320282
-
Craig Topper authored
llvm-svn: 320280
-
Craig Topper authored
[X86] Fix bad regular expressions in the scheduler models. Question marks should be outside of multicharacter parenthesized expressions If the question mark is inside the parentheses it only applies to the single character proceeding it. I had to make a few additional cleanups to fix some duplicate warnings that were exposed by fixing this. llvm-svn: 320279
-
Joel Jones authored
This patch improves performance on Cavium T99 as shown here (libquantum 0.2.4): https://docs.google.com/spreadsheets/d/1Lo1o2E1NjrpkwS7DvYYWsiVvPdd93h7KBaqeptMrZPY/edit?usp=sharing By increasing the LoopMicroOpsBufferSize in the Cavium T99 Scheduler file, loop unrolling becomes more aggressive. This helps performance on T99. Test case included. Patch by Stefan Teleman Differential Revision: https://reviews.llvm.org/D40695 llvm-svn: 320272
-
Craig Topper authored
llvm-svn: 320268
-
- Dec 09, 2017
-
-
Craig Topper authored
[X86] Improve lowering of vXi1 insert_subvectors to better utilize (insert_subvector zero, vec, 0) for zeroing upper bits. This can be better recognized during isel when the producer already zeroed the upper bits. llvm-svn: 320267
-
Craig Topper authored
llvm-svn: 320260
-
Craig Topper authored
[X86] When inserting into the upper bits of a vXi1 vector, make sure we shift enough bits if we widened the vector. We may need to widen the vector to make the shifts legal, but if we do that we need to make sure we shift left/right after accounting for the new size. If not we can't guarantee we are shifting in zeros. The test cases affected actually show cases where we should move the shifts all together, but that's another problem. llvm-svn: 320248
-
Dylan McKay authored
These were broken because of various printing format changes. llvm-svn: 320246
-
Craig Topper authored
We were previously using kunpck with zero inputs unnecessarily. And we had cases where we would insert into a zero vector and then insert into larger zero vector incurring two sets of shifts. llvm-svn: 320244
-
Dylan McKay authored
Summary: This relaxes an assertion inside SelectionDAGBuilder which is overly restrictive on targets which have no concept of alignment (such as AVR). In these architectures, all types are aligned to 8-bits. After this, LLVM will only assert that accesses are aligned on targets which actually require alignment. This patch follows from a discussion on llvm-dev a few months ago http://llvm.1065342.n5.nabble.com/llvm-dev-Unaligned-atomic-load-store-td112815.html Reviewers: bogner, nemanjai, joerg, efriedma Reviewed By: efriedma Subscribers: efriedma, cactus, llvm-commits Differential Revision: https://reviews.llvm.org/D39946 llvm-svn: 320243
-
Jessica Paquette authored
The outliner previously would never outline calls. Calls are pretty common in files, so it makes sense to outline them. In fact, in the LLVM test suite, if you count the number of instructions that the outliner misses when you outline calls vs when you don't, it turns out that, on average, around 6% of the instructions encountered are calls. So, if we outline calls, we can find more candidates, and thus save some more space. This commit adds that functionality and updates the mir test to reflect that. llvm-svn: 320229
-
Paul Robinson authored
MachineSink attempts to place instructions near the basic blocks where they are needed. Once an instruction has been sunk, its location relative to other instructions no longer is consistent with the original source code. In order to ensure correct stepping in the debugger, the debug location for sunk instructions is either merged with the insertion point or erased if the target successor block is empty. Originally submitted as r318679, revised to fix sanitizer failure and improve testing. Patch by Matthew Voss! Differential Revision: https://reviews.llvm.org/D39933 llvm-svn: 320216
-
- Dec 08, 2017
-
-
Dan Gohman authored
This puts the functionality under control of a command-line option which is off by default to avoid breaking existing setups. llvm-svn: 320197
-
Dan Gohman authored
This includes a fix so that it doesn't transform declarations, and it puts the functionality under control of a command-line option which is off by default to avoid breaking existing setups. llvm-svn: 320196
-
Konstantin Zhuravlyov authored
- Add gfx704 - Change bonaire to gfx704 - Remove gfx804 - Remove gfx901 - Remove gfx903 Differential Revision: https://reviews.llvm.org/D40046 llvm-svn: 320194
-
Craig Topper authored
[X86] Teach lowering to only let through (insert_subvector (vXi1 zeros), subvec, 0) for vector sizes that have native KSHIFT support. For narrow sizes we'll widen the zero vector and widen the insert. Then do an extract_subvector to get back down to correct size. This allows us to remove some patterns from the isel table that had to COPY_TO_REGCLASS to an oversized register, do the shift and then COPY_TO_REGCLASS back to the narrow register. Now this is represented explicitly in the DAG. This seems to have perturbed the register allocation in one of the tests, but the number of instructions didn't change. llvm-svn: 320190
-
Simon Pilgrim authored
llvm-svn: 320189
-
Matt Arsenault authored
llvm-svn: 320187
-
Konstantin Zhuravlyov authored
Differential Revision: https://reviews.llvm.org/D40924 llvm-svn: 320176
-
Simon Pilgrim authored
Put these under VecIMul itinerary classes for now - seems to be a good average value llvm-svn: 320161
-
Gadi Haber authored
Updated the scheduling information for the Haswell subtarget with the following changes: Regrouped the instructions after adding appropriate load + store latencies. Added scheduling for missing instructions such as the GATHER instrs. The changes were made after revisiting the latencies impact of all memory uOps. Reviewers: RKSimon, zvi, craig.topper, apilipenko Differential Revision: https://reviews.llvm.org/D40021 Change-Id: Iaf6c1f5169add1552845a8a566af4e5a359217a7 llvm-svn: 320137
-
Abderrazek Zaafrani authored
Replace interleaved store instructions by equivalent and more efficient instructions based on latency cost model. Https://reviews.llvm.org/D38196 llvm-svn: 320123
-