[RISCV] Optimize VRELOAD/VSPILL lowering if VLEN is known. (#74421)
Instead of using VLENB and a shift, load (VLEN/8)*LMUL directly into a register. We could go further and use ADDI, but that would be more intrusive to the code structure. My primary goal is to remove the read of VLENB which might be expensive if it's not optimized in hardware.
Loading
Please sign in to comment