Commit 06cab79e authored Aug 30, 2017 by Stanislav Mekhanoshin

[AMDGPU] Use v_max_f* for fcanonicalize

If denorms are not flushed, we can use max instead of multiplication
by 1.0. For double, max is simply faster; for float and half it is
shorter, because the mul form uses the constant bus and the VOP3 encoding.
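
The equivalence this relies on can be sketched on the host. This is an illustration only, not the AMDGPU lowering from this patch: the helper names are hypothetical, the host is assumed to run in its default IEEE mode (no denormal flushing, no fast-math flags), and NaN inputs are left out because host fmax and multiplication may treat signaling NaNs differently.

```cpp
#include <cassert>
#include <cmath>
#include <cstdint>
#include <cstring>
#include <limits>

// Hypothetical helpers, not LLVM code: two host-side ways to pass a
// non-NaN float through an arithmetic operation that leaves it unchanged.
static float canonicalize_via_mul(float x) { return x * 1.0f; }
static float canonicalize_via_max(float x) { return std::fmax(x, x); }

// Compare raw bit patterns so that -0.0f and +0.0f are told apart.
static std::uint32_t bits(float x) {
  std::uint32_t u;
  std::memcpy(&u, &x, sizeof u);
  return u;
}

// True when both forms return the input unchanged, bit for bit.
// Assumes x is not a NaN and that the host does not flush denormals.
static bool forms_agree(float x) {
  return bits(canonicalize_via_mul(x)) == bits(x) &&
         bits(canonicalize_via_max(x)) == bits(x);
}

// Spot-check a few inputs, including the smallest positive denormal.
static void check_examples() {
  assert(forms_agree(1.5f));
  assert(forms_agree(-0.0f));
  assert(forms_agree(std::numeric_limits<float>::denorm_min()));
}
```

Calling check_examples() passes silently when the assumptions above hold. If denormals were flushed, either form could turn the denormal input into zero, so the check would no longer be meaningful.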

Differential Revision: https://reviews.llvm.org/D36856

llvm-svn: 312095

parent 5a2898ae
