Commit 72a7024a authored Aug 16, 2023 by Matt Arsenault Committed by Matt Arsenault Sep 12, 2023

AMDGPU: Correctly lower llvm.sqrt.f32

Make codegen emit correctly rounded sqrt by default.

Emit the fast but only kind of fast expansion in AMDGPUCodeGenPrepare
based on !fpmath, like the fdiv case. Hack around visitation ordering
problems from AMDGPUCodeGenPrepare using forward iteration instead of
a well behaved combiner.

https://reviews.llvm.org/D158129

parent 64751ea2

Expand all Show whitespace changes

Inline Side-by-side

Please to comment