HIP: Directly call fabs builtins
All the attribute numbering changes in the tests are due to the ocml function being assumed convergent, whereas the direct builtin call is not.

This begins a series of patches to avoid using the thinnest ocml wrappers around llvm intrinsics. These wrapper functions aren't buying anything and add complexity. Because we do not propagate fast math flags into the library on linking, and cannot generally do so safely, we lose information by calling into them. Directly call the builtins so that the fast math flags are set appropriately for the translation unit.
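
As a rough illustration of the shape of the change (not the actual header diff), the sketch below contrasts routing fabs through a thin wrapper with calling the compiler builtin directly. The names `wrapper_fabs_f32`, `fabs_via_wrapper`, and `fabs_direct` are hypothetical stand-ins; only `__builtin_fabsf` is a real Clang/GCC builtin. It assumes a host compiler rather than a HIP device compilation.

```cpp
// Hypothetical stand-in for a thin library wrapper around an intrinsic.
// In the real setup the wrapper lives in a separately built device library,
// so it does not see the fast math flags of the calling translation unit.
extern "C" float wrapper_fabs_f32(float x) { return __builtin_fabsf(x); }

// Before: the header forwards to the library wrapper.
inline float fabs_via_wrapper(float x) { return wrapper_fabs_f32(x); }

// After: the header calls the builtin directly, so the compiler emits the
// fabs operation in the caller's translation unit with that unit's flags.
inline float fabs_direct(float x) { return __builtin_fabsf(x); }
```

Both forms return the same value; the difference is only in where the operation is emitted and therefore which flags and attributes apply to it.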