math: Add fdim implementation
Based on the amd-builtin, but explicitly vectorized for all sizes (not just float4), and includes a vectorized double implementation. Passes piglit (float) tests on pitcairn. Signed-off-by:Aaron Watry <awatry@gmail.com> Reviewed-by:
Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 268708
Loading
Please sign in to comment