[LoopUtils] reduce FMF and min/max complexity when forming reductions
I don't know if there's some way this changes what the vectorizers
may produce for reductions, but I have added test coverage with
3567908d and 5ced712e to show that both passes already have bugs in
this area. Hopefully this does not make things worse before we can
really fix it.