We can't make the -4 versions inline, since we use ifuncs for them, so make vectorized versions. Test included.