1 c-asm loop less and 1x unroll of float_to_int16_sse()