vf_blend: Add SSE2 optimization for multiply