convolve_copy_sse2: replace SSE w/SSE2 code

this should be neutral or slightly faster on modern (P4+) architectures

Change-Id: Iec4c080275941eb8c9e05a66a2daf0405d86a69b
1 file changed