0aa39ff054af16be236ee34f4ead2cdc258a48fc - aom

commit	0aa39ff054af16be236ee34f4ead2cdc258a48fc
author	David Barker <david.barker@argondesign.com>	Tue May 23 12:53:08 2017 +0100
committer	Debargha Mukherjee <debargha@google.com>	Fri May 26 18:50:20 2017 +0000
tree	de5a9496b9da95d96d2f0f57ee23109fe99b7a27
parent	b9f68d278a935b023510ea1ee38383e536857980

ext-inter: Vectorize new masked SAD/SSE functions

We would expect these new functions to be slower than the old
masked SAD/SSE functions, as they do additional work (blending
two inputs and comparing the result to a third, rather than
just comparing two inputs).
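
To make the "blend two inputs, compare to a third" description concrete, here is a minimal scalar sketch in C. It is not the code added by this commit: the function name `masked_sad_sketch`, the argument order, and the assumption that mask values lie in 0..64 with a rounded 6-bit shift are all illustrative choices made for this example.

```c
#include <stdint.h>
#include <stdlib.h>

/* Assumption for this sketch: mask weights range over 0..64. */
#define SKETCH_MASK_BITS 6
#define SKETCH_MASK_MAX (1 << SKETCH_MASK_BITS)

/* Hypothetical scalar reference: blend predictors a and b with a per-pixel
 * mask, then accumulate the absolute difference against ref. */
static unsigned int masked_sad_sketch(const uint8_t *ref, int ref_stride,
                                      const uint8_t *a, int a_stride,
                                      const uint8_t *b, int b_stride,
                                      const uint8_t *mask, int mask_stride,
                                      int width, int height) {
  unsigned int sad = 0;
  for (int y = 0; y < height; ++y) {
    for (int x = 0; x < width; ++x) {
      const int m = mask[x];
      /* Weighted average of the two inputs, rounded to nearest. */
      const int blended = (m * a[x] + (SKETCH_MASK_MAX - m) * b[x] +
                           (SKETCH_MASK_MAX >> 1)) >> SKETCH_MASK_BITS;
      sad += (unsigned int)abs(blended - ref[x]);
    }
    ref += ref_stride;
    a += a_stride;
    b += b_stride;
    mask += mask_stride;
  }
  return sad;
}
```

Compared with a two-input masked SAD, each pixel costs two extra multiplies, an add, and a shift before the difference is taken, which is consistent with the expectation stated above that the new functions do more work.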

This is true for the SAD functions, which are about 50% slower
(depending on block size and bit depth). However, the sub-pixel
SSE functions are comparable in speed to the old ones for the
accelerated special cases (xoffset or yoffset = 0 or 4), and are
40-90% faster in the generic case.
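
The special cases mentioned above can be illustrated with a small sketch. It assumes, as a labeled assumption rather than something stated in this commit, that offsets are in eighth-pel units and select a two-tap bilinear filter with weights (128 - 16 * offset, 16 * offset) and a 7-bit rounded shift. Under that assumption, offset 0 is a plain copy and offset 4 is a rounded average, so both can skip the multiplies. The function names are hypothetical.

```c
#include <stdint.h>
#include <string.h>

/* Generic path: two-tap bilinear filter along a row.
 * Assumes offset is in 0..7 and src has width + 1 readable samples. */
static void bilinear_row_generic(const uint8_t *src, uint8_t *dst, int width,
                                 int offset) {
  const int w1 = 16 * offset; /* weight of the right-hand sample */
  const int w0 = 128 - w1;    /* weight of the left-hand sample */
  for (int x = 0; x < width; ++x) {
    dst[x] = (uint8_t)((src[x] * w0 + src[x + 1] * w1 + 64) >> 7);
  }
}

/* Dispatching path: take the cheap special cases when they apply. */
static void bilinear_row_dispatch(const uint8_t *src, uint8_t *dst, int width,
                                  int offset) {
  if (offset == 0) {
    /* Weights (128, 0): the output is the input. */
    memcpy(dst, src, (size_t)width);
  } else if (offset == 4) {
    /* Weights (64, 64): a rounded average of neighbouring samples. */
    for (int x = 0; x < width; ++x) {
      dst[x] = (uint8_t)((src[x] + src[x + 1] + 1) >> 1);
    }
  } else {
    bilinear_row_generic(src, dst, width, offset);
  }
}
```

In a SIMD implementation the offset 4 branch maps naturally onto a byte-averaging instruction such as `_mm_avg_epu8`, which computes the same rounded average, while the generic branch needs a multiply-accumulate per sample.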

Change-Id: I1a296ed8fc9e3edc313a6add516ff76b17cd3e9f

aom_dsp/aom_dsp.mk
aom_dsp/aom_dsp_rtcd_defs.pl
aom_dsp/x86/masked_sad_intrin_ssse3.c (added)
aom_dsp/x86/masked_variance_intrin_ssse3.c (added)
test/masked_sad_test.cc
test/masked_variance_test.cc

6 files changed