Lowbd rect intrapred DC/LEFT/TOP/128 sse2 optimization
Add lowbd unit test functionality to intrapred_test.cc
Function speedup against C (i7-6700):
Predictor DC LEFT TOP 128
4x8 ~1.4x ~1.4x ~1.7x ~1.9x
8x4 ~1.2x ~1.6x ~1.6x ~2.6x
8x16 ~1.4x ~1.3x ~1.4x ~2.1x
16x8 ~2.0x ~1.8x ~2.3x ~2.1x
16x32 ~2.0x ~1.9x ~1.8x ~2.2x
32x16 ~2.0x ~2.0x ~1.9x ~2.2x
Change-Id: I33db512020ca3c6853a9205a8079f3d00134f584
diff --git a/aom_dsp/aom_dsp.cmake b/aom_dsp/aom_dsp.cmake
index 0da4392..89f294b 100644
--- a/aom_dsp/aom_dsp.cmake
+++ b/aom_dsp/aom_dsp.cmake
@@ -45,6 +45,7 @@
set(AOM_DSP_COMMON_INTRIN_SSE2
"${AOM_ROOT}/aom_dsp/x86/aom_asm_stubs.c"
"${AOM_ROOT}/aom_dsp/x86/convolve.h"
+ "${AOM_ROOT}/aom_dsp/x86/intrapred_sse2.c"
"${AOM_ROOT}/aom_dsp/x86/txfm_common_sse2.h"
"${AOM_ROOT}/aom_dsp/x86/lpf_common_sse2.h"
"${AOM_ROOT}/aom_dsp/x86/loopfilter_sse2.c")