commit | 770bf715035a5726a06e7e0885b14296f846640f | [log] [tgz] |
---|---|---|
author | Yi Luo <luoyi@google.com> | Fri Mar 25 16:48:19 2016 -0700 |
committer | Yi Luo <luoyi@google.com> | Fri Mar 25 16:48:19 2016 -0700 |
tree | c7639f7efb5c75920550f21c381e9fd92757f45d | |
parent | f9d77d66e6e9eef2c516e1277fdca3d7e110cae3 [diff] |
8x8/16x16 HT types V_DCT to H_FLIPADST SSE2 optimization - Wrote function: fidtx8_sse2() and fidtx16_sse2(). - Turned on vp10_fht8x8_sse2()/vp10_fht16x16_sse2() for new types. - Updated 8x8/16x16 unit tests for accuracy/speed. - Running 20K times with random numbers and getting through tx type from V_DCT to H_FLIPADST, SSE2 speed improvement: 8x8: ~131% 16x16: ~66% Change-Id: Ibbb707e932a08fec3b1f423a7dab280a1d696c9a