commit | 81ad95363a1a5892fb4cac6ff8c2e27c40344049 | [log] [tgz] |
---|---|---|
author | Yi Luo <luoyi@google.com> | Tue Jun 21 12:17:39 2016 -0700 |
committer | Yi Luo <luoyi@google.com> | Thu Jun 23 12:56:47 2016 -0700 |
tree | 880106ef562f966d861b74dc4a9bc2ee2d1255ca | |
parent | 76ff9b3097b645b6679cff86fe9eee97462c4cdb [diff] |
Convolution vertical filter SSSE3 optimization - Apply 8-pixel vertical filtering direction parallelism. - Add unit tests to verify bit exact. - Encoder speed improves ~29% (enable EXT_INTERP) on Xeon E5-2680. - Combinational cycle count of vp10_convolve() drops from 26.06% to 6.73%. Change-Id: Ic1ae48f8fb1909991577947a8c00d07832737e57