vp9: implement top/left half (4x4) sub-8x8-IDCT.
authorRonald S. Bultje <rsbultje@gmail.com>
Mon, 2 Dec 2013 02:27:34 +0000 (21:27 -0500)
committerRonald S. Bultje <rsbultje@gmail.com>
Sat, 7 Dec 2013 17:39:36 +0000 (12:39 -0500)
commit92436e8ad9bd4d44fba47e0b95622529108a96e5
tree0fe0c475841476601a9296235e56eacdd4c8c3f8
parentb2045c44a94db4df97fdd753ea738fcf578d65d8
vp9: implement top/left half (4x4) sub-8x8-IDCT.

For that specific case (eob>3&&eob<=12), runtime of idct8x8 goes from
668 to 477 cycles. For all idct8x8, runtime goes from 521 to 490 cycles.
libavcodec/x86/vp9itxfm.asm