[FFmpeg-devel] [PATCH] VP8 V simple loopfilter in MMX/MMX2/SSE2
Ronald S. Bultje
Fri Jul 2 23:07:24 CEST 2010
On Thu, Jul 1, 2010 at 11:10 AM, Ronald S. Bultje <rsbultje at gmail.com> wrote:
> On Thu, Jul 1, 2010 at 10:46 AM, Ronald S. Bultje <rsbultje at gmail.com> wrote:
>> On Thu, Jul 1, 2010 at 10:32 AM, Ronald S. Bultje <rsbultje at gmail.com> wrote:
>>> see attached, my first try at doing a loopfilter in SIMD.
>>> C takes about 660 cycles for the main MB one of 1760 for the 3
>>> together in the splitmv case. MMX didn't really measure since it's
>>> only 1 instruction difference as per MMX2 (I just tested that they
>>> gave identical output). MMX2 takes 190/350 cycles, SSE2 takes 180/330
>>> cycles (which is weird, should be faster, but who knows what my crappy
>>> CPU is doing, this machine is 5 yrs old - Intel Core Duo 2GHz on a
>>> MacBook Pro).
>> Now with vp8dsp-init.c changes also.
> Now with proper alignment for constants, thanks to Vitor for noticing.
H loop filter changes attached separately.
-------------- next part --------------
A non-text attachment was scrubbed...
Size: 7518 bytes
Desc: not available
More information about the ffmpeg-devel