[FFmpeg-devel] [PATCH] Add x86-optimized versions of exponent_min().

Loren Merritt lorenm
Mon Jan 31 20:53:24 CET 2011


On Mon, 31 Jan 2011, Justin Ruggles wrote:

> I get some very weird mmx2 results when I remove the first sub and
> change jae to ja.
>
> Athlon64 X2 6000+
> sse2: 3006 -> 2753
> mmx2: 5228 -> 5453
>  mmx: 5490 -> 5430
>
> Atom 330
> sse2:  6834 -> 3779
> mmx2:  9951 -> 10525
>  mmx: 11390 -> 11325
>
> Both CPUs are consistent in the change, except that on Athlon64 the mmx2
> version is slower than the mmx version.  What do you suggest?

I usually blame such weird results on code alignment, but I have no 
systematic way to fix them.

--Loren Merritt
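
To put the quoted timings in perspective, the relative change of each
variant can be worked out directly from the numbers above. The sketch
below is illustrative only: the name `pct_change` and the dictionary
layout are assumptions made for this example, not part of the patch,
and the thread does not state the units of the timings.

```python
# Timings quoted in the thread as (before, after) pairs; units are not stated.
timings = {
    "Athlon64 X2 6000+": {"sse2": (3006, 2753), "mmx2": (5228, 5453), "mmx": (5490, 5430)},
    "Atom 330": {"sse2": (6834, 3779), "mmx2": (9951, 10525), "mmx": (11390, 11325)},
}


def pct_change(before, after):
    """Relative change in percent; a negative value means the count went down."""
    return 100.0 * (after - before) / before


# Print one line per CPU and variant with the signed percentage change.
for cpu, variants in timings.items():
    for name, (before, after) in variants.items():
        print(f"{cpu:18s} {name:5s} {pct_change(before, after):+6.1f}%")
```

Assuming lower numbers are better, only the mmx2 variant regresses on
both CPUs, and only by a few percent, while sse2 improves on both.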


