[FFmpeg-devel] looking for comparison of intrinsics vs hand written asm

Loren Merritt lorenm
Sat Oct 31 01:21:04 CET 2009


On Fri, 30 Oct 2009, Frank Barchard wrote:

> asm has 2 advantages
> 1. more control over order, letting you tune to architecture, i.e. you can
> pair instructions.

You don't need to tune to a specific CPU in order to beat the compiler's
scheduling. Just write the asm in any old naive order, and it will still
be faster than auto-scheduled intrinsics on every CPU at once.
(As long as the CPU does at least a little out-of-order execution. OTOH,
if it's strictly in-order, then maybe the compiler can do better than no
scheduling at all; I've never tried to optimize for such a case.)
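
To make the two approaches concrete, here is a minimal sketch of the same
operation written both ways. It is not taken from FFmpeg or x264: the function
names (add8_s16_intrin, add8_s16_asm, add8_match) are hypothetical, and it
assumes GCC or Clang on x86-64 with AT&T-syntax inline asm. On other targets it
falls back to plain C so that it still builds. With intrinsics the compiler
chooses the registers and the instruction order; with asm the instructions are
emitted in exactly the naive order written, and any reordering is left to the
CPU's out-of-order hardware.

```c
#include <stdint.h>
#include <string.h>

#if defined(__GNUC__) && defined(__x86_64__)
#include <emmintrin.h>

/* Intrinsics: the compiler allocates registers and schedules the code. */
static void add8_s16_intrin(int16_t *dst, const int16_t *a, const int16_t *b)
{
    __m128i va = _mm_loadu_si128((const __m128i *)a);
    __m128i vb = _mm_loadu_si128((const __m128i *)b);
    _mm_storeu_si128((__m128i *)dst, _mm_add_epi16(va, vb));
}

/* Inline asm: emitted in the order written, with no compiler scheduling. */
static void add8_s16_asm(int16_t *dst, const int16_t *a, const int16_t *b)
{
    __asm__ volatile(
        "movdqu (%1), %%xmm0    \n\t"   /* load 8 x int16 from a */
        "movdqu (%2), %%xmm1    \n\t"   /* load 8 x int16 from b */
        "paddw  %%xmm1, %%xmm0  \n\t"   /* xmm0 += xmm1, per 16-bit lane */
        "movdqu %%xmm0, (%0)    \n\t"   /* store the result to dst */
        :
        : "r"(dst), "r"(a), "r"(b)
        : "xmm0", "xmm1", "memory");
}
#else
/* Scalar fallback (assumption: only there so the sketch builds elsewhere). */
static void add8_s16_intrin(int16_t *dst, const int16_t *a, const int16_t *b)
{
    for (int i = 0; i < 8; i++)
        dst[i] = (int16_t)(a[i] + b[i]);
}

static void add8_s16_asm(int16_t *dst, const int16_t *a, const int16_t *b)
{
    add8_s16_intrin(dst, a, b);
}
#endif

/* Returns 1 if both versions produce identical output for a fixed input. */
static int add8_match(void)
{
    const int16_t a[8] = {1, 2, 3, 4, 5, 6, 7, 8};
    const int16_t b[8] = {10, 20, 30, 40, 50, 60, 70, 80};
    int16_t x[8], y[8];

    add8_s16_intrin(x, a, b);
    add8_s16_asm(y, a, b);
    return memcmp(x, y, sizeof x) == 0;
}
```

A function this small has nothing to schedule, so it only shows the two forms
side by side. Any speed difference would have to be measured on a real,
longer kernel.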

--Loren Merritt


