[FFmpeg-devel] swscale/rgb2rgb : add X86_64 SIMD (SSSE3 and AVX2) for shuffle_bytes func

Nicolas George george at nsup.org
Sun Mar 18 18:28:48 EET 2018


Martin Vignali (2018-03-18):
> I ran the test again with a bigger width (512 instead of 128)
> These are my results:
> shuffle_bytes_0321_c: 128.6
> shuffle_bytes_0321_ssse3: 41.6
> shuffle_bytes_0321_avx2: 23.4

IIUC, these benchmarks are expressed in CPU cycles. But what James says
is that it can cause the CPU frequency to be throttled: if that happens,
fewer cycles can take more time and, even worse, cause other unrelated
code to take more time. A benchmark in actual time on a typical use case
would be needed to decide.
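
To make the distinction concrete, here is a rough sketch of timing in
wall-clock nanoseconds rather than cycles. Everything named here is an
assumption for illustration: shuffle_bytes_0321_ref is a hypothetical
scalar stand-in, not the swscale code, and time_shuffle_ns is a
made-up helper. It only shows the measurement approach, not a
definitive benchmark.

```c
#define _POSIX_C_SOURCE 199309L
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <time.h>

/* Hypothetical scalar stand-in: within each 4-byte group, keep bytes 0
 * and 2 and swap bytes 1 and 3. Not the actual swscale implementation. */
static void shuffle_bytes_0321_ref(const uint8_t *src, uint8_t *dst,
                                   size_t size)
{
    assert(size % 4 == 0);
    for (size_t i = 0; i < size; i += 4) {
        dst[i + 0] = src[i + 0];
        dst[i + 1] = src[i + 3];
        dst[i + 2] = src[i + 2];
        dst[i + 3] = src[i + 1];
    }
}

typedef void (*shuffle_fn)(const uint8_t *src, uint8_t *dst, size_t size);

/* Hypothetical helper: average wall-clock nanoseconds per call, taken
 * from the monotonic clock, so a change in CPU frequency shows up in
 * the result instead of being hidden by a cycle count. */
static double time_shuffle_ns(shuffle_fn fn, const uint8_t *src,
                              uint8_t *dst, size_t size, int iterations)
{
    struct timespec t0, t1;

    clock_gettime(CLOCK_MONOTONIC, &t0);
    for (int i = 0; i < iterations; i++)
        fn(src, dst, size);
    clock_gettime(CLOCK_MONOTONIC, &t1);

    double ns = (double)(t1.tv_sec - t0.tv_sec) * 1e9
              + (double)(t1.tv_nsec - t0.tv_nsec);
    return ns / iterations;
}
```

To be meaningful, such a measurement would have to run long enough for
any throttling to set in, inside a realistic workload (for example a
full conversion with other code running alongside), and be repeated
for each variant of the function.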

Regards,

-- 
  Nicolas George