[FFmpeg-devel] [PATCH] swscale_unscaled: fix and speed up DITHER_COPY macro for x86 with SSE2

James Almer jamrial at gmail.com
Fri Sep 22 18:47:32 EEST 2017


On 9/22/2017 12:23 PM, Mateusz wrote:
> New version of the patch -- now it uses the same logic independent of the target bitdepth.
> 
> For x86_64 it is much faster than current code (with perfect quality), for x86_32 it is fast
> if you add to configure: --extra-cflags="-msse2"
> (for x86_32 with default configure options it is slower than current code but with better quality)
> 
> Please review/test.
> 
> Mateusz

We don't accept intrinsics, or new arch-specific code outside of
arch-specific folders.

Either write this in NASM syntax, or if it *really* needs to be inlined,
use __asm__() inline blocks. But whichever you use, it needs to go in
the x86/ folder.
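
As a rough illustration of the inline option only, here is a minimal,
standalone sketch of an SSE2 __asm__() block. The function name
add_dither_u16x8 and the operation (a saturating add of eight 16-bit dither
values) are hypothetical and not taken from the patch, and the sketch
uses plain compiler macros rather than FFmpeg's own build-system guards,
so treat it as an assumption-laden example and not as the form the
actual code should take.

```c
#include <stdint.h>

/* Hypothetical helper (not from the patch): dst[i] = sat_u16(src[i] + dither[i])
 * for i = 0..7. Uses an SSE2 inline asm block on x86 with GCC/Clang, and a
 * plain C loop everywhere else. */
static void add_dither_u16x8(uint16_t *dst, const uint16_t *src,
                             const uint16_t *dither)
{
#if defined(__GNUC__) && (defined(__x86_64__) || (defined(__i386__) && defined(__SSE2__)))
    __asm__ volatile(
        "movdqu  (%1), %%xmm0    \n\t"  /* load 8 source samples (unaligned) */
        "movdqu  (%2), %%xmm1    \n\t"  /* load 8 dither values (unaligned)  */
        "paddusw %%xmm1, %%xmm0  \n\t"  /* unsigned saturating 16-bit add    */
        "movdqu  %%xmm0, (%0)    \n\t"  /* store 8 results                   */
        :
        : "r"(dst), "r"(src), "r"(dither)
        : "xmm0", "xmm1", "memory");
#else
    /* Portable fallback with the same saturating behaviour. */
    for (int i = 0; i < 8; i++) {
        unsigned v = (unsigned)src[i] + dither[i];
        dst[i] = v > 0xFFFF ? 0xFFFF : (uint16_t)v;
    }
#endif
}
```

In the tree, a block like this would live in a file under the x86/ folder
and be selected at runtime through the usual CPU-flag checks, not
compiled unconditionally into the generic C file.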

