[FFmpeg-devel] [PATCH] lavfi/removegrain: add x86 and x86_64 SSE2 functions

James Almer jamrial at gmail.com
Wed Jul 15 02:16:43 CEST 2015


On 14/07/15 8:35 PM, James Darnley wrote:
> Speed of all modes increased by a factor between 7.4 and 19.8 largely depending
> on whether bytes are unpacked into words.  Modes 2, 3, and 4 have been sped-up
> by a factor of 43 (thanks quick sort!)
> 
> All modes are available on x86_64 but only modes 1, 10, 11, 12, 13, 14, 19, 20,
> 21, and 22 are available on x86 due to the number of SIMD registers used.
> 
> With a contribution from James Almer <jamrial at gmail.com>
> ---
>  LICENSE.md                            |    1 +
>  libavfilter/removegrain.h             |   40 ++
>  libavfilter/vf_removegrain.c          |   38 +-
>  libavfilter/x86/Makefile              |    4 +
>  libavfilter/x86/vf_removegrain.asm    | 1218 +++++++++++++++++++++++++++++++++
>  libavfilter/x86/vf_removegrain_init.c |   88 +++
>  6 files changed, 1370 insertions(+), 19 deletions(-)
>  create mode 100644 libavfilter/removegrain.h
>  create mode 100644 libavfilter/x86/vf_removegrain.asm
>  create mode 100644 libavfilter/x86/vf_removegrain_init.c

The assembly looks fine and works. Any improvements or additions can be made in
subsequent patches.

Paul: Any comments? If not, please push it.

