[FFmpeg-devel] [PATCH] 8-bit hevc decoding optimization on aarch64 with neon

Shengbin Meng shengbinmeng at gmail.com
Tue Nov 21 12:51:49 EET 2017


> On 19 Nov 2017, at 01:35, Rafal Dabrowa <fatwildcat at gmail.com> wrote:
> 
> 
> This is a proposal of performance optimizations for 8-bit
> hevc video decoding on aarch64 platform with neon (simd) extension.

Nice to see the work for aarch64! 

We are also in the process of doing NEON optimization for HEVC decoding. (http://ffmpeg.org/pipermail/ffmpeg-devel/2017-October/218233.html <http://ffmpeg.org/pipermail/ffmpeg-devel/2017-October/218233.html>)

Now we are just about to finish arm 32-bit work and ready to send some patches out. Looks like for aarch64 we can join force:) What do you think?

> 
> The patch contains optimizations for most heavily used qpel, epel, sao and idct
> functions.  Among the functions provided for optimization there are two
> intensively used, but not optimized in this patch: hevc_v_loop_filter_luma_8
> and hevc_h_loop_filter_luma_8. I have no idea how they could be optimized
> hence I leaved them without optimizations.
> 

I see that optimization for loop filter already exists for arm 32-bit code. Why not use that algorithm?


Regards,
Shengbin


More information about the ffmpeg-devel mailing list