[FFmpeg-devel] [PATCH 0/4] More H.264 assembly (the sequel) [version 2]

James Darnley jdarnley at obe.tv
Tue Dec 6 14:04:55 EET 2016


On 2016-12-05 19:32, James Darnley wrote:
> Fixed the problem Michael highlighted.  Dropped the intra functions until it
> becomes clear why their performance is unexpected. Updated the benchmarks with
> results from a Nehalem and used (slightly) more accurate data.
> 
> Regarding the age of MMX:  I have written it so unless someone tells me to
> remove it I will keep the code.  However, I will probably not write any more
> going forward.
> 
> James Darnley (4):
>   avcodec/h264: clean up and expand x86 function definitions
>   whitespace changes after last commit
>   avcodec/h264: mmx2, sse2, avx 10-bit h chroma deblock/loop filter
>   avcodec/h264: mmx2, sse2, avx 10-bit 4:2:2 h chroma deblock/loop
>     filter
> 
>  libavcodec/x86/h264_deblock_10bit.asm | 157 ++++++++++++++++++++++++++++++++++
>  libavcodec/x86/h264dsp_init.c         |  68 +++++++++------
>  2 files changed, 200 insertions(+), 25 deletions(-)
> 

If nobody raises objections or makes further comments I will push these,
including the AVX, later today.



More information about the ffmpeg-devel mailing list