[FFmpeg-devel] [PATCH 3/3] x86/hevc: add ff_hevc_sao_band_filter_{8, 10, 12}_{sse2, avx2}

Christophe Gisquet christophe.gisquet at gmail.com
Sat Jan 31 11:33:53 CET 2015


Hi,

2015-01-30 19:50 GMT+01:00 James Almer <jamrial at gmail.com>:
> +%macro HEVC_SAO_BAND_FILTER_COMPUTE 3
> +    psraw             %2, %3, %1-5
> +    pcmpeqw          m10, %2, m0
> +    pcmpeqw          m11, %2, m1
> +    pcmpeqw          m12, %2, m2
> +    pcmpeqw           %2, m3
> +    pand             m10, m4
> +    pand             m11, m5
> +    pand             m12, m6
> +    pand              %2, m7
> +    por              m10, m11
> +    por              m12, %2
> +    por              m10, m12
> +    paddw             %3, m10
> +%endmacro

The shift does really force to work on bytes, too bad. Some pshufb
might still be possible using the result, but it would be cumbersome
because the psraw result is [0-31], and offset might be signed.

> +.loop:
> +    movu             m13, [srcq+widthq]
[...]
> +    movu      [dstq+widthq], m8

Some of those moves could be aligned, but there's some work to be done
at the buffer levels. So it's not like it's really part of this patch.

Looks good, any improvement seems like an additional patch.

-- 
Christophe


More information about the ffmpeg-devel mailing list