[FFmpeg-devel] [PATCH v2 4/5] libavfilter/x86/vf_gblur: add localbuf and ff_horiz_slice_avx2/512()
Wu, Jianhua
jianhua.wu at intel.com
Mon Sep 13 07:56:39 EEST 2021
Zhao Zhili wrote:
>
> Hi Wu,
>
> > On Aug 4, 2021, at 10:06 AM, Wu Jianhua <jianhua.wu at intel.com> wrote:
> >
> > We introduced ff_horiz_slice_avx2/512(), implemented with a new
> > algorithm. In a nutshell, the new algorithm does three things:
> > gathering data from 8/16 rows, blurring the data, and scattering the
> > data back to the image buffer. Here we use a customized 8x8/16x16
> > transpose to avoid the huge overhead of gather and scatter
> > instructions; the transpose depends on a newly added temporary
> > buffer called localbuf.
> >
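For readers following the thread, the three steps described above can be modelled in plain C. This is only a scalar sketch under stated assumptions, not the AVX2/AVX-512 assembly from the patch: the names blur_row(), blur_rows_transposed() and gblur_sketch_max_diff() are hypothetical, the recursive forward/backward filter is assumed to follow the C reference of vf_gblur, and the inner loops over r merely stand in for what a single vector instruction would do on 8 lanes.

```c
#include <assert.h>

#define ROWS 8   /* rows handled together; 8 floats fill one 256-bit register */
#define W    16  /* row width used by the self-check below (arbitrary) */

/* Reference: recursive blur of a single row (forward pass, then backward). */
static void blur_row(float *row, int width, int steps, float nu, float bscale)
{
    assert(width > 0);
    for (int step = 0; step < steps; step++) {
        row[0] *= bscale;
        for (int x = 1; x < width; x++)
            row[x] += nu * row[x - 1];
        row[width - 1] *= bscale;
        for (int x = width - 1; x > 0; x--)
            row[x - 1] += nu * row[x];
    }
}

/* Same filter applied to ROWS rows at once through a transposed copy.
 * localbuf is a caller-provided scratch area of ROWS * width floats. */
static void blur_rows_transposed(float *buf, int width, int steps,
                                 float nu, float bscale, float *localbuf)
{
    assert(width > 0);

    /* 1. "Gather": transpose so that column x of all rows is contiguous. */
    for (int r = 0; r < ROWS; r++)
        for (int x = 0; x < width; x++)
            localbuf[x * ROWS + r] = buf[r * width + x];

    /* 2. Blur: every loop over r models one vector operation on 8 lanes. */
    for (int step = 0; step < steps; step++) {
        for (int r = 0; r < ROWS; r++)
            localbuf[r] *= bscale;
        for (int x = 1; x < width; x++)
            for (int r = 0; r < ROWS; r++)
                localbuf[x * ROWS + r] += nu * localbuf[(x - 1) * ROWS + r];
        for (int r = 0; r < ROWS; r++)
            localbuf[(width - 1) * ROWS + r] *= bscale;
        for (int x = width - 1; x > 0; x--)
            for (int r = 0; r < ROWS; r++)
                localbuf[(x - 1) * ROWS + r] += nu * localbuf[x * ROWS + r];
    }

    /* 3. "Scatter": transpose back into the image buffer. */
    for (int r = 0; r < ROWS; r++)
        for (int x = 0; x < width; x++)
            buf[r * width + x] = localbuf[x * ROWS + r];
}

/* Self-check: largest absolute difference between the two variants. */
float gblur_sketch_max_diff(void)
{
    float ref[ROWS * W], test[ROWS * W], localbuf[ROWS * W];
    float max_diff = 0.0f;

    for (int i = 0; i < ROWS * W; i++)
        ref[i] = test[i] = (float)((i * 37) % 255) / 255.0f;

    for (int r = 0; r < ROWS; r++)
        blur_row(ref + r * W, W, 2, 0.5f, 0.25f);
    blur_rows_transposed(test, W, 2, 0.5f, 0.25f, localbuf);

    for (int i = 0; i < ROWS * W; i++) {
        float d = ref[i] - test[i];
        if (d < 0.0f)
            d = -d;
        if (d > max_diff)
            max_diff = d;
    }
    return max_diff;
}
```

Calling gblur_sketch_max_diff() from a small main() and checking the result against a tolerance is one way to confirm that the transposed variant matches the per-row reference; it is the same kind of check that checkasm applies with its EPS threshold.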
>
> I get a FATE error related to the patch:
>
> make fate-checkasm-vf_gblur
>
> checkasm: using random seed 227700911
> SSE:
> - vf_gblur.postscale_slice [OK]
> SSE4.1:
> - vf_gblur.horiz_slice [OK]
> test failed comparing 297.388 with 190 (abs diff=107.388 with EPS=0.01)
> AVX2:
> horiz_slice_avx2 (vf_gblur.c:47)
> - vf_gblur.horiz_slice [FAILED]
> - vf_gblur.postscale_slice [OK]
> src/tests/fate-run.sh: line 78: 40217 Illegal instruction: 4 $target_exec $target_path/"$@"
>
> The last one, "Illegal instruction: 4", existed before the patch.
>
> I have tested on two MacBooks: a 2015 MBP and a MacBook Pro (16-inch, 2019).
> The test succeeds on Ubuntu.
>
Hi Zhili,
Thanks for the report. I'll fix it once I find where the problem is located.
Best regards,
Jianhua