[FFmpeg-devel] [FFMpeg-Devel][GSoC][PATCH 1/2] postproc: Updated postprocess_template to use new sse/avx deinterlace functions
Michael Niedermayer
michaelni at gmx.at
Fri Apr 24 00:55:58 CEST 2015
On Thu, Apr 23, 2015 at 05:54:05PM -0400, Tucker DiNapoli wrote:
> I've revised the code and now the linear_interpolate and median deinterlace filters
> give the exact same results for both C and sse2/avx2. I'm almost certain the reason
> the other deinterlace filters don't give the exact same result is due to the use
> of tempory memory to hold some lines, I'm not exactly sure what to do about this
> however.
>
> Unfortunately this patch causes the C/MMX versions to produce different
> results than before. I'm not exactly sure why this is, I changed the code
> to work 4 blocks at a time, which I'm guessing is the issue, but I don't
> know why that would be. If anyone has any idea why this is happening and
> how to fix it it would be appreciated.
i really wish i knew and could fix all bugs in this quickly now
but i cant :(
>
> ____________________________________________________________________________
>
> This commit adds several new files containing yasm assembly code, they are:
> PPContext.asm; Defines the PPContext struct using the yasm struc command
> PPUtil.asm; Various utility macros used in the other asm code
> block_copy.asm; Implements the block copy function, the sse2 and avx2
> versions copy multiple blocks at once.
> deinterlace.asm; Contains implemenations of the postprocessing filters
> with support for sse2 and avx2.
>
> Adding these new functions to postprocess_template entailed adding a new
> templates for AVX2 and modifying the current SSE2 template to use the
> sse2 functions. A new deinterlace function was added to move the logic
> of which deinterlace function to use out of the postprocess function and
> make adding the new functions eaiser. The inline code for packing QP
> into pQPb was moved into a seperate asm file and uptaded for sse2/avx2.
>
./ffplay matrixbench_mpeg2.mpg -vf tinterlace=4,pp=lb
shows some remaining interlacing artifacts in some individual blocks
thats even happening with -cpuflags 0 after the patch
this patch also seems to break the deblocking filters
-vf 'pp=fq|50/va/ha' looks vissibly different after it with a low
bitrate video
[...]
--
Michael GnuPG fingerprint: 9FF2128B147EF6730BADF133611EC787040B0FAB
Complexity theory is the science of finding the exact solution to an
approximation. Benchmarking OTOH is finding an approximation of the exact
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 181 bytes
Desc: Digital signature
URL: <https://ffmpeg.org/pipermail/ffmpeg-devel/attachments/20150424/628f29b6/attachment.asc>
More information about the ffmpeg-devel
mailing list