[FFmpeg-devel] [FFmpeg-Devel] [PATCH 4/5] postproc: Merged second and third inner loops in postprocess_template.c

Michael Niedermayer michaelni at gmx.at
Wed Apr 1 23:27:17 CEST 2015


On Wed, Apr 01, 2015 at 02:36:04PM -0400, Tucker DiNapoli wrote:
> The postprocess function loops over x from 0 to width, and within that
> loop processes 4 blocks at a time. This inner loop was previously split
> into 3 separate loops, i.e.:
>     outer_loop over x
>       save current x location
>       loop over 4 blocks
>       restore x location
>       loop over 4 blocks
>       restore x location
>       loop over 4 blocks
>     end
> This commit merges the second and third inner loops (eliminating the
> restore of the x location between them); the next commit merges the
> first inner loop as well, resulting in:
>    outer_loop:
>      loop over 4 blocks
>    end

How and where do you plan to add SSE2 and AVX2?
At what point will the calls to the yasm functions be made?
What part of the loop will the yasm functions for the qualification
task cover?


[...]
-- 
Michael     GnuPG fingerprint: 9FF2128B147EF6730BADF133611EC787040B0FAB

When the tyrant has disposed of foreign enemies by conquest or treaty, and
there is nothing more to fear from them, then he is always stirring up
some war or other, in order that the people may require a leader. -- Plato