[FFmpeg-devel] [PATCH v2] swscale/output: Altivec-optimize yuv2plane1_8

Michael Niedermayer michael at niedermayer.cc
Wed Nov 21 18:22:36 EET 2018


On Wed, Nov 21, 2018 at 02:35:32PM +0200, Lauri Kasanen wrote:
> On Wed, 21 Nov 2018 13:21:58 +0100
> Michael Niedermayer <michael at niedermayer.cc> wrote:
> 
> > On Wed, Nov 21, 2018 at 10:12:48AM +0200, Lauri Kasanen wrote:
> > > > ./ffmpeg_g -f rawvideo -pix_fmt rgb24 -s hd1080 -i /dev/zero -pix_fmt yuv420p \
> > > > -f null -vframes 100 -v error -nostats -
> > > > 
> > > > 1158 UNITS in planar1,   65528 runs,      8 skips
> > > > 
> > > > -cpuflags 0
> > > > 
> > > > 19082 UNITS in planar1,   65533 runs,      3 skips
> > > > 
> > > > 16.48 speedup ratio. On x86, SSE2 is ~7. Curiously, the Power C version
> > > > takes as many cycles as the x86 SSE2 version, yikes it's fast.
> > > > 
> > > > Note that this function uses VSX instructions, but is not marked so.
> > > > This is because several existing functions also make that mistake.
> > > > I'll submit a patch moving them once this is reviewed.
> > > > 
> > > > v2: Remove !BE check
> > > > Signed-off-by: Lauri Kasanen <cand at gmx.com>
> > > 
> > > Ping. Seems not many ffmpeg devs interested in ppc.
> > 
> > have you tried "make fate" with this patch (note you need to configure with
> > fate samples" so all tests are run
> 
> I ran those fate tests containing "scale" in the name, I gather the
> full suite takes > 20min. Otherwise I tested with a PNG to video
> conversion on LE, and Carl Eugen Hoyos tested with Lena on BE.

the full fate tests must be run, many of these tests use swscale without
having "scale" in their name
and yes on lower end hardware 20min and longer is possible

[...]

-- 
Michael     GnuPG fingerprint: 9FF2128B147EF6730BADF133611EC787040B0FAB

If you think the mosad wants you dead since a long time then you are either
wrong or dead since a long time.
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 181 bytes
Desc: not available
URL: <http://ffmpeg.org/pipermail/ffmpeg-devel/attachments/20181121/42046446/attachment.sig>


More information about the ffmpeg-devel mailing list