[FFmpeg-devel] [PATCH v2] swscale/output: Altivec-optimize yuv2plane1_8

Michael Niedermayer michael at niedermayer.cc
Sun Nov 18 02:29:21 EET 2018


On Sat, Nov 17, 2018 at 10:12:14AM +0200, Lauri Kasanen wrote:
> ./ffmpeg_g -f rawvideo -pix_fmt rgb24 -s hd1080 -i /dev/zero -pix_fmt yuv420p \
> -f null -vframes 100 -v error -nostats -
> 
> 1158 UNITS in planar1,   65528 runs,      8 skips
> 
> -cpuflags 0
> 
> 19082 UNITS in planar1,   65533 runs,      3 skips
> 
> 16.48 speedup ratio. On x86, SSE2 is ~7. Curiously, the Power C version
> takes as many cycles as the x86 SSE2 version, yikes it's fast.
> 
> Note that this function uses VSX instructions, but is not marked so.
> This is because several existing functions also make that mistake.
> I'll submit a patch moving them once this is reviewed.
> 
> v2: Remove !BE check
> Signed-off-by: Lauri Kasanen <cand at gmx.com>
> ---
>  libswscale/ppc/swscale_altivec.c | 53 ++++++++++++++++++++++++++++++++++++++++
>  1 file changed, 53 insertions(+)

iam no altivec guy, just wanted to reply primarly so people dont wait for
a review from me as i normally review swscale patches.
if someone who knows altivec ok-es this then its ok with me too.
assuming this was tested (fate and some real world test)

thx

[...]

-- 
Michael     GnuPG fingerprint: 9FF2128B147EF6730BADF133611EC787040B0FAB

Concerning the gods, I have no means of knowing whether they exist or not
or of what sort they may be, because of the obscurity of the subject, and
the brevity of human life -- Protagoras
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 181 bytes
Desc: not available
URL: <http://ffmpeg.org/pipermail/ffmpeg-devel/attachments/20181118/770fba5a/attachment.sig>


More information about the ffmpeg-devel mailing list