[FFmpeg-devel] [RFC] flac_wasted32 vector implementation for VSX on ppc64le

Rémi Denis-Courmont remi at remlab.net
Thu Jun 6 12:53:32 EEST 2024



On 6 June 2024 10:43:05 GMT+03:00, Sean McGovern <gseanmcg at gmail.com> wrote:
>Hi,
>
>Attached inline is a _non-working_ implementation of flac_wasted32 for
>VSX developed on a POWER9 in little-endian mode but probably just as
>usable on POWER{8,10}.
>
>I'm not sure why probably one of the simplest DSP functions in lavc
>does not work for me, I imagine this is probably something endian
>related even though IBM's documentation for vec_sl()[1] does not
>suggest any.

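You are mixing up bytes and elements in the loop counter: i advances by VSX_STRIDE (16) and is passed to vec_vsx_ld() as a byte offset, yet it is compared against len, which counts 32-bit elements. So the loop stops after roughly a quarter of the buffer. You should be able to track this down with gdb or good ol' printf().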
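
To make the unit mismatch concrete, here is a minimal portable sketch that needs no PowerPC hardware. It assumes that len counts 32-bit samples and that the offset argument of vec_vsx_ld()/vec_vsx_st() is in bytes. The helper shift4_at_byte_offset() and both function names are hypothetical stand-ins for illustration only; the helper merely imitates one 128-bit load, shift and store.

```c
#include <stddef.h>
#include <stdint.h>
#include <string.h>

#define VSX_STRIDE 16 /* bytes per 128-bit vector, as in the quoted code */

/* Hypothetical stand-in for vec_vsx_ld + vec_sl + vec_vsx_st:
 * shifts the four 32-bit lanes found at a BYTE offset from base. */
static void shift4_at_byte_offset(int32_t *base, size_t byte_off, int wasted)
{
    uint32_t lane[4];

    memcpy(lane, (unsigned char *)base + byte_off, sizeof(lane));
    for (int k = 0; k < 4; k++)
        lane[k] <<= wasted;
    memcpy((unsigned char *)base + byte_off, lane, sizeof(lane));
}

/* Same loop shape as the quoted code: i is used as a byte offset
 * but compared against len, which is an element count. */
static void wasted32_as_posted(int32_t *decoded, int wasted, int len)
{
    for (int i = 0; i < len; i += VSX_STRIDE)
        shift4_at_byte_offset(decoded, (size_t)i, wasted);
}

/* Loop bound converted to bytes, so both sides use the same unit.
 * Assumes len is a multiple of 4 (no tail handling in this sketch). */
static void wasted32_byte_bound(int32_t *decoded, int wasted, int len)
{
    size_t nbytes = (size_t)len * sizeof(*decoded);

    for (size_t i = 0; i < nbytes; i += VSX_STRIDE)
        shift4_at_byte_offset(decoded, i, wasted);
}
```

With len = 16, the first variant runs a single iteration and shifts only elements 0 to 3, while the second covers all 16 elements.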

>Here's my code:
>
>#define VSX_STRIDE 16
>
>void ff_flac_wasted32_vsx(int32_t *decoded, int wasted, int len)
>{
>   register vec_s32 vec1;
>   register vec_u32 vec2 = { wasted, wasted, wasted, wasted };

There should be an instruction to splat a scalar to a vector. Better yet, use a vector-scalar shift, if VSX has one.

>   register vec_s32 shifted;
>
>   for (int i = 0; i < len; i += VSX_STRIDE) {
>       vec1 = vec_vsx_ld(i, decoded);
>       shifted = vec_sl(vec1, vec2);
>       vec_vsx_st(shifted, i, decoded);
>   }
>}
>
>Anyone with experience with AltiVec or VSX see something obvious I am missing?
>
>-- Sean McGovern
>
>[1] https://www.ibm.com/docs/en/xl-c-and-cpp-linux/16.1.1?topic=functions-vec-sl
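
Putting both remarks together, one possible shape for the function is sketched below. This is an untested sketch under stated assumptions, not the fix that was eventually applied: the name flac_wasted32_sketch is hypothetical, len is assumed to count samples, and a scalar tail is added because nothing in the thread says len is a multiple of 4. It uses vec_splats() for the shift count and leaves open whether a vector-scalar shift exists. The VSX path is guarded by __VSX__ so the same file also builds elsewhere with the scalar loop only.

```c
#include <stddef.h>
#include <stdint.h>

#if defined(__VSX__)
#include <altivec.h>
#endif

/* Hypothetical name, for illustration only. */
void flac_wasted32_sketch(int32_t *decoded, int wasted, int len)
{
    int i = 0; /* counts 32-bit elements, not bytes */

#if defined(__VSX__)
    /* Splat the shift count into all four lanes. */
    const __vector unsigned int count = vec_splats((unsigned int)wasted);

    /* Four elements per iteration; the intrinsics take a byte offset,
     * so the element index is scaled by the element size. */
    for (; i + 4 <= len; i += 4) {
        __vector signed int v = vec_vsx_ld(i * (int)sizeof(*decoded), decoded);

        v = vec_sl(v, count);
        vec_vsx_st(v, i * (int)sizeof(*decoded), decoded);
    }
#endif

    /* Scalar tail; on builds without VSX this is the whole loop.
     * The shift is done on unsigned values to avoid signed overflow. */
    for (; i < len; i++)
        decoded[i] = (int32_t)((uint32_t)decoded[i] << wasted);
}
```

Keeping i in elements and scaling it at the load and store is one way to avoid the mismatch; iterating in bytes against a byte bound would work just as well.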