[FFmpeg-devel] [PATCH 0/2] x86: hevc_mc: port to SSSE3

James Almer jamrial at gmail.com
Sat Aug 23 16:48:12 CEST 2014


On 23/08/14 11:07 AM, Mickaël Raulet wrote:
> For 10-bit and 12-bit, they should stay SSE4 as well because of packusdw. Porting them to SSSE3 needs a few extra instructions to emulate it; see below.
> 
> 
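> /* Emulates packusdw using SSE2 only. Valid only when every 32-bit lane is
>  * already in 0..65535: the shifts sign-extend the low 16 bits so that
>  * packssdw never saturates. Out-of-range lanes are truncated, not clamped. */
> static av_always_inline __m128i _MM_PACKUS_EPI32( __m128i a, __m128i b )
> {
>     a = _mm_slli_epi32 (a, 16);
>     a = _mm_srai_epi32 (a, 16);
>     b = _mm_slli_epi32 (b, 16);
>     b = _mm_srai_epi32 (b, 16);
>     a = _mm_packs_epi32 (a, b);
>     return a;
> }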

There's a PACK macro in lavfi/x86/yadif-16.asm that does this without intrinsics.
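
For reference, a minimal scalar sketch in C of what the shift/shift/pack sequence does to a single 32-bit lane. The helper names are made up for illustration (they are not FFmpeg or intrinsics API), and it assumes the usual two's-complement integer conversions. It is only meant to show that the emulation agrees with packusdw for inputs already in 0..65535 and wraps instead of clamping outside that range.

```c
#include <stdint.h>

/* Hypothetical helper: models pslld 16 followed by psrad 16 on one lane,
 * i.e. sign-extension of the low 16 bits. */
static int32_t sign_extend_low16(int32_t v)
{
    return (int16_t)(uint16_t)((uint32_t)v & 0xFFFF);
}

/* Hypothetical helper: models packssdw on one lane (signed saturation). */
static int16_t pack_signed_sat(int32_t v)
{
    if (v > INT16_MAX) return INT16_MAX;
    if (v < INT16_MIN) return INT16_MIN;
    return (int16_t)v;
}

/* Hypothetical helper: models SSE4.1 packusdw on one lane
 * (unsigned saturation). */
static uint16_t pack_unsigned_sat(int32_t v)
{
    if (v > 65535) return 65535;
    if (v < 0)     return 0;
    return (uint16_t)v;
}

/* The emulation: after sign-extension the value always fits in int16_t,
 * so the signed pack passes the low 16 bits through unchanged. */
static uint16_t emulated_pack(int32_t v)
{
    return (uint16_t)pack_signed_sat(sign_extend_low16(v));
}
```

With these definitions, emulated_pack(v) equals pack_unsigned_sat(v) for every v in 0..65535, which should cover 10-bit and 12-bit samples that have already been clipped. For -1 or 65536 the two differ, so the clipping has to happen before the pack.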

