[FFmpeg-devel] [PATCH] h264: integrate clear_blocks calls with IDCT.

Michael Niedermayer michaelni at gmx.at
Mon Feb 18 03:04:26 CET 2013


On Sun, Feb 17, 2013 at 02:52:54PM -0800, Ronald S. Bultje wrote:
> From: "Ronald S. Bultje" <rsbultje at gmail.com>
> 
> The non-intra-pcm branch in hl_decode_mb (simple, 8bpp) goes from 700
> to 672 cycles, and the complete loop of decode_mb_cabac and hl_decode_mb
> (in the decode_slice loop) goes from 1759 to 1733 cycles on the clip
> tested (cathedral), i.e. almost 30 cycles per mb faster.
> 

> Arm assembly changes untested.

fate-h264 (h264-conformance-ba_mw_d in this case but its not the only
one)

Program received signal SIGSEGV, Segmentation fault.
ff_h264_idct_add8_neon () at ffmpeg/libavcodec/arm/h264idct_neon.S:166
166         ldrsh           r8,  [r1]

[...]
-- 
Michael     GnuPG fingerprint: 9FF2128B147EF6730BADF133611EC787040B0FAB

While the State exists there can be no freedom; when there is freedom there
will be no State. -- Vladimir Lenin
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 198 bytes
Desc: Digital signature
URL: <http://ffmpeg.org/pipermail/ffmpeg-devel/attachments/20130218/5a2c9d3d/attachment.asc>


More information about the ffmpeg-devel mailing list