[FFmpeg-devel] [PATCH] SSE optimization for DCA decoder

David Conrad lessen42
Fri Aug 29 04:06:45 CEST 2008


Hi,

Attached gives me about a 45% faster overall DCA decode on my penryn.  
Name suggestions for the function welcome.

Regression tests pass, and I get bit-identical output.

81883 dezicycles in ff_dca_qmf_mul_c, 16380 runs, 4 skips
81067 dezicycles in ff_dca_qmf_mul_c, 32761 runs, 7 skips
82178 dezicycles in ff_dca_qmf_mul_c, 65528 runs, 8 skips
82789 dezicycles in ff_dca_qmf_mul_c, 131051 runs, 21 skips

11990 dezicycles in ff_dca_qmf_mul_sse, 16270 runs, 114 skips
12518 dezicycles in ff_dca_qmf_mul_sse, 32538 runs, 230 skips
12260 dezicycles in ff_dca_qmf_mul_sse, 65126 runs, 410 skips
12254 dezicycles in ff_dca_qmf_mul_sse, 130235 runs, 837 skips

-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: dca_qmf_see.txt
URL: <http://lists.mplayerhq.hu/pipermail/ffmpeg-devel/attachments/20080828/d7b8a793/attachment.txt>
-------------- next part --------------




More information about the ffmpeg-devel mailing list