[FFmpeg-devel] libavcodec/exr : add SSE SIMD for reorder_pixels v2 (WIP)

Martin Vignali martin.vignali at gmail.com
Sat Aug 26 21:11:09 EEST 2017


Hello,

in attach new patch for SSE simd of reorder pixels in exr decoder (use by
zip and rle uncompress),
after comments on the previous patch by Ivan Kalvachev.

After testing only on a small buffer, i fix the overread problem of the
previous patch (who hid the last loop trouble)

pass fate test for me (on Mac Os X)


Tested with the decoding of a sequence of 150 HD Exr images (CGI render
with 17 layers per file in float pixel, ZIP16 compression)

SSE :
349190 decicycles in reorder_pixels_zip,  130716 runs,    356 skips
bench: utime=109.222s
bench: maxrss=607002624kB

Scalar :
3039686 decicycles in reorder_pixels_zip,  130395 runs,    677 skips
bench: utime=123.042s
bench: maxrss=607019008kB


Comments Welcome

Martin
Jokyo Images
-------------- next part --------------
A non-text attachment was scrubbed...
Name: 0001-libavcodec-add-SSE-SIMD-for-reorder-pixels.patch
Type: application/octet-stream
Size: 16035 bytes
Desc: not available
URL: <http://ffmpeg.org/pipermail/ffmpeg-devel/attachments/20170826/5420bc0a/attachment.obj>


More information about the ffmpeg-devel mailing list