[FFmpeg-devel] [PATCH]: Change Stack Frame Limit in Cuda Context

Ben Chang benc at nvidia.com
Thu Jan 25 01:44:13 EET 2018


Hi,

Please help review this patch to reduce stack frame size per GPU thread. The default allocation size per thread (1024 bytes) is excessive and can be reduced to 128 bytes based on nvidia cuda kernel compilation statistics. This should help with reducing video memory usage per cuda context.

Thanks,
Ben

-----------------------------------------------------------------------------------
This email message is for the sole use of the intended recipient(s) and may contain
confidential information.  Any unauthorized review, use, disclosure or distribution
is prohibited.  If you are not the intended recipient, please contact the sender by
reply email and destroy all copies of the original message.
-----------------------------------------------------------------------------------
-------------- next part --------------
A non-text attachment was scrubbed...
Name: 0001-Reduce-cuda-context-s-stack-frame-size-limit-through.patch
Type: application/octet-stream
Size: 4453 bytes
Desc: 0001-Reduce-cuda-context-s-stack-frame-size-limit-through.patch
URL: <http://ffmpeg.org/pipermail/ffmpeg-devel/attachments/20180124/74f29a27/attachment.obj>


More information about the ffmpeg-devel mailing list