[FFmpeg-devel] [PATCH]: Change Stack Frame Limit in Cuda Context

Carl Eugen Hoyos ceffmpeg at gmail.com
Thu Jan 25 02:17:26 EET 2018


2018-01-25 0:44 GMT+01:00 Ben Chang <benc at nvidia.com>:

> Please help review this patch to reduce stack frame size per GPU
> thread. The default allocation size per thread (1024 bytes) is
> excessive and can be reduced to 128 bytes based on nvidia cuda
> kernel compilation statistics. This should help with reducing video
> memory usage per cuda context.

> Subject: [PATCH] Reduce cuda context's stack frame size limit through
> cuCtxSetLimit. The default stack limit is 1024 byte per GPU thread. This
> reduces limit to 128 byte as verified against current cuda kernel compilation
> statistic. This will reduce local memory allocated per cuda context.

This looks as if your commit message spans several lines, should
be one line followed by an empty line and as many more lines as
you need.

Is there a reason why the error messages are different?

> This email message is for the sole use of the intended recipient(s)
> and may contain confidential information.

Please remove this or use another email address.

Carl Eugen


More information about the ffmpeg-devel mailing list