[FFmpeg-devel] Performance of P010LE/BE pixel convertion

Ali KIZIL alikizil at gmail.com
Thu Sep 1 13:35:03 EEST 2016


What CPU are you using? It's presumably going to vary wildly from one
CPU to another?

>* On 1 Sep 2016, at 08:52, Ali KIZIL <alikizil at gmail.com <http://ffmpeg.org/mailman/listinfo/ffmpeg-devel>> wrote:
*

>

>* Hi all,
*

>

>* I tested P010LE pixel convertion from YUV420P in NVENC Main 10 HEVC UHD 50
*

>* fps encoding on Nvidia Pascal Titan X GPU:
*

>

>* Nvidia Pascal Titan X GPU can not reach to 50 fps on Main 10 P010LE HEVC
*

>* encoding:
*

>

>* ffmpeg -loglevel verbose -i
*

>* /media/usb1/4k_sampels/Samsung_SUHD_Picture_Quality\ Demo_Nano_Crystal\
*

>* Display_UK-Version.mp4 -c:v:0 nvenc_hevc -preset hp -cbr 1 -2pass 0 -r 50
*

>* -vb 28000k -minrate 28000k -maxrate 28000k -bufsize 28000k -muxrate 30000k
*

>* -c:a:0 aac -b:a:0 192k -pix_fmt p010le 'udp://233.33.33.1:5001'
*

>

>* FPS waves around 41-43 fps. If same command with YUV420P, it reaches to 120
*

>* - 130 fps.
*

>

>* GPU NVENC Load:
*

>* nvidia-smi dmon -i 0
*

>* gpu pwr temp sm mem enc dec mclk pclkIdx W C % % % % MHz MHz
*

>

>* 0    81    67     9     2    41     0  4513  1809
*

>* 0    80    66     9     2    41     0  4513  1809
*

>* 0    81    67     9     2    42     0  4513  1809
*

>* 0    80    67    10     2    41     0  4513  1809
*

>* 0    81    67     9     2    44     0  4513  1809
*

>

>* I think bottleneck is not at GPU side, pixel convertion maybe needs speed
*

>* up improvement.
*

>

>

>* If codec changed to rawvideo to test pixel format convertion performance
*

>* testing, FPS again waves around 39-40 fps
*

>

>

>* I wanted to state these test reults.
*

>

>

>* Kind Regards,
*

>* _______________________________________________
*

>* ffmpeg-devel mailing list
*

>* ffmpeg-devel at ffmpeg.org <http://ffmpeg.org/mailman/listinfo/ffmpeg-devel>
*

>* http://ffmpeg.org/mailman/listinfo/ffmpeg-devel <http://ffmpeg.org/mailman/listinfo/ffmpeg-devel>
*


The test is done on "Intel(R) Core(TM) i7-4960X CPU @ 3.60GHz" with 32 GB
DDR3 (8pcs. x 4GB  Kingston KHX2133C11D3) with Linux kizil105
3.19.0-25-generic #26~14.04.1-Ubuntu SMP Fri Jul 24 21:16:20 UTC 2015
x86_64 x86_64 x86_64 GNU/Linux.

For memory dmidecode shows 1333 Mhz. I will check my BIOS settings if RAM
speed set wrong and update the mail list.
sudo dmidecode --type memory
# dmidecode 2.12
# SMBIOS entry point at 0x000f04c0
SMBIOS 2.7 present.

Handle 0x0029, DMI type 16, 23 bytes
Physical Memory Array
        Location: System Board Or Motherboard
        Use: System Memory
        Error Correction Type: None
        Maximum Capacity: 96 GB
        Error Information Handle: Not Provided
        Number Of Devices: 4

Handle 0x002B, DMI type 17, 34 bytes
Memory Device
        Array Handle: 0x0029
        Error Information Handle: Not Provided
        Total Width: 64 bits
        Data Width: 64 bits
        Size: 8192 MB
        Form Factor: DIMM
        Set: None
        Locator: Node0_Dimm0
        Bank Locator: Node0_Bank0
        Type: DDR3
        Type Detail: Unbuffered (Unregistered)
        Speed: 1333 MHz
        Manufacturer: Kingston
        Serial Number: 6A2AF2C5
        Asset Tag: Dimm0_AssetTag
        Part Number: KHX2133C11D3/
        Rank: 2
        Configured Clock Speed: 1333 MHz

As a final note, I will test same settings on a server with Dual Xeon E5-
2630 V4 with 64 GB (4 pcs x 16 GB) DDR4 2133 Mhz Ram. I will update for
this as well.


More information about the ffmpeg-devel mailing list