[FFmpeg-devel] [PATCH v2] aarch64/opusdsp: implement NEON accelerated postfilter and deemphasis

Carl Eugen Hoyos ceffmpeg at gmail.com
Sat Apr 6 16:42:08 EEST 2019


2019-04-02 13:47 GMT+02:00, Lynne <dev at lynne.ee>:
> 153372 UNITS in postfilter_c,   65536 runs,      0 skips
> 73164 UNITS in postfilter_neon,   65536 runs,      0 skips -> 2.1x speedup
>
> 80591 UNITS in deemphasis_c,  131072 runs,      0 skips
> 43969 UNITS in deemphasis_neon,  131072 runs,      0 skips -> 1.83x speedup

> Total decoder speedup: ~15% on a Raspberry Pi 3 (from
> 28.1x to 33.5x realtime)

How can I reproduce this?
On something called "X-Gene" Linux, I see a 2,5% overall
speedup (126x to 129x)

The issue is that there is some probability that your patch will
break mac and Windows compilation, so I would at least
like to understand why I can not reproduce your speedup.

Carl Eugen


More information about the ffmpeg-devel mailing list