[FFmpeg-devel] Fwd: Fixpoint FFT optimization, with MDCT and IMDCT wrappers for audio optimization
Mon Aug 27 01:11:30 CEST 2007
> As opposed to recent x86 chips, where 32x32 mul is 9 times slower than add?
Modern x86 chips have pipelined adders and multipliers, so the add and
multiply rate is the same (at least assuming they have equal numbers
of each). I believe Intel has been doing this since the pentium pro
in the mid 90s, and AMD since the K7 in the late 90s.
>Moreover, at least ARM9E and ARM11 cores execute 32x32->64 MAC in 3 cycles
Which is still 3x slower verses adds then a desktop PC made in the
last 10-15 years, hence my point about not generalizing from x86 too
More information about the ffmpeg-devel