[FFmpeg-devel] [PATCH] remove out-dated ADPCM frame_size handling in libavformat

Justin Ruggles justin.ruggles
Thu Sep 9 00:49:36 CEST 2010


Justin Ruggles wrote:

> Michael Niedermayer wrote:
> 
>> On Mon, Sep 06, 2010 at 08:11:38AM -0400, Justin Ruggles wrote:
>> [...]
>>> Index: tests/ref/acodec/g726
>>> ===================================================================
>>> --- tests/ref/acodec/g726	(revision 25042)
>>> +++ tests/ref/acodec/g726	(working copy)
>>> @@ -1,4 +1,4 @@
>>> -5d8cce28f83dd33c3c7eaf43a5db5294 *./tests/data/acodec/g726.wav
>>> -24082 ./tests/data/acodec/g726.wav
>>> -4f1ba1af75dee64625a1c852e6cd01d3 *./tests/data/g726.acodec.out.wav
>>> -stddev: 8504.69 PSNR: 17.74 MAXDIFF:31645 bytes:    96104/  1058400
>>> +fd090ddf05cc3401cc75c4a5ace1d05a *./tests/data/acodec/g726.wav
>>> +24052 ./tests/data/acodec/g726.wav
>>> +74abea06027375111eeac1b2f8c7d3af *./tests/data/g726.acodec.out.wav
>>> +stddev: 8554.55 PSNR: 17.69 MAXDIFF:29353 bytes:    95984/  1058400
>> the number of samples encoded seems to be changing and not equal to
>> the input
> 
> When the frame size in the encoder makes frames end on a byte boundary
> without any padding, the output is always identical.  Since codes are
> between 2 and 5 bits long, how would the decoder distinguish between
> padding to a byte boundary and another valid code?  I'll double-check,
> but it seems that the decoder currently treats padding as additional
> samples.

I've confirmed that this is the cause of the difference.  The parameters
used by the regression test give a 4-bit code size.  When the frame size
is odd, that leads to 1 extra sample being decoded by the decoder
because of padding.  In the current version, because of resampling from
44100 Hz to 8000 Hz, the frame size actually varies from frame-to-frame.

Current:
source samples             = 264600
resampled samples          =  47991
number of odd-sized frames =     61
decoded samples            =  48052
decoded data bytes         =  96104

Patched:
source samples             = 264600
resampled samples          =  47991
number of odd-sized frames =      1 (the last frame)
decoded samples            =  47992
decoded data bytes         =  95984

So choosing a frame size that forces the encoder to only use padding for
the last frame (which this patch does) seems to be the appropriate thing
to do.

-Justin



More information about the ffmpeg-devel mailing list