[FFmpeg-devel] [PATCH] Libspeex VAD encoder support
michaelni at gmx.at
Tue Oct 2 02:31:45 CEST 2012
On Mon, Oct 01, 2012 at 09:15:17PM +0400, Dmitry Samonenko wrote:
> On Sat, Sep 29, 2012 at 04:28:42PM +0400, Dmitry Samonenko wrote:
> > Hello.
> > I am continuing to implement full RFC 5574 (RTP + SPEEX) conformity.
> > One of the several lacking features - Voice Activity Detection encoding - is now done.
> > There is a nice sound file with the natural speech containing several
> > pauses - a good test case. It's a raw S16LE file about 1.4M. Don't know
> > list's regulation about where to upload such things.
> > Being speex encoded without VAD it's size is reduced to 157K, with VAD - to 145K.
> > That's almost 10% difference between VAD and non-VAD.
> > From 3c378db7cf256f3690fd79c378882b61d9822d5c Mon Sep 17 00:00:00 2001
> > From: Dmitry Samonenko <shreddingwork at gmail.com>
> > Date: Sat, 29 Sep 2012 15:47:28 +0400
> > Subject: [PATCH 1/1] Libspeex VAD support
> > Option for Voice Activity Detection is added to speex encoder.
> > Speex detects non-speech periods and encodes them with just enough bits
> > to reproduce the background noise, aka ``comfort noise generation''.
> > ---
> > libavcodec/libspeexenc.c | 9 +++++++++
> > 1 files changed, 9 insertions(+), 0 deletions(-)
> It seems to me that my patch arrived at the wrong time: FFmpeg turned
> 1.0, some huge patches appeared - these events overshadowed it.
> Well, hope it's a better time now.
Michael GnuPG fingerprint: 9FF2128B147EF6730BADF133611EC787040B0FAB
it is not once nor twice but times without number that the same ideas make
their appearance in the world. -- Aristotle
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Size: 198 bytes
Desc: Digital signature
More information about the ffmpeg-devel