FFmpeg
|
WMA Voice decoding context. More...
Data Fields | |
Global values specified in the stream header / extradata or used all over. | |
GetBitContext | gb |
packet bitreader. | |
int8_t | vbm_tree [25] |
converts VLC codes to frame type | |
int | spillover_bitsize |
number of bits used to specify spillover_nbits in the packet header = ceil(log2(ctx->block_align << 3)) | |
int | history_nsamples |
number of samples in history for signal prediction (through ACB) | |
int | do_apf |
whether to apply the averaged projection filter (APF) | |
int | denoise_strength |
strength of denoising in Wiener filter [0-11] | |
int | denoise_tilt_corr |
Whether to apply tilt correction to the Wiener filter coefficients (postfilter) | |
int | dc_level |
Predicted amount of DC noise, based on which a DC removal filter is used. | |
int | lsps |
number of LSPs per frame [10 or 16] | |
int | lsp_q_mode |
defines quantizer defaults [0, 1] | |
int | lsp_def_mode |
defines different sets of LSP defaults [0, 1] | |
int | frame_lsp_bitsize |
size (in bits) of LSPs, when encoded per-frame (independent coding) | |
int | sframe_lsp_bitsize |
size (in bits) of LSPs, when encoded per superframe (residual coding) | |
int | min_pitch_val |
base value for pitch parsing code | |
int | max_pitch_val |
max value + 1 for pitch parsing | |
int | pitch_nbits |
number of bits used to specify the pitch value in the frame header | |
int | block_pitch_nbits |
number of bits used to specify the first block's pitch value | |
int | block_pitch_range |
range of the block pitch | |
int | block_delta_pitch_nbits |
number of bits used to specify the delta pitch between this and the last block's pitch value, used in all but first block | |
int | block_delta_pitch_hrange |
1/2 range of the delta (full range is from -this to +this-1) | |
uint16_t | block_conv_table [4] |
boundaries for block pitch unit/scale conversion | |
Packet values specified in the packet header or related to a packet. | |
A packet is considered to be a single unit of data provided to this decoder by the demuxer. | |
int | spillover_nbits |
number of bits of the previous packet's last superframe preceding this packet's first full superframe (useful for re-synchronization also) | |
int | has_residual_lsps |
if set, superframes contain one set of LSPs that cover all frames, encoded as independent and residual LSPs; if not set, each frame contains its own, fully independent, LSPs | |
int | skip_bits_next |
number of bits to skip at the next call to wmavoice_decode_packet() (since they're part of the previous superframe) | |
uint8_t | sframe_cache [SFRAME_CACHE_MAXSIZE+FF_INPUT_BUFFER_PADDING_SIZE] |
cache for superframe data split over multiple packets | |
int | sframe_cache_size |
set to >0 if we have data from an (incomplete) superframe from a previous packet that spilled over in the current packet; specifies the amount of bits in sframe_cache | |
PutBitContext | pb |
bitstream writer for sframe_cache | |
Frame and superframe values | |
Superframe and frame data - these can change from frame to frame, although some of them do in that case serve as a cache / history for the next frame or superframe. | |
double | prev_lsps [MAX_LSPS] |
LSPs of the last frame of the previous superframe. | |
int | last_pitch_val |
pitch value of the previous frame | |
int | last_acb_type |
frame type [0-2] of the previous frame | |
int | pitch_diff_sh16 |
((cur_pitch_val - last_pitch_val) << 16) / MAX_FRAMESIZE | |
float | silence_gain |
set for use in blocks if ACB_TYPE_NONE | |
int | aw_idx_is_ext |
whether the AW index was encoded in 8 bits (instead of 6) | |
int | aw_pulse_range |
the range over which aw_pulse_set1() can apply the pulse, relative to the value in aw_first_pulse_off. | |
int | aw_n_pulses [2] |
number of AW-pulses in each block; note that this number can be negative (in which case it basically means "zero") | |
int | aw_first_pulse_off [2] |
index of first sample to which to apply AW-pulses, or -0xff if unset | |
int | aw_next_pulse_off_cache |
the position (relative to start of the second block) at which pulses should start to be positioned, serves as a cache for pitch-adaptive window pulses between blocks | |
int | frame_cntr |
current frame index [0 - 0xFFFE]; is only used for comfort noise in pRNG() | |
float | gain_pred_err [6] |
cache for gain prediction | |
float | excitation_history [MAX_SIGNAL_HISTORY] |
cache of the signal of previous superframes, used as a history for signal generation | |
float | synth_history [MAX_LSPS] |
see excitation_history | |
Postfilter values | |
Variables used for postfilter implementation, mostly history for smoothing and so on, and context variables for FFT/iFFT. | |
RDFTContext | rdft |
RDFTContext | irdft |
contexts for FFT-calculation in the postfilter (for denoise filter) | |
DCTContext | dct |
DCTContext | dst |
contexts for phase shift (in Hilbert transform, part of postfilter) | |
float | sin [511] |
float | cos [511] |
8-bit cosine/sine windows over [-pi,pi] range | |
float | postfilter_agc |
gain control memory, used in adaptive_gain_control() | |
float | dcf_mem [2] |
DC filter history. | |
float | zero_exc_pf [MAX_SIGNAL_HISTORY+MAX_SFRAMESIZE] |
zero filter output (i.e. | |
float | denoise_filter_cache [MAX_FRAMESIZE] |
int | denoise_filter_cache_size |
samples in denoise_filter_cache | |
float | tilted_lpcs_pf [0x80] |
aligned buffer for LPC tilting | |
float | denoise_coeffs_pf [0x80] |
aligned buffer for denoise coefficients | |
float | synth_filter_out_buf [0x80+MAX_LSPS_ALIGN16] |
aligned buffer for postfilter speech synthesis | |
WMA Voice decoding context.
Definition at line 132 of file wmavoice.c.
GetBitContext WMAVoiceContext::gb |
packet bitreader.
During decoder init, it contains the extradata from the demuxer. During decoding, it contains packet data.
Definition at line 137 of file wmavoice.c.
Referenced by parse_packet_header(), synth_superframe(), wmavoice_decode_init(), and wmavoice_decode_packet().
int8_t WMAVoiceContext::vbm_tree[25] |
converts VLC codes to frame type
Definition at line 141 of file wmavoice.c.
Referenced by check_bits_for_superframe(), synth_frame(), and wmavoice_decode_init().
int WMAVoiceContext::spillover_bitsize |
number of bits used to specify spillover_nbits in the packet header = ceil(log2(ctx->block_align << 3))
Definition at line 143 of file wmavoice.c.
Referenced by parse_packet_header(), and wmavoice_decode_init().
int WMAVoiceContext::history_nsamples |
number of samples in history for signal prediction (through ACB)
Definition at line 146 of file wmavoice.c.
Referenced by synth_frame(), synth_superframe(), wmavoice_decode_init(), and wmavoice_flush().
int WMAVoiceContext::do_apf |
whether to apply the averaged projection filter (APF)
Definition at line 150 of file wmavoice.c.
Referenced by synth_frame(), synth_superframe(), wmavoice_decode_end(), wmavoice_decode_init(), and wmavoice_flush().
int WMAVoiceContext::denoise_strength |
strength of denoising in Wiener filter [0-11]
Definition at line 152 of file wmavoice.c.
Referenced by calc_input_response(), and wmavoice_decode_init().
int WMAVoiceContext::denoise_tilt_corr |
Whether to apply tilt correction to the Wiener filter coefficients (postfilter)
Definition at line 154 of file wmavoice.c.
Referenced by calc_input_response(), and wmavoice_decode_init().
int WMAVoiceContext::dc_level |
Predicted amount of DC noise, based on which a DC removal filter is used.
Definition at line 156 of file wmavoice.c.
Referenced by postfilter(), and wmavoice_decode_init().
int WMAVoiceContext::lsps |
number of LSPs per frame [10 or 16]
Definition at line 159 of file wmavoice.c.
Referenced by postfilter(), synth_block(), synth_frame(), synth_superframe(), wiener_denoise(), wmavoice_decode_init(), and wmavoice_flush().
int WMAVoiceContext::lsp_q_mode |
defines quantizer defaults [0, 1]
Definition at line 160 of file wmavoice.c.
Referenced by synth_superframe(), and wmavoice_decode_init().
int WMAVoiceContext::lsp_def_mode |
defines different sets of LSP defaults [0, 1]
Definition at line 161 of file wmavoice.c.
Referenced by synth_superframe(), and wmavoice_decode_init().
int WMAVoiceContext::frame_lsp_bitsize |
size (in bits) of LSPs, when encoded per-frame (independent coding)
Definition at line 163 of file wmavoice.c.
Referenced by check_bits_for_superframe(), and wmavoice_decode_init().
int WMAVoiceContext::sframe_lsp_bitsize |
size (in bits) of LSPs, when encoded per superframe (residual coding)
Definition at line 165 of file wmavoice.c.
Referenced by check_bits_for_superframe(), and wmavoice_decode_init().
int WMAVoiceContext::min_pitch_val |
base value for pitch parsing code
Definition at line 168 of file wmavoice.c.
Referenced by kalman_smoothen(), synth_frame(), and wmavoice_decode_init().
int WMAVoiceContext::max_pitch_val |
max value + 1 for pitch parsing
Definition at line 169 of file wmavoice.c.
Referenced by kalman_smoothen(), synth_frame(), and wmavoice_decode_init().
int WMAVoiceContext::pitch_nbits |
number of bits used to specify the pitch value in the frame header
Definition at line 170 of file wmavoice.c.
Referenced by check_bits_for_superframe(), synth_frame(), and wmavoice_decode_init().
int WMAVoiceContext::block_pitch_nbits |
number of bits used to specify the first block's pitch value
Definition at line 172 of file wmavoice.c.
Referenced by check_bits_for_superframe(), synth_frame(), and wmavoice_decode_init().
int WMAVoiceContext::block_pitch_range |
range of the block pitch
Definition at line 174 of file wmavoice.c.
Referenced by synth_frame(), and wmavoice_decode_init().
int WMAVoiceContext::block_delta_pitch_nbits |
number of bits used to specify the delta pitch between this and the last block's pitch value, used in all but first block
Definition at line 175 of file wmavoice.c.
Referenced by check_bits_for_superframe(), synth_frame(), and wmavoice_decode_init().
int WMAVoiceContext::block_delta_pitch_hrange |
1/2 range of the delta (full range is from -this to +this-1)
Definition at line 179 of file wmavoice.c.
Referenced by synth_frame(), and wmavoice_decode_init().
uint16_t WMAVoiceContext::block_conv_table[4] |
boundaries for block pitch unit/scale conversion
Definition at line 181 of file wmavoice.c.
Referenced by synth_frame(), and wmavoice_decode_init().
int WMAVoiceContext::spillover_nbits |
number of bits of the previous packet's last superframe preceding this packet's first full superframe (useful for re-synchronization also)
Definition at line 193 of file wmavoice.c.
Referenced by parse_packet_header(), and wmavoice_decode_packet().
int WMAVoiceContext::has_residual_lsps |
if set, superframes contain one set of LSPs that cover all frames, encoded as independent and residual LSPs; if not set, each frame contains its own, fully independent, LSPs
Definition at line 197 of file wmavoice.c.
Referenced by check_bits_for_superframe(), parse_packet_header(), and synth_superframe().
int WMAVoiceContext::skip_bits_next |
number of bits to skip at the next call to wmavoice_decode_packet() (since they're part of the previous superframe)
Definition at line 202 of file wmavoice.c.
Referenced by wmavoice_decode_packet(), and wmavoice_flush().
uint8_t WMAVoiceContext::sframe_cache[SFRAME_CACHE_MAXSIZE+FF_INPUT_BUFFER_PADDING_SIZE] |
cache for superframe data split over multiple packets
Definition at line 207 of file wmavoice.c.
Referenced by synth_superframe(), and wmavoice_decode_packet().
int WMAVoiceContext::sframe_cache_size |
set to >0 if we have data from an (incomplete) superframe from a previous packet that spilled over in the current packet; specifies the amount of bits in sframe_cache
Definition at line 209 of file wmavoice.c.
Referenced by synth_superframe(), wmavoice_decode_packet(), and wmavoice_flush().
PutBitContext WMAVoiceContext::pb |
bitstream writer for sframe_cache
Definition at line 214 of file wmavoice.c.
Referenced by wmavoice_decode_packet().
double WMAVoiceContext::prev_lsps[MAX_LSPS] |
LSPs of the last frame of the previous superframe.
Definition at line 225 of file wmavoice.c.
Referenced by synth_superframe(), wmavoice_decode_init(), and wmavoice_flush().
int WMAVoiceContext::last_pitch_val |
pitch value of the previous frame
Definition at line 227 of file wmavoice.c.
Referenced by synth_block_fcb_acb(), synth_frame(), and wmavoice_decode_init().
int WMAVoiceContext::last_acb_type |
frame type [0-2] of the previous frame
Definition at line 228 of file wmavoice.c.
Referenced by synth_frame(), and wmavoice_decode_init().
int WMAVoiceContext::pitch_diff_sh16 |
((cur_pitch_val - last_pitch_val) << 16) / MAX_FRAMESIZE
Definition at line 229 of file wmavoice.c.
Referenced by synth_block_fcb_acb(), and synth_frame().
float WMAVoiceContext::silence_gain |
set for use in blocks if ACB_TYPE_NONE
Definition at line 231 of file wmavoice.c.
Referenced by synth_block_fcb_acb(), synth_block_hardcoded(), and synth_frame().
int WMAVoiceContext::aw_idx_is_ext |
whether the AW index was encoded in 8 bits (instead of 6)
Definition at line 233 of file wmavoice.c.
Referenced by aw_parse_coords(), and aw_pulse_set1().
int WMAVoiceContext::aw_pulse_range |
the range over which aw_pulse_set1() can apply the pulse, relative to the value in aw_first_pulse_off.
The exact position of the first AW-pulse is within [pulse_off, pulse_off + this], and depends on bitstream values; [16 or 24]
Definition at line 235 of file wmavoice.c.
Referenced by aw_parse_coords(), aw_pulse_set1(), and aw_pulse_set2().
int WMAVoiceContext::aw_n_pulses[2] |
number of AW-pulses in each block; note that this number can be negative (in which case it basically means "zero")
Definition at line 241 of file wmavoice.c.
Referenced by aw_parse_coords(), aw_pulse_set1(), and aw_pulse_set2().
int WMAVoiceContext::aw_first_pulse_off[2] |
index of first sample to which to apply AW-pulses, or -0xff if unset
Definition at line 244 of file wmavoice.c.
Referenced by aw_parse_coords(), aw_pulse_set1(), and aw_pulse_set2().
int WMAVoiceContext::aw_next_pulse_off_cache |
the position (relative to start of the second block) at which pulses should start to be positioned, serves as a cache for pitch-adaptive window pulses between blocks
Definition at line 246 of file wmavoice.c.
Referenced by aw_pulse_set2().
int WMAVoiceContext::frame_cntr |
current frame index [0 - 0xFFFE]; is only used for comfort noise in pRNG()
Definition at line 252 of file wmavoice.c.
Referenced by synth_block_fcb_acb(), synth_block_hardcoded(), and synth_frame().
float WMAVoiceContext::gain_pred_err[6] |
cache for gain prediction
Definition at line 254 of file wmavoice.c.
Referenced by synth_block_fcb_acb(), synth_block_hardcoded(), and wmavoice_flush().
float WMAVoiceContext::excitation_history[MAX_SIGNAL_HISTORY] |
cache of the signal of previous superframes, used as a history for signal generation
Definition at line 256 of file wmavoice.c.
Referenced by synth_superframe(), and wmavoice_flush().
float WMAVoiceContext::synth_history[MAX_LSPS] |
Definition at line 259 of file wmavoice.c.
Referenced by synth_superframe(), and wmavoice_flush().
RDFTContext WMAVoiceContext::rdft |
Definition at line 269 of file wmavoice.c.
Referenced by calc_input_response(), wiener_denoise(), wmavoice_decode_end(), and wmavoice_decode_init().
RDFTContext WMAVoiceContext::irdft |
contexts for FFT-calculation in the postfilter (for denoise filter)
Definition at line 269 of file wmavoice.c.
Referenced by calc_input_response(), wiener_denoise(), wmavoice_decode_end(), and wmavoice_decode_init().
DCTContext WMAVoiceContext::dct |
Definition at line 271 of file wmavoice.c.
Referenced by calc_input_response(), wmavoice_decode_end(), and wmavoice_decode_init().
DCTContext WMAVoiceContext::dst |
contexts for phase shift (in Hilbert transform, part of postfilter)
Definition at line 271 of file wmavoice.c.
Referenced by calc_input_response(), wmavoice_decode_end(), and wmavoice_decode_init().
float WMAVoiceContext::sin[511] |
Definition at line 273 of file wmavoice.c.
Referenced by calc_input_response(), and wmavoice_decode_init().
float WMAVoiceContext::cos[511] |
8-bit cosine/sine windows over [-pi,pi] range
Definition at line 273 of file wmavoice.c.
Referenced by calc_input_response(), and wmavoice_decode_init().
float WMAVoiceContext::postfilter_agc |
gain control memory, used in adaptive_gain_control()
Definition at line 275 of file wmavoice.c.
Referenced by postfilter(), and wmavoice_flush().
float WMAVoiceContext::dcf_mem[2] |
DC filter history.
Definition at line 277 of file wmavoice.c.
Referenced by postfilter(), and wmavoice_flush().
float WMAVoiceContext::zero_exc_pf[MAX_SIGNAL_HISTORY+MAX_SFRAMESIZE] |
zero filter output (i.e.
excitation) by postfilter
Definition at line 279 of file wmavoice.c.
Referenced by synth_frame(), synth_superframe(), and wmavoice_flush().
float WMAVoiceContext::denoise_filter_cache[MAX_FRAMESIZE] |
Definition at line 281 of file wmavoice.c.
Referenced by wiener_denoise(), and wmavoice_flush().
int WMAVoiceContext::denoise_filter_cache_size |
samples in denoise_filter_cache
Definition at line 282 of file wmavoice.c.
Referenced by wiener_denoise().
float WMAVoiceContext::tilted_lpcs_pf[0x80] |
aligned buffer for LPC tilting
Definition at line 284 of file wmavoice.c.
Referenced by wiener_denoise().
float WMAVoiceContext::denoise_coeffs_pf[0x80] |
aligned buffer for denoise coefficients
Definition at line 286 of file wmavoice.c.
Referenced by wiener_denoise().
float WMAVoiceContext::synth_filter_out_buf[0x80+MAX_LSPS_ALIGN16] |
aligned buffer for postfilter speech synthesis
Definition at line 288 of file wmavoice.c.
Referenced by postfilter(), and wmavoice_flush().