[FFmpeg-devel] [PATCH v7 1/8] Add generic metadata injection using AV_PKT_DATA_METADATA_UPDATE

Romain Beauxis romain.beauxis at gmail.com
Mon Mar 10 23:21:39 EET 2025


On Mon, Mar 10, 2025 at 15:46, Michael Niedermayer
<michael at niedermayer.cc> wrote:
>
> On Mon, Mar 10, 2025 at 08:50:06AM -0500, Romain Beauxis wrote:
> > On Sun, Mar 9, 2025 at 13:31, Michael Niedermayer
> > <michael at niedermayer.cc> wrote:
> > >
> > > On Tue, Feb 25, 2025 at 04:01:14PM -0600, Romain Beauxis wrote:
> > > > libavcodec/decode.c: intercept `AV_PKT_DATA_METADATA_UPDATE` packet
> > > > side data and attach it to the next decoded frame.
> > > >
> > > > ---
> > > >  libavcodec/decode.c | 20 ++++++++++++++++++++
> > > >  1 file changed, 20 insertions(+)
> > > >
> > > > diff --git a/libavcodec/decode.c b/libavcodec/decode.c
> > > > index cac7e620d2..96e2f0ce95 100644
> > > > --- a/libavcodec/decode.c
> > > > +++ b/libavcodec/decode.c
> > > > @@ -97,6 +97,8 @@ typedef struct DecodeContext {
> > > >      int lcevc_frame;
> > > >      int width;
> > > >      int height;
> > > > +
> > > > +    AVDictionary *pending_metadata;
> > > >  } DecodeContext;
> > > >
> > > >  static DecodeContext *decode_ctx(AVCodecInternal *avci)
> > > > @@ -729,6 +731,8 @@ int attribute_align_arg avcodec_send_packet(AVCodecContext *avctx, const AVPacke
> > > >  {
> > > >      AVCodecInternal *avci = avctx->internal;
> > > >      DecodeContext     *dc = decode_ctx(avci);
> > > > +    const uint8_t *side_metadata;
> > > > +    size_t size;
> > > >      int ret;
> > > >
> > > >      if (!avcodec_is_open(avctx) || !av_codec_is_decoder(avctx->codec))
> > > > @@ -746,6 +750,14 @@ int attribute_align_arg avcodec_send_packet(AVCodecContext *avctx, const AVPacke
> > > >          ret = av_packet_ref(avci->buffer_pkt, avpkt);
> > > >          if (ret < 0)
> > > >              return ret;
> > > > +
> > > > +        side_metadata = av_packet_get_side_data(avpkt, AV_PKT_DATA_METADATA_UPDATE, &size);
> > > > +        if (side_metadata) {
> > > > +            av_dict_free(&dc->pending_metadata);
> > > > +            ret = av_packet_unpack_dictionary(side_metadata, size, &dc->pending_metadata);
> > > > +            if (ret < 0)
> > > > +                return ret;
> > > > +        }
> > > >      } else
> > > >          dc->draining_started = 1;
> > > >
> > > > @@ -815,6 +827,7 @@ fail:
> > > >  int ff_decode_receive_frame(AVCodecContext *avctx, AVFrame *frame)
> > > >  {
> > > >      AVCodecInternal *avci = avctx->internal;
> > > > +    DecodeContext     *dc = decode_ctx(avci);
> > > >      int ret;
> > > >
> > > >      if (!avcodec_is_open(avctx) || !av_codec_is_decoder(avctx->codec))
> > > > @@ -887,6 +900,12 @@ int ff_decode_receive_frame(AVCodecContext *avctx, AVFrame *frame)
> > > >          }
> > > >      }
> > > >  #endif
> > > > +
> > > > +    if (dc->pending_metadata) {
> > > > +        av_dict_copy(&frame->metadata, dc->pending_metadata, AV_DICT_APPEND);
> > > > +        av_dict_free(&dc->pending_metadata);
> > > > +    }
> > > > +
> > > >      return 0;
> > > >  fail:
> > > >      av_frame_unref(frame);
> > >
> > > Does this work correctly if there's a delay/latency in the decoder?
> >
> > Can you describe a scenario that you're thinking about?
>
> The user feeds several packets into a multi-threaded decoder
> and then, depending on the threads and luck, sooner or later
> one frame comes out.
>
> Passing some data in a way that disregards this feels wrong.
> Hypothetically, there could also be a 2nd AV_PKT_DATA_METADATA_UPDATE
> going in before the frame corresponding to the first comes out,
> but I may be missing something.

As far as I can tell, the mechanism is only used by Ogg streams, and I
believe these scenarios would not really apply to Ogg.

Currently, DTS values are broken for chained Ogg streams, so passing
packets in any order other than the stream's sequential order would not
work.

I think this concern could be addressed in the planned follow-up work,
which is to remove the ogg headers from the demuxer and attach
AV_PKT_DATA_METADATA_UPDATE to a packet with actual media content.

In this case, the metadata would be immediately attached to the frame
when the packet is decoded.
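
To make the scenario discussed above concrete, here is a minimal
standalone sketch. It is not FFmpeg code: ToyDecoder, ToyFrame,
toy_init(), toy_decode() and TOY_DELAY are hypothetical names made up
for illustration. It only models a decoder with a fixed output delay
and a single pending-metadata slot that is attached to whichever frame
comes out next, which is how I read the patch.

```c
#include <stddef.h>

/* Hypothetical decoder latency: frames come out this many packets late. */
#define TOY_DELAY 2

typedef struct ToyFrame {
    int pkt_id;            /* packet this frame was decoded from */
    const char *metadata;  /* metadata attached on output, or NULL */
} ToyFrame;

typedef struct ToyDecoder {
    int queued[TOY_DELAY];        /* packets still inside the decoder */
    int nb_queued;
    const char *pending_metadata; /* single slot, like dc->pending_metadata */
} ToyDecoder;

static void toy_init(ToyDecoder *d)
{
    d->nb_queued = 0;
    d->pending_metadata = NULL;
}

/* Feed one packet. Returns 1 and fills *out when a frame comes out,
 * 0 when the decoder needs more input first. */
static int toy_decode(ToyDecoder *d, int pkt_id, const char *metadata,
                      ToyFrame *out)
{
    int i;

    /* A new update replaces whatever was still pending. */
    if (metadata)
        d->pending_metadata = metadata;

    if (d->nb_queued < TOY_DELAY) {
        d->queued[d->nb_queued++] = pkt_id;
        return 0;
    }

    /* Output the oldest queued packet and queue the new one. */
    out->pkt_id = d->queued[0];
    for (i = 1; i < TOY_DELAY; i++)
        d->queued[i - 1] = d->queued[i];
    d->queued[TOY_DELAY - 1] = pkt_id;

    /* Pending metadata goes to whichever frame comes out next. */
    out->metadata = d->pending_metadata;
    d->pending_metadata = NULL;
    return 1;
}
```

Feeding packets 0, 1, 2, 3 with "title=A" on packet 1 and "title=B" on
packet 2, the first frame out is the one for packet 0 and it carries
"title=B", while "title=A" is never delivered. With no delay
(sequential decoding, one frame per packet) the slot is consumed before
the next update arrives.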

-- Romain

