[FFmpeg-devel] [PATCH] pthread_frame: allow per-field ThreadFrame owners.

Ronald S. Bultje rsbultje at gmail.com
Wed Jul 12 00:09:53 EEST 2017


Hi Wan-Teh,

On Tue, Jul 11, 2017 at 4:53 PM, Wan-Teh Chang <wtc at google.com> wrote:

> Hi Ronald,
>
> I have a question about this patch.
>
> On Mon, Apr 3, 2017 at 7:24 AM, Ronald S. Bultje <rsbultje at gmail.com>
> wrote:
> > This tries to handle cases where separate invocations of decode_frame()
> > (each running in separate threads) write to respective fields in the
> > same AVFrame->data[]. Having per-field owners makes interaction between
> > readers (the referencing thread) and writers (the decoding thread)
> > slightly more optimal if both accesses are field-based, since they will
> > use the respective producer's thread objects (mutex/cond) instead of
> > sharing the thread objects of the first field's producer.
> >
> > In practice, this fixes the following tsan-warning in fate-h264:
> >
> > WARNING: ThreadSanitizer: data race (pid=21615)
> >   Read of size 4 at 0x7d640000d9fc by thread T2 (mutexes: write M1006):
> >     #0 ff_thread_report_progress pthread_frame.c:569
> (ffmpeg:x86_64+0x100f7cf54)
> > [..]
> >   Previous write of size 4 at 0x7d640000d9fc by main thread (mutexes:
> write M1004):
> >     #0 update_context_from_user pthread_frame.c:335
> (ffmpeg:x86_64+0x100f81abb)
> > ---
> >  libavcodec/h264_slice.c    |  8 +++++---
> >  libavcodec/pthread_frame.c | 18 ++++++++++--------
> >  libavcodec/thread.h        |  2 +-
> >  libavcodec/utils.c         |  7 ++++---
> >  4 files changed, 20 insertions(+), 15 deletions(-)
> >
> > diff --git a/libavcodec/h264_slice.c b/libavcodec/h264_slice.c
> > index fa1e9ae..d4d31cc 100644
> > --- a/libavcodec/h264_slice.c
> > +++ b/libavcodec/h264_slice.c
> > @@ -1423,14 +1423,14 @@ static int h264_field_start(H264Context *h,
> const H264SliceContext *sl,
> >       * We have to do that before the "dummy" in-between frame
> allocation,
> >       * since that can modify h->cur_pic_ptr. */
> >      if (h->first_field) {
> > +        int last_field = last_pic_structure == PICT_BOTTOM_FIELD;
> >          av_assert0(h->cur_pic_ptr);
> >          av_assert0(h->cur_pic_ptr->f->buf[0]);
> >          assert(h->cur_pic_ptr->reference != DELAYED_PIC_REF);
> >
> >          /* Mark old field/frame as completed */
> > -        if (h->cur_pic_ptr->tf.owner == h->avctx) {
> > -            ff_thread_report_progress(&h->cur_pic_ptr->tf, INT_MAX,
> > -                                      last_pic_structure ==
> PICT_BOTTOM_FIELD);
> > +        if (h->cur_pic_ptr->tf.owner[last_field] == h->avctx) {
> > +            ff_thread_report_progress(&h->cur_pic_ptr->tf, INT_MAX,
> last_field);
> >          }
> >
> >          /* figure out if we have a complementary field pair */
> > @@ -1568,7 +1568,9 @@ static int h264_field_start(H264Context *h, const
> H264SliceContext *sl,
> >              return AVERROR_INVALIDDATA;
> >          }
> >      } else {
> > +        int field = h->picture_structure == PICT_BOTTOM_FIELD;
> >          release_unused_pictures(h, 0);
> > +        h->cur_pic_ptr->tf.owner[field] = h->avctx;
> >      }
> >      /* Some macroblocks can be accessed before they're available in case
> >      * of lost slices, MBAFF or threading. */
>
> Note: I have to admit I don't understand the changes to
> libavcodec/h264_slice.c. The changes to the other files are
> straightforward, except for the one issue I ask about below.
>
> > diff --git a/libavcodec/utils.c b/libavcodec/utils.c
> > index 3e8677d..0c68836 100644
> > --- a/libavcodec/utils.c
> > +++ b/libavcodec/utils.c
> > @@ -3971,7 +3971,8 @@ int ff_thread_ref_frame(ThreadFrame *dst,
> ThreadFrame *src)
> >  {
> >      int ret;
> >
> > -    dst->owner = src->owner;
> > +    dst->owner[0] = src->owner[0];
> > +    dst->owner[1] = src->owner[1];
> >
> >      ret = av_frame_ref(dst->f, src->f);
> >      if (ret < 0)
> > @@ -3981,7 +3982,7 @@ int ff_thread_ref_frame(ThreadFrame *dst,
> ThreadFrame *src)
> >
> >      if (src->progress &&
> >          !(dst->progress = av_buffer_ref(src->progress))) {
> > -        ff_thread_release_buffer(dst->owner, dst);
> > +        ff_thread_release_buffer(dst->owner[0], dst);
> >          return AVERROR(ENOMEM);
> >      }
> >
> [...]
>
> I don't understand why we pass dst->owner[0], not dst->owner[1], to
> the ff_thread_release_buffer() call. Does this assume dst->owner[0] ==
> dst->owner[1]?
>

Neither is perfect, ideally (from an API programming PoV) you'd want to use
both. But that's not possible... In practice, I don't think it matters
much, since all it does is decide which release_buffer-queue it'll be
appended to (if we do delayed free()'ing). Are there problems with this?

Although dst->owner[0] and dst->owner[1] are initialized to the same
> value, the changes to libavcodec/h264_slice.c seem to imply
> dst->owner[0] and dst->owner[1] may become different.


Right, if each slice is decoded in a separate (frame-)worker thread, then
they will be different.

Ronald


More information about the ffmpeg-devel mailing list