[FFmpeg-trac] #6915(avformat:open): DASH audio segments duration doesn't match exactly with video segments duration.

FFmpeg trac at avcodec.org
Fri Dec 22 01:30:34 EET 2017


#6915: DASH audio segments duration doesn't match exactly with video segments
duration.
------------------------------------+-------------------------------------
             Reporter:  beloko      |                    Owner:  stevenliu
                 Type:  defect      |                   Status:  open
             Priority:  normal      |                Component:  avformat
              Version:  git-master  |               Resolution:
             Keywords:              |               Blocked By:
             Blocking:              |  Reproduced by developer:  0
Analyzed by developer:  0           |
------------------------------------+-------------------------------------

Comment (by stevenliu):

 If the two m3u8 list not MUST sequence align, i think this way can fix it.

 {{{
 MacBook:xxx StevenLiu$ rm -rf *.m4s *.m3u8;./ffmpeg -hide_banner -i
 ~/bbb_sunflower_1080p_30fps_normal.mp4 -g 150 -r 100 -x264opts
 "scenecut=-1" -f dash -min_seg_duration 1000000 -window_size 99999 -t 5
 -hls_playlist 1 output_Steven.mpd
 Input #0, mov,mp4,m4a,3gp,3g2,mj2, from
 '/Users/StevenLiu/bbb_sunflower_1080p_30fps_normal.mp4':
   Metadata:
     major_brand     : isom
     minor_version   : 1
     compatible_brands: isomavc1
     creation_time   : 2013-12-16T17:44:39.000000Z
     title           : Big Buck Bunny, Sunflower version
     artist          : Blender Foundation 2008, Janus Bager Kristensen 2013
     comment         : Creative Commons Attribution 3.0 -
 http://bbb3d.renderfarming.net
     genre           : Animation
     composer        : Sacha Goedegebure
   Duration: 00:10:34.53, start: 0.000000, bitrate: 3481 kb/s
     Stream #0:0(und): Video: h264 (High) (avc1 / 0x31637661), yuv420p,
 1920x1080 [SAR 1:1 DAR 16:9], 2998 kb/s, 30 fps, 30 tbr, 30k tbn, 60 tbc
 (default)
     Metadata:
       creation_time   : 2013-12-16T17:44:39.000000Z
       handler_name    : GPAC ISO Video Handler
     Stream #0:1(und): Audio: mp3 (mp4a / 0x6134706D), 48000 Hz, stereo,
 s16p, 160 kb/s (default)
     Metadata:
       creation_time   : 2013-12-16T17:44:42.000000Z
       handler_name    : GPAC ISO Audio Handler
     Stream #0:2(und): Audio: ac3 (ac-3 / 0x332D6361), 48000 Hz, 5.1(side),
 fltp, 320 kb/s (default)
     Metadata:
       creation_time   : 2013-12-16T17:44:42.000000Z
       handler_name    : GPAC ISO Audio Handler
     Side data:
       audio service type: main
 Stream mapping:
   Stream #0:0 -> #0:0 (h264 (native) -> h264 (libx264))
   Stream #0:2 -> #0:1 (ac3 (native) -> aac (native))
 Press [q] to stop, [?] for help
 [libx264 @ 0x7f9441042000] using SAR=1/1
 [libx264 @ 0x7f9441042000] using cpu capabilities: MMX2 SSE2Fast SSSE3
 SSE4.2 AVX AVX2 FMA3 LZCNT BMI2
 [libx264 @ 0x7f9441042000] profile High, level 5.1
 [libx264 @ 0x7f9441042000] 264 - core 133 r2334M a3ac64b - H.264/MPEG-4
 AVC codec - Copyleft 2003-2013 - http://www.videolan.org/x264.html -
 options: cabac=1 ref=3 deblock=1:0:0 analyse=0x3:0x113 me=hex subme=7
 psy=1 psy_rd=1.00:0.00 mixed_ref=1 me_range=16 chroma_me=1 trellis=1
 8x8dct=1 cqm=0 deadzone=21,11 fast_pskip=1 chroma_qp_offset=-2 threads=6
 lookahead_threads=1 sliced_threads=0 nr=0 decimate=1 interlaced=0
 bluray_compat=0 constrained_intra=0 bframes=3 b_pyramid=2 b_adapt=1
 b_bias=0 direct=1 weightb=1 open_gop=0 weightp=2 keyint=150 keyint_min=15
 scenecut=0 intra_refresh=0 rc_lookahead=40 rc=crf mbtree=1 crf=23.0
 qcomp=0.60 qpmin=0 qpmax=69 qpstep=4 ip_ratio=1.40 aq=1:1.00
 [aac @ 0x7f9441043800] Using a PCE to encode channel layout
 [dash @ 0x7f944180dc00] No bit rate set for stream 0
 [dash @ 0x7f944180dc00] Opening 'init-stream0.m4s' for writing
 [dash @ 0x7f944180dc00] Opening 'init-stream1.m4s' for writing
 [dash @ 0x7f944180dc00] Opening 'output_Steven.mpd.tmp' for writing
 Bandwidth info not available, set audio and video bitrates
 Output #0, dash, to 'output_Steven.mpd':
   Metadata:
     major_brand     : isom
     minor_version   : 1
     compatible_brands: isomavc1
     composer        : Sacha Goedegebure
     title           : Big Buck Bunny, Sunflower version
     artist          : Blender Foundation 2008, Janus Bager Kristensen 2013
     comment         : Creative Commons Attribution 3.0 -
 http://bbb3d.renderfarming.net
     genre           : Animation
     encoder         : Lavf58.3.100
     Stream #0:0(und): Video: h264 (libx264), yuv420p(progressive),
 1920x1080 [SAR 1:1 DAR 16:9], q=-1--1, 100 fps, 12800 tbn, 100 tbc
 (default)
     Metadata:
       creation_time   : 2013-12-16T17:44:39.000000Z
       handler_name    : GPAC ISO Video Handler
       encoder         : Lavc58.8.100 libx264
     Side data:
       cpb: bitrate max/min/avg: 0/0/0 buffer size: 0 vbv_delay: -1
     Stream #0:1(und): Audio: aac (LC), 48000 Hz, 5.1(side), fltp, 394 kb/s
 (default)
     Metadata:
       creation_time   : 2013-12-16T17:44:42.000000Z
       handler_name    : GPAC ISO Audio Handler
       encoder         : Lavc58.8.100 aac
     Side data:
       audio service type: main
 [dash @ 0x7f944180dc00] Opening 'chunk-stream1-00001.m4s.tmp' for
 writingrop=0 speed=0.444x
 [dash @ 0x7f944180dc00] Opening 'output_Steven.mpd.tmp' for writing
 [dash @ 0x7f944180dc00] Opening 'chunk-stream0-00001.m4s.tmp' for
 writingdrop=0 speed=0.526x
 [dash @ 0x7f944180dc00] Opening 'output_Steven.mpd.tmp' for writing
 [dash @ 0x7f944180dc00] Opening 'chunk-stream1-00002.m4s.tmp' for
 writingdrop=0 speed=0.459x
 [dash @ 0x7f944180dc00] Opening 'output_Steven.mpd.tmp' for writing
 [dash @ 0x7f944180dc00] Opening 'chunk-stream1-00003.m4s.tmp' for
 writingdrop=0 speed=0.548x
 [dash @ 0x7f944180dc00] Opening 'output_Steven.mpd.tmp' for writing
 [dash @ 0x7f944180dc00] Opening 'chunk-stream0-00002.m4s.tmp' for writing
 [dash @ 0x7f944180dc00] Opening 'output_Steven.mpd.tmp' for writing
 [dash @ 0x7f944180dc00] Opening 'chunk-stream1-00004.m4s.tmp' for
 writingdrop=0 speed=0.49x
 [dash @ 0x7f944180dc00] Opening 'output_Steven.mpd.tmp' for writing
 [dash @ 0x7f944180dc00] Opening 'chunk-stream0-00003.m4s.tmp' for
 writingdrop=0 speed=0.502x
 [dash @ 0x7f944180dc00] Opening 'output_Steven.mpd.tmp' for writing
 [dash @ 0x7f944180dc00] Opening 'chunk-stream1-00005.m4s.tmp' for writing
 [dash @ 0x7f944180dc00] Opening 'output_Steven.mpd.tmp' for writing
 [dash @ 0x7f944180dc00] Opening 'chunk-stream0-00004.m4s.tmp' for writing
 [dash @ 0x7f944180dc00] Opening 'chunk-stream1-00006.m4s.tmp' for writing
 [dash @ 0x7f944180dc00] Opening 'output_Steven.mpd.tmp' for writing
 frame=  500 fps= 44 q=-1.0 Lsize=N/A time=00:00:05.01 bitrate=N/A dup=354
 drop=0 speed=0.436x
 video:365kB audio:241kB subtitle:0kB other streams:0kB global headers:0kB
 muxing overhead: unknown
 [libx264 @ 0x7f9441042000] frame I:4     Avg QP:16.78  size: 20153
 [libx264 @ 0x7f9441042000] frame P:150   Avg QP:20.52  size:  1625
 [libx264 @ 0x7f9441042000] frame B:346   Avg QP:18.64  size:   141
 [libx264 @ 0x7f9441042000] consecutive B-frames:  5.4%  5.2%  5.4% 84.0%
 [libx264 @ 0x7f9441042000] mb I  I16..4: 92.8%  3.1%  4.1%
 [libx264 @ 0x7f9441042000] mb P  I16..4:  6.0%  1.9%  0.1%  P16..4:  2.5%
 0.2%  0.2%  0.0%  0.0%    skip:89.2%
 [libx264 @ 0x7f9441042000] mb B  I16..4:  0.2%  0.0%  0.0%  B16..8:  0.6%
 0.0%  0.0%  direct: 0.2%  skip:99.0%  L0:33.2% L1:65.7% BI: 1.1%
 [libx264 @ 0x7f9441042000] 8x8 transform intra:17.9% inter:71.0%
 [libx264 @ 0x7f9441042000] coded y,uvDC,uvAC intra: 3.2% 11.7% 2.2% inter:
 0.2% 0.7% 0.1%
 [libx264 @ 0x7f9441042000] i16 v,h,dc,p: 83% 12%  3%  1%
 [libx264 @ 0x7f9441042000] i8 v,h,dc,ddl,ddr,vr,hd,vl,hu: 20% 11% 62%  1%
 2%  1%  2%  1%  1%
 [libx264 @ 0x7f9441042000] i4 v,h,dc,ddl,ddr,vr,hd,vl,hu: 26% 22% 28%  4%
 4%  4%  4%  3%  4%
 [libx264 @ 0x7f9441042000] i8c dc,h,v,p: 77% 14%  9%  0%
 [libx264 @ 0x7f9441042000] Weighted P-Frames: Y:22.7% UV:22.0%
 [libx264 @ 0x7f9441042000] ref P L0: 83.9%  4.5%  9.7%  1.5%  0.4%
 [libx264 @ 0x7f9441042000] ref B L0: 73.8% 24.9%  1.3%
 [libx264 @ 0x7f9441042000] ref B L1: 97.2%  2.8%
 [libx264 @ 0x7f9441042000] kb/s:597.11
 [aac @ 0x7f9441043800] Qavg: 190.813
 MacBook:xxx StevenLiu$ cat media_0.m3u8 media_1.m3u8
 #EXTM3U
 #EXT-X-VERSION:6
 #EXT-X-TARGETDURATION:2
 #EXT-X-MEDIA-SEQUENCE:1
 #EXT-X-MAP:URI="init-stream0.m4s"
 #EXTINF:1.500000,
 chunk-stream0-00001.m4s
 #EXTINF:1.500000,
 chunk-stream0-00002.m4s
 #EXTINF:1.500000,
 chunk-stream0-00003.m4s
 #EXTINF:0.500000,
 chunk-stream0-00004.m4s
 #EXT-X-ENDLIST
 #EXTM3U
 #EXT-X-VERSION:6
 #EXT-X-TARGETDURATION:1
 #EXT-X-MEDIA-SEQUENCE:1
 #EXT-X-MAP:URI="init-stream1.m4s"
 #EXTINF:0.960000,
 chunk-stream1-00001.m4s
 #EXTINF:0.981333,
 chunk-stream1-00002.m4s
 #EXTINF:0.981333,
 chunk-stream1-00003.m4s
 #EXTINF:0.981333,
 chunk-stream1-00004.m4s
 #EXTINF:0.981333,
 chunk-stream1-00005.m4s
 #EXTINF:0.114667,
 chunk-stream1-00006.m4s
 #EXT-X-ENDLIST
 MacBook:xxx StevenLiu$
 MacBook:xxx StevenLiu$ mediastreamvalidator master.m3u8
 mediastreamvalidator: Version 1.2(170822)

 [master.m3u8] Started loading root playlist
 [media_1.m3u8] Started loading media playlist
 Can't deal with multiple sample timings per sample buffer
 [media_1.m3u8] All media files delivered and have end tag, stopping

 --------------------------------------------------------------------------------
 media_1.m3u8
 --------------------------------------------------------------------------------
 Processed 6 out of 6 segments
 Average segment duration: 0.833333
 Total segment bitrates (all discontinuities): average: 397.52 kb/s, max:
 400.91 kb/s
 Playlist max bitrate: 394.000000 kb/s
 Audio Group ID: AUDIO


 Discontinuity: sequence: 0, parsed segment count: 6 of 6, duration: 5.000
 sec, average: 397.52 kb/s, max: 400.91 kb/s
 Track ID: 1
 Audio Codec: AAC-LC
 Audio sample rate: 48000 Hz
 Audio channels: 0
 Audio channel layout: (null)

 --------------------------------------------------------------------------------
 MUST fix issues
 --------------------------------------------------------------------------------

 Error: Zero sample count
 --> Detail:  Track ID 295
 --> Source:  media_1.m3u8 - chunk-stream1-00006.m4s:6235 at 0

 MacBook:xxx StevenLiu$ git diff
 diff --git a/libavformat/dashenc.c b/libavformat/dashenc.c
 index 5687530f2d..07dd2bef5f 100644
 --- a/libavformat/dashenc.c
 +++ b/libavformat/dashenc.c
 @@ -1168,6 +1168,7 @@ static int dash_write_packet(AVFormatContext *s,
 AVPacket *pkt)
      DASHContext *c = s->priv_data;
      AVStream *st = s->streams[pkt->stream_index];
      OutputStream *os = &c->streams[pkt->stream_index];
 +    int get_split_time_point = 0;
      int ret;

      ret = update_stream_extradata(s, os, st->codecpar);
 @@ -1195,10 +1196,17 @@ static int dash_write_packet(AVFormatContext *s,
 AVPacket *pkt)
      if (os->first_pts == AV_NOPTS_VALUE)
          os->first_pts = pkt->pts;

 -    if ((!c->has_video || st->codecpar->codec_type == AVMEDIA_TYPE_VIDEO)
 &&
 -        pkt->flags & AV_PKT_FLAG_KEY && os->packets_written &&
 -        av_compare_ts(pkt->pts - os->start_pts, st->time_base,
 -                      c->min_seg_duration, AV_TIME_BASE_Q) >= 0) {
 +    if (c->has_video && st->codecpar->codec_type == AVMEDIA_TYPE_AUDIO) {
 +        if ( av_compare_ts(pkt->pts + pkt->duration - os->start_pts,
 st->time_base, c->min_seg_duration, AV_TIME_BASE_Q) >= 0) {
 +            get_split_time_point = 1;
 +        }
 +    } else {
 +        if (av_compare_ts(pkt->pts - os->start_pts, st->time_base,
 c->min_seg_duration, AV_TIME_BASE_Q) >= 0) {
 +            get_split_time_point = 1;
 +        }
 +    }
 +    if ((!c->has_video || (st->codecpar->codec_type ==
 AVMEDIA_TYPE_VIDEO) || c->hls_playlist) &&
 +        pkt->flags & AV_PKT_FLAG_KEY && os->packets_written &&
 get_split_time_point) {
          int64_t prev_duration = c->last_duration;

          c->last_duration = av_rescale_q(pkt->pts - os->start_pts,
 MacBook:xxx StevenLiu$
 }}}

--
Ticket URL: <https://trac.ffmpeg.org/ticket/6915#comment:40>
FFmpeg <https://ffmpeg.org>
FFmpeg issue tracker


More information about the FFmpeg-trac mailing list