[FFmpeg-trac] #5846(undetermined:new): Extract text subtitles in UTF-8 with BOM

FFmpeg trac at avcodec.org
Wed Sep 14 07:22:24 EEST 2016

#5846: Extract text subtitles in UTF-8 with BOM
             Reporter:  edumj        |                     Type:
               Status:  new          |  enhancement
            Component:               |                 Priority:  normal
  undetermined                       |                  Version:
             Keywords:               |  unspecified
             Blocking:               |               Blocked By:
Analyzed by developer:  0            |  Reproduced by developer:  0
 Summary of the bug:

 I can add '''srt''' subs (UTF-8) to a '''MP4''' file (ttxt subs) with:
 -c:s mov_text
 but when I extract them from the same '''MP4''' file with:
 ffmpeg  -i input.mp4 -c:s text   output.srt
 apart from they have no break lines (opened in '''Notepad''') it seems Ok
 in '''Notepad++''', but it says it's UTF-8 but whithout '''BOM''', and if
 I try to convert them to '''IDX/SUB''' with '''Txt2VobSub''', all special
 characters (like accents) are wrong.

 Here http://www.trustfm.net/software/video/Txt2Vobsub.php?page=Features it
 says '''Txt2Vobsub''' does not support UTF8 without '''BOM''' (it does,
 but wrong), so I need no add that '''BOM''' manually with '''Nopetad++''',
 and then accents are back!

 According to this http://forum.doom9.org/showthread.php?t=152419
 '''-bom''' option existed before??

 I haven't tried yet, but if I extract subtitles from a '''MKV''' file, it
 also would be whitout '''BOM'''?

 How to reproduce:
 % ffmpeg -i input ... output
 ffmpeg version
 built on ...
 Patches should be submitted to the ffmpeg-devel mailing list and not this
 bug tracker.

Ticket URL: <https://trac.ffmpeg.org/ticket/5846>
FFmpeg <https://ffmpeg.org>
FFmpeg issue tracker

More information about the FFmpeg-trac mailing list