[FFmpeg-trac] #10378(undetermined:new): Feature Request: positional ocr of dvd and bd subs (in combination with tesseract)

FFmpeg trac at avcodec.org
Mon May 22 01:16:26 EEST 2023


#10378: Feature Request:  positional ocr of dvd and bd subs (in combination with
tesseract)
-------------------------------------+-------------------------------------
             Reporter:  techguru     |                     Type:
                                     |  enhancement
               Status:  new          |                 Priority:  important
            Component:               |                  Version:
  undetermined                       |  unspecified
             Keywords:  frame data   |               Blocked By:
  ocr                                |
             Blocking:               |  Reproduced by developer:  0
Analyzed by developer:  0            |
-------------------------------------+-------------------------------------
 If you combine ffmpeg with tesseract (and maybe opencv), it should be
 possible to get positional data for the text in each frame. This has been
 a badly needed feature for many years for those of us who OCR DVD and BD
 subtitles.

 I have attached a very good example of subtitle files that make heavy use
 of positioning.

 (And yes, I understand there is no positional data in the sub file. That
 is where ffmpeg in conjunction with tesseract should come in: it should be
 able to work out the position of the text relative to the size of the
 frame.)

 The output subtitle would have to be .ass, since it is one of the more
 popular formats with positional awareness.
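
 To illustrate the kind of output meant here, below is a rough Python
 sketch of turning one OCR bounding box into a positioned .ass Dialogue
 line. It is only an illustration, not an existing ffmpeg or tesseract
 feature. The function names (ass_time, positioned_dialogue) are made up
 for this example, the box is assumed to be (left, top, width, height) in
 frame pixels, and the script's PlayResX/PlayResY are assumed to equal the
 frame size so that pixel coordinates can be used directly in \pos.

```python
def ass_time(seconds: float) -> str:
    """Format a time in seconds as an ASS timestamp, H:MM:SS.cc."""
    cs = round(seconds * 100)          # ASS uses centisecond precision
    h, rem = divmod(cs, 360000)
    m, rem = divmod(rem, 6000)
    s, c = divmod(rem, 100)
    return f"{h}:{m:02d}:{s:02d}.{c:02d}"


def positioned_dialogue(text, box, start, end, style="Default"):
    """Build one ASS Dialogue line placed at the center of an OCR box.

    box is a hypothetical (left, top, width, height) tuple in frame
    pixels; it is assumed that PlayResX/PlayResY match the frame size.
    """
    left, top, width, height = box
    # \an5 anchors the text at its own center, so \pos gets the box center.
    cx = left + width // 2
    cy = top + height // 2
    tags = f"{{\\an5\\pos({cx},{cy})}}"
    # Fields: Layer, Start, End, Style, Name, MarginL, MarginR, MarginV,
    # Effect, Text
    return (f"Dialogue: 0,{ass_time(start)},{ass_time(end)},"
            f"{style},,0,0,0,,{tags}{text}")


if __name__ == "__main__":
    # A 200x40 box whose top-left corner is at (860, 80) in the frame.
    print(positioned_dialogue("SIGN", (860, 80, 200, 40), 1.0, 3.5))
```

 A real implementation would also need the subtitle timing and the OCR
 step itself; this only shows the mapping from a box to \an and \pos tags.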
-- 
Ticket URL: <https://trac.ffmpeg.org/ticket/10378>
FFmpeg <https://ffmpeg.org>
FFmpeg issue tracker

