[FFmpeg-trac] #10378(undetermined:new): Feature Request: positional ocr of dvd and bd subs (in combination with tesseract)
FFmpeg
trac at avcodec.org
Mon May 22 01:16:26 EEST 2023
#10378: Feature Request: positional ocr of dvd and bd subs (in combination with
tesseract)
-------------------------------------+-------------------------------------
Reporter: techguru | Type:
| enhancement
Status: new | Priority: important
Component: | Version:
undetermined | unspecified
Keywords: frame data | Blocked By:
ocr |
Blocking: | Reproduced by developer: 0
Analyzed by developer: 0 |
-------------------------------------+-------------------------------------
if you combine ffmpeg and tesseract(and maybe opencv)....it should be
possible to get positional frame data of the text in the frame
and its been a BADLY needed feature for those of us that do OCRing of dvd
and bd subs for many many years now
I have attached a very good example of the sub files that has a lot of
this positional use
(and yes I understand theres no positional data in the sub file...thats
where ffmpeg in conjunction with tesseract should come in...it should be
able to find that data in relation to the size of the frame)
the output subtitle would have to be .ass since its one of the more
popular types with positional awareness
--
Ticket URL: <https://trac.ffmpeg.org/ticket/10378>
FFmpeg <https://ffmpeg.org>
FFmpeg issue tracker
More information about the FFmpeg-trac
mailing list