I know that the *.pseg.png files store the result of the automatic line segmentation. I have seen mentions on the Web of using GIMP to edit this segmentation, but without any details on how this is done. When I open these files in GIMP, I see nothing that shows the segmentation, nor anything that I can edit manually.
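To make concrete what I expect these files to contain, here is a minimal sketch for inspecting one. It rests on an assumption that I have not confirmed against the OCRopus source: that each pixel's RGB value, packed as 0xRRGGBB, is a line label of the form 0x010000 + line number, on a white (0xFFFFFF) background. If that holds, the lines would show up in GIMP as nearly black regions whose colours differ by a single unit, which could explain why nothing useful is visible. The names `rgb_to_label`, `line_boxes` and `load_rows` are mine, not part of OCRopus, and `load_rows` needs the third-party Pillow package.

```python
WHITE = 0xFFFFFF  # assumed background value in *.pseg.png


def rgb_to_label(r, g, b):
    """Pack one RGB pixel into a single integer 0xRRGGBB."""
    return (r << 16) | (g << 8) | b


def line_boxes(rows):
    """Map each non-background label to its inclusive box (x0, y0, x1, y1)."""
    boxes = {}
    for y, row in enumerate(rows):
        for x, (r, g, b) in enumerate(row):
            label = rgb_to_label(r, g, b)
            if label in (0, WHITE):  # treat black and white as background
                continue
            x0, y0, x1, y1 = boxes.get(label, (x, y, x, y))
            boxes[label] = (min(x0, x), min(y0, y), max(x1, x), max(y1, y))
    return boxes


def load_rows(path):
    """Hypothetical helper: read a PNG into rows of (r, g, b) tuples."""
    from PIL import Image  # imported lazily so the rest runs without Pillow
    img = Image.open(path).convert("RGB")
    width, height = img.size
    px = img.load()
    return [[px[x, y] for x in range(width)] for y in range(height)]


# Tiny synthetic page standing in for a real file: two lines on white.
W = (255, 255, 255)
A = (1, 0, 1)  # assumed encoding of line 1
B = (1, 0, 2)  # assumed encoding of line 2
page = [
    [W, A, A, W],
    [W, W, W, W],
    [B, B, W, W],
]
boxes = line_boxes(page)
for label in sorted(boxes):
    # The low 16 bits are the line number under the assumed encoding.
    print(hex(label), "line", label & 0xFFFF, boxes[label])
```

On a real page, `line_boxes(load_rows("0001.pseg.png"))` would list one box per labelled line; the file name here is only a placeholder.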
For background: I am trying to do OCR for some documents that are in a poor state, with smudges and faded ink, and no matter how much image preprocessing I do, automatic segmentation fails on at least some parts of the page. I see manual adjustment as the only viable way forward. That is, I would like to manually remove lines, add new lines, change the positions of lines, and ideally also change the order of lines, before the actual OCR is done. Can I insert this manual correction into the usual OCRopus workflow?
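To illustrate the kind of correction I have in mind, here is a rough sketch of removing and reordering lines by rewriting labels. It works on a plain grid of integer labels and assumes the same unconfirmed encoding, 0x010000 + line number, with 0 as background. The function `relabel` and its parameters are hypothetical names of my own, not an OCRopus API, and I do not know whether the later OCRopus steps would accept a file modified this way.

```python
BASE = 0x010000  # assumed prefix of every line label


def relabel(labels, new_order, background=0):
    """Renumber lines to follow new_order and erase every line not listed.

    labels: 2D list of integer pixel labels.
    new_order: old line numbers, given in the desired reading order.
    """
    mapping = {
        BASE + old: BASE + new
        for new, old in enumerate(new_order, start=1)
    }
    return [[mapping.get(value, background) for value in row] for row in labels]


# Three detected lines; keep line 3 first, then line 1, and drop line 2.
grid = [
    [0, BASE + 1, BASE + 1],
    [BASE + 2, BASE + 2, 0],
    [0, BASE + 3, BASE + 3],
]
fixed = relabel(grid, new_order=[3, 1])
for row in fixed:
    print([hex(value) for value in row])
```

Adding a new line or moving an existing one would mean painting pixels with a chosen label rather than remapping labels, which is the part I had hoped to do in GIMP.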
Thanks in advance for your time.
Mark-Jan Nederhof