Skip to content

Manually correcting segmentation #323

@nederhof

Description

@nederhof

I know that the files *.pseg.png store the coordinates of the automatic line segmentation. I have seen mention on the Web of the use of GIMP for manipulating these coordinates, but without further details on how this is done. When I open GIMP on these files, I see nothing that shows the segmentation, nor anything that I can edit manually.

For background: I am trying to do OCR for some documents that are in a poor state, with smudges and faded ink, and no matter how much image preprocessing I do, automatic segmentation fails on at least some parts of the page. I see manual adjustment as the only viable way forward. That is, I would like to manually remove lines, add new lines, change the positions of lines, and ideally also change the order of lines, before the actual OCR is done. Can I insert this manual correction into the usual OCRopus workflow?

Thanks in advance for your time.

Mark-Jan Nederhof

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions