class hocrPage

Last modified: January 04, 2015

hocrPage

In module gamera.toolkits.ocr.classes

A class derived from Page that overrides the page_to_lines method. Instead of using bbox_merging, page_to_lines detects the textlines by reading the segmentation information from an hOCR file.
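To illustrate what kind of segmentation information an hOCR file carries, here is a minimal sketch that collects the bounding boxes of all elements marked with the hOCR class ocr_line. It is not the toolkit's own parsing code: the names LineBoxParser, read_line_boxes, and SAMPLE are hypothetical, and the sketch uses only the Python 3 standard library.

```python
import re
from html.parser import HTMLParser

# hOCR stores geometry in the title attribute, e.g. title="bbox x0 y0 x1 y1"
BBOX_RE = re.compile(r"bbox\s+(\d+)\s+(\d+)\s+(\d+)\s+(\d+)")


class LineBoxParser(HTMLParser):
    """Hypothetical helper: collects the bbox of every ocr_line element."""

    def __init__(self):
        super().__init__()
        self.boxes = []

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        classes = (attrs.get("class") or "").split()
        if "ocr_line" not in classes:
            return  # skip pages, paragraphs, words, and so on
        match = BBOX_RE.search(attrs.get("title") or "")
        if match:
            self.boxes.append(tuple(int(v) for v in match.groups()))


def read_line_boxes(hocr_text):
    """Return a list of (x0, y0, x1, y1) tuples, one per textline."""
    parser = LineBoxParser()
    parser.feed(hocr_text)
    return parser.boxes


# Made-up sample data, for illustration only
SAMPLE = """<div class='ocr_page' title='bbox 0 0 800 600'>
<span class='ocr_line' title='bbox 10 20 300 50'>first line</span>
<span class='ocr_line' title='bbox 10 60 280 90; baseline 0 -5'>second line</span>
</div>"""

boxes = read_line_boxes(SAMPLE)
```

Each tuple describes one textline rectangle in pixel coordinates; a hOCR-based page_to_lines would presumably work from rectangles like these instead of merging connected-component bounding boxes.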

__init__

Like Page.__init__, but with the additional required argument hocr_in_path, the name of the hOCR file from which the textline segmentation is read. Note that the constructor does not perform the segmentation; for this, you must call the segment method.

Signature:

__init__ (image, hocr_in_path, glyphs=None, classify_ccs=None)
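A possible way to use the constructor together with segment is sketched below. The wrapper function segment_with_hocr and its parameter names are hypothetical, the imports are deferred into the function so that the snippet can be loaded without Gamera installed, and it is assumed that the image file is already of a pixel type accepted by Page.

```python
def segment_with_hocr(image_path, hocr_path):
    """Hypothetical wrapper: build a hocrPage and run its segmentation."""
    # Deferred imports: these require Gamera and its OCR toolkit
    from gamera.core import init_gamera, load_image
    from gamera.toolkits.ocr.classes import hocrPage

    init_gamera()
    image = load_image(image_path)  # assumed to be suitable for Page as-is

    # The constructor only stores its arguments ...
    page = hocrPage(image, hocr_path)
    # ... the textline segmentation happens in segment()
    page.segment()
    return page
```

Calling segment explicitly matters here: without it, the hOCR file named by hocr_in_path is never consulted.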