Platforms to show: All Mac Windows Linux Cross-Platform
TessPageIteratorMBS class
The iterator to walk over page content.
Class to iterate over tesseract page structure, providing access to all levels of the page hierarchy, without including any tesseract headers or having to handle any tesseract structures.
This is an abstract class. You can't create an instance, but you can get one from various plugin functions.
- 10 properties
- 9 methods
- method Baseline(Level as Integer, byref x1 as Integer, byref y1 as Integer, byref x2 as Integer, byref y2 as Integer) as Boolean
- method Begin
- method BinaryImage(Level as Integer) as PIcture
- method BoundingBox(Level as Integer, byref Left as Integer, byref Top as Integer, byref Right as Integer, byref Bottom as Integer) as Boolean
- method Constructor Private
- method CopyIterator as TessPageIteratorMBS
- method IsAtBeginningOf(Level as Integer) as Boolean
- method IsAtFinalElement(Level as Integer, element as Integer) as Boolean
- method MoveNext(Level as Integer) as Boolean
- 34 constants
Block Types
Constant |
Value |
Description |
kBlockTypeCaptionText
|
8 |
Text that belongs to an image.
|
kBlockTypeEquation
|
4 |
Partition belonging to an equation region.
|
kBlockTypeFlowingImage
|
9 |
Image that lives inside a column.
|
kBlockTypeFlowingText
|
1 |
Text that lives inside a column.
|
kBlockTypeHeadingImage
|
10 |
Image that spans more than one column.
|
kBlockTypeHeadingText
|
2 |
Text that spans more than one column.
|
kBlockTypeHorzLine
|
12 |
Horizontal Line.
|
kBlockTypeInlineEquation
|
5 |
Partition has inline equation.
|
kBlockTypeNoise
|
14 |
Lies outside of any column.
|
kBlockTypePulloutImage
|
11 |
Image that is in a cross-column pull-out region.
|
kBlockTypePulloutText
|
3 |
Text that is in a cross-column pull-out region.
|
kBlockTypeTable
|
6 |
Partition belonging to a table region.
|
kBlockTypeUnknown
|
0 |
Type is not yet known. Keep as the first element.
|
kBlockTypeVerticalText
|
7 |
Text-line runs vertically.
|
kBlockTypeVertLine
|
13 |
Vertical Line.
|
Justifications
Constant |
Value |
Description |
kJustificationCenter
|
2 |
The text lines of the paragraph are centered about a line going down through their middle of the text lines.
|
kJustificationLeft
|
1 |
Each line, except possibly the first, is flush to the same left tab stop.
|
kJustificationRight
|
3 |
Each line, except possibly the first, is flush to the same right tab stop.
|
kJustificationUnknown
|
0 |
Unknown.
|
Level
Constant |
Value |
Description |
kLevelBlock
|
0 |
Block
|
kLevelParagraph
|
1 |
Paragraph
|
kLevelSymbol
|
4 |
Symbol
|
kLevelTextline
|
2 |
Textline
|
kLevelWord
|
3 |
Word
|
Orientations
Constant |
Value |
Description |
kOrientationPageDown
|
2 |
Orientation is down.
|
kOrientationPageLeft
|
3 |
Orientation is left.
|
kOrientationPageRight
|
1 |
Orientation is right.
|
kOrientationPageUp
|
0 |
Orientation is up.
|
Text Line Order
Constant |
Value |
Description |
kTextlineOrderLefttoRight
|
0 |
Mongolian is written in vertical columns top to bottom like Chinese, but the lines order left-to right.
|
kTextlineOrderRighttoLeft
|
1 |
In Chinese, vertical text lines are read right-to-left.
|
kTextlineOrderToptoBottom
|
2 |
In English, the order is top-to-bottom.
|
Writing Direction
Constant |
Value |
Description |
kWritingDirectionLefttoRight
|
0 |
For English text, the writing direction is left-to-right.
|
kWritingDirectionRighttoLeft
|
1 |
For Arabic the text direction is right to left.
|
kWritingDirectionToptoBottom
|
2 |
For the Chinese text in the above example, the writing direction is top-to-bottom.
|
Sub classes:
Some methods using this class:
The items on this page are in the following plugins: MBS OCR Plugin.
TesseractResultIteratorMBS
-
TessResultIteratorMBS