DocumentWord Class

Definition

A word object consisting of a contiguous sequence of characters. For non-space delimited languages, such as Chinese, Japanese, and Korean, each character is represented as its own word.

public class DocumentWord
type DocumentWord = class
Public Class DocumentWord
Inheritance
DocumentWord

Properties

BoundingPolygon

The polygon that outlines the content of this word. Coordinates are specified relative to the top-left of the page, and points are ordered clockwise from the left relative to the word orientation. Units are in pixels for images and inches for PDF. The DocumentPageLengthUnit type of a recognized page can be found at Unit.

Confidence

Confidence of correctly extracting the word.

Content

Text content of the word.

Span

Location of the word in the reading order concatenated content.

Applies to