DocumentIntelligenceClient Class
DocumentIntelligenceClient.
Constructor
DocumentIntelligenceClient(endpoint: str, credential: AzureKeyCredential | TokenCredential, **kwargs: Any)
Parameters
| Name | Description |
|---|---|
| endpoint (Required) | The Document Intelligence service endpoint. |
| credential (Required) | Credential the client uses to connect to Azure. Either an AzureKeyCredential or a TokenCredential. |
Keyword-Only Parameters
| Name | Description |
|---|---|
| api_version | The API version to use for this operation. Default value is "2024-07-31-preview". Overriding this default may result in unsupported behavior. |
| polling_interval | Default waiting time between two polls for long-running operations (LROs) if no Retry-After header is present. |
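As a minimal sketch of the constructor described above, the helper below builds a client with key authentication. The environment variable names and the `load_settings` and `make_client` helpers are hypothetical, and the 5-second polling interval is an arbitrary illustration, not a documented default.

```python
from typing import Mapping, Tuple


def load_settings(env: Mapping[str, str]) -> Tuple[str, str]:
    """Read the endpoint and key from a mapping such as os.environ.

    The variable names are hypothetical, not an SDK convention.
    """
    try:
        endpoint = env["DOCUMENTINTELLIGENCE_ENDPOINT"]
        key = env["DOCUMENTINTELLIGENCE_API_KEY"]
    except KeyError as missing:
        raise RuntimeError(f"Missing setting: {missing.args[0]}") from None
    return endpoint, key


def make_client(endpoint: str, key: str, polling_interval: int = 5):
    """Build a DocumentIntelligenceClient that authenticates with an API key."""
    # Imported lazily so load_settings stays usable without the SDK installed.
    from azure.ai.documentintelligence import DocumentIntelligenceClient
    from azure.core.credentials import AzureKeyCredential

    return DocumentIntelligenceClient(
        endpoint=endpoint,
        credential=AzureKeyCredential(key),
        polling_interval=polling_interval,  # seconds between LRO polls
    )
```

A typical call would be `make_client(*load_settings(os.environ))`. A TokenCredential can be passed as `credential` instead of the key credential.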
Methods
| Name | Description |
|---|---|
| begin_analyze_batch_documents | |
| begin_analyze_document | Analyzes a document with a document model. |
| begin_classify_document | Classifies a document with a document classifier. |
| close | |
| get_analyze_result_figure | Gets the generated cropped image of the specified figure from document analysis. |
| get_analyze_result_pdf | Gets the generated searchable PDF output from document analysis. |
| send_request | Runs the network request through the client's chained policies. For more information on this code flow, see https://aka.ms/azsdk/dpcodegen/python/send_request |
begin_analyze_batch_documents
begin_analyze_batch_documents(model_id: str, analyze_batch_request: AnalyzeBatchDocumentsRequest | MutableMapping[str, Any] | IO[bytes] | None = None, *, pages: str | None = None, locale: str | None = None, string_index_type: str | StringIndexType | None = None, features: List[str | DocumentAnalysisFeature] | None = None, query_fields: List[str] | None = None, output_content_format: str | ContentFormat | None = None, output: List[str | AnalyzeOutputOption] | None = None, **kwargs: Any) -> LROPoller[AnalyzeBatchResult]
Parameters
| Name | Description |
|---|---|
| model_id (Required) | |
| analyze_batch_request | Default value: None |
Keyword-Only Parameters
| Name | Description |
|---|---|
| pages | Default value: None |
| locale | Default value: None |
| string_index_type | Default value: None |
| features | Default value: None |
| query_fields | Default value: None |
| output_content_format | Default value: None |
| output | Default value: None |
begin_analyze_document
Analyzes a document with a document model.
begin_analyze_document(model_id: str, analyze_request: AnalyzeDocumentRequest | MutableMapping[str, Any] | IO[bytes] | None = None, *, pages: str | None = None, locale: str | None = None, string_index_type: str | StringIndexType | None = None, features: List[str | DocumentAnalysisFeature] | None = None, query_fields: List[str] | None = None, output_content_format: str | ContentFormat | None = None, output: List[str | AnalyzeOutputOption] | None = None, **kwargs: Any) -> AnalyzeDocumentLROPoller[AnalyzeResult]
Parameters
| Name | Description |
|---|---|
| model_id (Required) | Unique document model name. |
| analyze_request | Analyze request parameters. One of the following types: AnalyzeDocumentRequest, JSON, IO[bytes]. Default value: None |
Keyword-Only Parameters
| Name | Description |
|---|---|
| pages | Range of 1-based page numbers to analyze. Example: "1-3,5,7-9". Default value: None |
| locale | Locale hint for text recognition and document analysis. The value may contain only the language code (for example "en", "fr") or a BCP 47 language tag (for example "en-US"). Default value: None |
| string_index_type | Method used to compute string offset and length. Known values are: "textElements", "unicodeCodePoint", and "utf16CodeUnit". Default value: None |
| features | List of optional analysis features. Default value: None |
| query_fields | List of additional fields to extract. Example: "NumberOfGuests,StoreNumber". Default value: None |
| output_content_format | str or ContentFormat. Format of the analyze result top-level content. Known values are: "text" and "markdown". Default value: None |
| output | Additional outputs to generate during analysis. Default value: None |
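The `pages` value is a comma-separated list of single pages and inclusive ranges. As an illustrative sketch, the hypothetical helper `format_pages` (not part of the SDK) collapses a list of 1-based page numbers into that form.

```python
from typing import Iterable, List, Optional


def format_pages(pages: Iterable[int]) -> str:
    """Collapse 1-based page numbers into a range string such as "1-3,5,7-9"."""
    numbers = sorted(set(pages))
    if numbers and numbers[0] < 1:
        raise ValueError("page numbers are 1-based")

    parts: List[str] = []
    start: Optional[int] = None
    prev: Optional[int] = None
    for n in numbers:
        if start is None:
            start = prev = n
        elif n == prev + 1:
            prev = n  # still inside the current consecutive run
        else:
            # Close the current run and start a new one.
            parts.append(str(start) if start == prev else f"{start}-{prev}")
            start = prev = n
    if start is not None:
        parts.append(str(start) if start == prev else f"{start}-{prev}")
    return ",".join(parts)


print(format_pages([1, 2, 3, 5, 7, 8, 9]))  # 1-3,5,7-9
```

The resulting string can be passed as the `pages` keyword argument.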
Returns
| Type | Description |
|---|---|
| AnalyzeDocumentLROPoller[AnalyzeResult] | An instance of AnalyzeDocumentLROPoller that returns AnalyzeResult. The AnalyzeResult is compatible with MutableMapping. |
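The following sketch shows one way to submit a local file and wait for the result. `analyze_file` is a hypothetical helper. It takes the client as a parameter, so any object exposing `begin_analyze_document` works. The `content_type` keyword is not listed in the signature above; it is assumed here to be accepted through `**kwargs` for binary uploads.

```python
from typing import Any, Optional


def analyze_file(client: Any, model_id: str, path: str,
                 pages: Optional[str] = None) -> Any:
    """Submit a local file for analysis and block until the result is ready."""
    with open(path, "rb") as stream:
        poller = client.begin_analyze_document(
            model_id,
            stream,  # IO[bytes] is one of the accepted analyze_request types
            pages=pages,
            # Assumption: binary bodies need this content type, passed via **kwargs.
            content_type="application/octet-stream",
        )
        # Conservative choice: keep the file open until the operation finishes.
        return poller.result()
```

Because the returned AnalyzeResult is compatible with MutableMapping, its fields can be read with mapping syntax on the value this helper returns.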
Exceptions
| Type | Description |
|---|---|
begin_classify_document
Classifies a document with a document classifier.
begin_classify_document(classifier_id: str, classify_request: ClassifyDocumentRequest | MutableMapping[str, Any] | IO[bytes], *, string_index_type: str | StringIndexType | None = None, split: str | SplitMode | None = None, pages: str | None = None, **kwargs: Any) -> LROPoller[AnalyzeResult]
Parameters
| Name | Description |
|---|---|
| classifier_id (Required) | Unique document classifier name. |
| classify_request (Required) | Classify request parameters. One of the following types: ClassifyDocumentRequest, JSON, IO[bytes]. |
Keyword-Only Parameters
| Name | Description |
|---|---|
| string_index_type | Method used to compute string offset and length. Known values are: "textElements", "unicodeCodePoint", and "utf16CodeUnit". Default value: None |
| split | Document splitting mode. Known values are: "auto", "none", and "perPage". Default value: None |
| pages | Range of 1-based page numbers to analyze. Example: "1-3,5,7-9". Default value: None |
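The `string_index_type` values name different units for counting offsets and lengths. The sketch below shows why the choice matters, using plain Python string operations rather than any service call. The helper names are illustrative. "textElements" is omitted because counting user-perceived characters needs a grapheme segmentation library.

```python
def code_point_length(text: str) -> int:
    """Length in Unicode code points; Python's len() counts these."""
    return len(text)


def utf16_length(text: str) -> int:
    """Length in UTF-16 code units (each unit is two bytes)."""
    return len(text.encode("utf-16-le")) // 2


# The emoji lies outside the Basic Multilingual Plane, so it takes
# one code point but two UTF-16 code units.
sample = "ok \N{GRINNING FACE}"
print(code_point_length(sample), utf16_length(sample))  # 4 5
```

For Python callers, "unicodeCodePoint" lines up with native string indexing, so offsets in that unit can be used as slice indices directly.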
Returns
| Type | Description |
|---|---|
| LROPoller[AnalyzeResult] | An instance of LROPoller that returns AnalyzeResult. The AnalyzeResult is compatible with MutableMapping. |
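A classification call can be sketched in the same way. `classify_file` and `KNOWN_SPLIT_MODES` are hypothetical names. The split values come from the parameter table above, and the `content_type` keyword is an assumption passed through `**kwargs`.

```python
from typing import Any, Optional

# Split modes listed in the keyword-only parameter table.
KNOWN_SPLIT_MODES = ("auto", "none", "perPage")


def classify_file(client: Any, classifier_id: str, path: str,
                  split: Optional[str] = None) -> Any:
    """Classify a local file and block until the result is ready."""
    if split is not None and split not in KNOWN_SPLIT_MODES:
        raise ValueError(f"split must be one of {KNOWN_SPLIT_MODES}, got {split!r}")
    with open(path, "rb") as stream:
        poller = client.begin_classify_document(
            classifier_id,
            stream,  # IO[bytes] is one of the accepted classify_request types
            split=split,
            # Assumption: binary bodies need this content type, passed via **kwargs.
            content_type="application/octet-stream",
        )
        return poller.result()
```

Validating `split` locally fails fast on a typo before any request is sent. Since the parameter is typed as `str | SplitMode`, a stricter check may reject values that a newer service version accepts.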
Exceptions
| Type | Description |
|---|---|
close
close() -> None
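Since `close()` takes no arguments, the standard library's `contextlib.closing` can ensure it is called even when an operation raises. The sketch below assumes only that the client has a `close()` method. `with_client` is a hypothetical helper.

```python
from contextlib import closing
from typing import Any, Callable


def with_client(factory: Callable[[], Any], work: Callable[[Any], Any]) -> Any:
    """Create a client, run work(client), and always call client.close()."""
    with closing(factory()) as client:
        return work(client)
```

Here `factory` would be a zero-argument callable that constructs the client, and `work` receives the open client.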
get_analyze_result_figure
Gets the generated cropped image of the specified figure from document analysis.
get_analyze_result_figure(model_id: str, result_id: str, figure_id: str, **kwargs: Any) -> Iterator[bytes]
Parameters
| Name | Description |
|---|---|
| model_id (Required) | Unique document model name. |
| result_id (Required) | Analyze operation result ID. |
| figure_id (Required) | Figure ID. |
Returns
| Type | Description |
|---|---|
| Iterator[bytes] | |
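Because the method returns an iterator of byte chunks rather than a single bytes object, the caller has to assemble the image. A minimal sketch, assuming the figure is small enough to hold in memory and that `result_id` and `figure_id` are already known. `fetch_figure` is a hypothetical helper.

```python
from typing import Any


def fetch_figure(client: Any, model_id: str, result_id: str, figure_id: str) -> bytes:
    """Download one cropped figure image and return it as a single bytes object."""
    chunks = client.get_analyze_result_figure(model_id, result_id, figure_id)
    return b"".join(chunks)
```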
Exceptions
| Type | Description |
|---|---|
get_analyze_result_pdf
Gets the generated searchable PDF output from document analysis.
get_analyze_result_pdf(model_id: str, result_id: str, **kwargs: Any) -> Iterator[bytes]
Parameters
| Name | Description |
|---|---|
| model_id (Required) | Unique document model name. |
| result_id (Required) | Analyze operation result ID. |
Returns
| Type | Description |
|---|---|
| Iterator[bytes] | |
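A searchable PDF can be large, so writing the chunks to disk as they arrive avoids holding the whole file in memory. The sketch below works on any iterable of bytes. `save_stream` is a hypothetical helper, not an SDK function.

```python
from pathlib import Path
from typing import Iterable


def save_stream(chunks: Iterable[bytes], destination: str) -> int:
    """Write byte chunks to destination and return the number of bytes written."""
    written = 0
    with Path(destination).open("wb") as out:
        for chunk in chunks:
            written += out.write(chunk)
    return written
```

With a client in hand, the call would look like `save_stream(client.get_analyze_result_pdf(model_id, result_id), "output.pdf")`, where the output file name is only an example.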
Exceptions
| Type | Description |
|---|---|
send_request
Runs the network request through the client's chained policies.
>>> from azure.core.rest import HttpRequest
>>> request = HttpRequest("GET", "https://www.example.org/")
<HttpRequest [GET], url: 'https://www.example.org/'>
>>> response = client.send_request(request)
<HttpResponse: 200 OK>
For more information on this code flow, see https://aka.ms/azsdk/dpcodegen/python/send_request
send_request(request: HttpRequest, *, stream: bool = False, **kwargs: Any) -> HttpResponse
Parameters
| Name | Description |
|---|---|
| request (Required) | The network request you want to make. |
Keyword-Only Parameters
| Name | Description |
|---|---|
| stream | Whether the response payload will be streamed. Default value: False |
Returns
| Type | Description |
|---|---|
| HttpResponse | The response of your network call. Does not do error handling on your response. |