Language support for Text Analytics for health

Use this article to learn which natural languages are supported by Text Analytics for health and its Docker container.

Hosted API Service

The hosted API service supports English language, model version 03-01-2022. Additional languages, English, Spanish, French, German Italian, Portuguese and Hebrew are supported with model version 2022-08-15-preview.

When structuring the API request, the relevant language tags must be added for these languages:

English – “en”
Spanish – “es”
French  - “fr”
German – “de”
Italian – “it”
Portuguese – “pt”
Hebrew – “he”
json

{
    "analysisInput": {
        "documents": [
            {
                "text": "El médico prescrió 200 mg de ibuprofeno.",
                "language": "es",
                "id": "1"
            }
        ]
    },
    "tasks": [
        {
            "taskName": "analyze 1",
            "kind": "Healthcare",
            "parameters":
            {
            "modelVersion": "2022-08-15-preview"
            }
        }
    ]
}

Docker container

The docker container supports English language, model version 2022-03-01. Additional languages are also supported when using a docker container to deploy the API: Spanish, French, German Italian, Portuguese and Hebrew. This functionality is currently in preview, model version 2022-08-15-preview. Full details for deploying the service in a container can be found here.

In order to download the new container images from the Microsoft public container registry, use the docker pull command, as follows :

For English, Spanish, Italian, French, German and Portuguese:

docker pull mcr.microsoft.com/azure-cognitive-services/textanalytics/healthcare:latin

For Hebrew:

docker pull mcr.microsoft.com/azure-cognitive-services/textanalytics/healthcare:semitic

When structuring the API request, the relevant language tags must be added for these languages:

English – “en”
Spanish – “es”
French  - “fr”
German – “de”
Italian – “it”
Portuguese – “pt”
Hebrew – “he”

The following json is an example of a JSON file attached to the Language request's POST body, for a Spanish document:

json

{
    "analysisInput": {
        "documents": [
            {
                "text": "El médico prescrió 200 mg de ibuprofeno.",
                "language": "es",
                "id": "1"
            }
        ]
    },
    "tasks": [
        {
            "taskName": "analyze 1",
            "kind": "Healthcare",
        }
    ]
}

Details of the supported model versions for each language:

Language Code Model Version: Featured Tag Specific Tag
en 2022-03-01 latest 3.0.59413252-onprem-amd64
en,es,it,fr,de,pt 2022-08-15-preview latin 3.0.60903415-latin-onprem-amd64
he 2022-08-15-preview semitic 3.0.60903415-semitic-onprem-amd64

See also

Text Analytics for health overview