Azure document Translation service

Thota, Bharath Babu 20 Reputation points
2025-04-12T00:08:44.0066667+00:00

Hi Microsoft Community,

I want to check how all achieve and overcome the inaccuracies in the Document Translation service. I chose the Azure Translation service for my PDF files translation into Chinese Traditional language. It showed high accuracy in text translation and, even worse, the text missing in some areas or parts of the document. For example, in the pictures below the highlighted areas are not translated even when they are in text type(Session 20 and Session 22). Does anyone know how to overcome this kind of problem while translating a bunch of PDF files? Kindly help me and suggest which translation service is better for PDF translation into Chinese traditional and Chinese simplified?

Thank you,tableofcontents

missing.png

Azure Translator
Azure Translator
An Azure service to easily conduct machine translation with a simple REST API call.
461 questions
{count} votes

Accepted answer
  1. Saideep Anchuri 6,185 Reputation points Microsoft External Staff
    2025-04-14T14:26:13.3966667+00:00

    Hi Thota, Bharath Babu

    The Azure Document Translation service can sometimes face challenges with accuracy, especially when translating PDF files.

    1. Text in Images: The Azure translation service does not translate text that is embedded in images within a document. If the highlighted areas you mentioned contain text in images, this would explain why they are not translated.
    2. Document Format: The quality of translation can vary depending on the format of the PDF. Native PDFs (those generated from digital file formats) generally provide better results compared to scanned PDFs, which may lose formatting and layout during translation.
    3. Mixed Language Input: If your PDF contains text in multiple languages, the translation might not be optimal, leading to some text being left untranslated or mistranslated.
    4. Character Count and Context: Machine translation systems often translate documents sentence by sentence without full context, which can lead to inaccuracies, especially with pronouns or gendered language.

    Kindly refer below link: requirements-and-limitations

    use-document-translation-sdk-in-your-applications

    Thank You.

    0 comments No comments

1 additional answer

Sort by: Most helpful
  1. Thota, Bharath Babu 20 Reputation points
    2025-04-15T14:43:43.5633333+00:00

    Thank you, It was helpful.

    0 comments No comments

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.