Extact Hyperlink in azure document intelligence ai

Taghizadegan Amir 0 Reputation points
2025-03-14T12:34:51.9+00:00

Hello everyone,

I wanted to ask whether it is possible to also extract and show the url of hyperlinks in the markdown generated by Azure document intelligence ai. I can only see the text the link is embedded into but I want also to fetch the url.

Thank you

Amir

Azure AI Document Intelligence
Azure AI Document Intelligence
An Azure service that turns documents into usable data. Previously known as Azure Form Recognizer.
2,102 questions
{count} votes

1 answer

Sort by: Most helpful
  1. Suwarna S Kale 3,391 Reputation points
    2025-03-14T17:39:37.11+00:00

    Hello Taghizadegan Amir,

    Thank you for posting your question in the Microsoft Q&A forum.

    As for the current capabilities of Azure Document Intelligence (formerly Form Recognizer), the service primarily focuses on extracting structured data, text, and key-value pairs from documents. While it can extract text that includes hyperlinks, it does not natively extract or separate the URLs from the hyperlink text in the output. However, you can implement a workaround to achieve this by combining the extracted text with additional processing logic.

    You may refer below documentations:

    https://learn.microsoft.com/en-us/azure/ai-services/document-intelligence/?view=doc-intel-4.0.0

    https://docs.python.org/3/library/re.html

    https://pypi.org/project/PyPDF2/

    While Azure Document Intelligence does not natively extract URLs from hyperlinks, you can use regular expressions or PDF parsing libraries to extract and combine URLs with the hyperlink text. By implementing this workaround, you can generate markdown output that includes both the hyperlink text and the associated URLs. For more advanced use cases, consider submitting a feature request to Microsoft to enhance the hyperlink extraction capabilities of Azure Document Intelligence.

    If the above answer helped, please do not forget to "Accept Answer" as this may help other community members to refer the info if facing a similar issue. Your contribution to the Microsoft Q&A community is highly appreciated.

    1 person found this answer helpful.
    0 comments No comments

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.