Hello @Dadfar, Reza, thanks for using the Microsoft Q&A platform.
Yes, this has been a known issue with the Markdown output when using Document Intelligence Studio or the Python SDK to analyze a PDF document with the prebuilt-layout model and a specific page or page range.
The latest update from the product team today is that it has started working after a fix. I reproduced the scenario with your sample document by specifying a page range and was able to get the Markdown output. Could you please try the same on your end?
Regarding your question,
Is there a way to directly extract the Markdown file from the Document Intelligence studio without using the SDK?
As shown in the screenshot here, you can either copy the content or download the JSON result from the Studio and extract the required data from there.
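If you go the JSON route, a small script can pull the text out of the downloaded file. The following is only a minimal sketch: it assumes the downloaded JSON carries the document text in a `content` field, either at the top level or nested under `analyzeResult`, and that the analysis was run with Markdown as the output format (otherwise `content` would be plain text). The function names and file paths are made up for illustration.

```python
import json
from pathlib import Path


def extract_markdown(result: dict) -> str:
    """Return the 'content' string from an analysis result dictionary."""
    # Assumption: the payload is either nested under "analyzeResult"
    # or is itself the top-level object.
    analyze_result = result.get("analyzeResult", result)
    content = analyze_result.get("content")
    if content is None:
        raise KeyError("no 'content' field found in the result JSON")
    return content


def json_to_markdown_file(json_path: str, md_path: str) -> None:
    """Read a downloaded result JSON and write its content to a .md file."""
    result = json.loads(Path(json_path).read_text(encoding="utf-8"))
    Path(md_path).write_text(extract_markdown(result), encoding="utf-8")
```

For example, `json_to_markdown_file("result.json", "result.md")` would write the extracted text next to the downloaded file, where both file names are placeholders.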
The LangChain API and the example provided seem to work only with individual files. Is there an API available for processing folders containing several files?
We don't have much information about this LangChain API. You could raise this question here for better assistance: https://github.com/Azure-Samples/function-python-ai-langchain/issues
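In the meantime, and independent of LangChain, one common workaround is to loop over the folder yourself and call whatever single-file routine you already have. The sketch below is a generic illustration, not a LangChain or Document Intelligence API: `analyze_one` is a hypothetical callable that you would supply (for example, a wrapper around your existing single-file analysis code), and the `*.pdf` pattern is an assumption about your files.

```python
from pathlib import Path
from typing import Callable, Dict


def analyze_folder(
    folder: str,
    analyze_one: Callable[[Path], str],
    pattern: str = "*.pdf",
) -> Dict[str, str]:
    """Apply a single-file routine to every matching file in a folder.

    Returns a dictionary mapping each file name to the routine's result.
    """
    results: Dict[str, str] = {}
    # sorted() keeps the processing order deterministic.
    for path in sorted(Path(folder).glob(pattern)):
        if path.is_file():
            results[path.name] = analyze_one(path)
    return results
```

The loop only looks at the top level of the folder; switching `glob` to `rglob` would also include subfolders.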
I hope this helps.
Regards,
Vasavi
Please accept the answer and vote 'Yes' if you found it helpful, to support the community. Thanks.