Hello @Syedbabar Ali
Thanks for the question. Here are some recommendations for formatting and structuring your own data for use with the Azure OpenAI service based on the docs
-Azure OpenAI on your data supports the following filetypes:
-
.txt
-
.md
-
.html
- Microsoft Word files
- Microsoft PowerPoint files
-There is an upload limit, and there are some caveats about document structure and how it might affect the quality of responses from the model:
- The model provides the best citation titles from markdown (
.md
) files. - If a document is a PDF file, the text contents are extracted as a preprocessing step (unless you're connecting your own Azure Cognitive Search index). If your document contains images, graphs, or other visual content, the model's response quality depends on the quality of the text that can be extracted from them.
- If you're converting data from an unsupported format into a supported format, make sure the conversion:
- Doesn't lead to significant data loss.
- Doesn't add unexpected noise to your data.
- Doesn't lead to significant data loss.
Hope that helps.
-Grace