Bring your own data azure openai

Sariga Rahul 146 Reputation points
2023-07-12T10:03:38.1066667+00:00

Is there a suggested way to import website content to bring your own data. Also is there an api to upload documents to Bring your own data like in playground?

Azure OpenAI Service
Azure OpenAI Service
An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.
4,080 questions
{count} votes

Accepted answer
  1. AshokPeddakotla-MSFT 35,971 Reputation points Moderator
    2023-07-12T12:04:47.5333333+00:00

    Sariga Rahul Greetings!

    Is there a suggested way to import website content to bring your own data. Also is there an api to upload documents to Bring your own data like in playground?

    Did you check the documentations Azure OpenAI on your data (preview) and Rest API - Files already?

    Azure OpenAI on your data supports the following filetypes:

    • .txt
    • .md
    • .html
    • Microsoft Word files
    • Microsoft PowerPoint files
    • PDF

    There are some caveats about document structure and how it might affect the quality of responses from the model:

    The model provides the best citation titles from markdown (.md) files.

    If a document is a PDF file, the text contents are extracted as a preprocessing step (unless you're connecting your own Azure Cognitive Search index). If your document contains images, graphs, or other visual content, the model's response quality depends on the quality of the text that can be extracted from them.

    • If you're converting data from an unsupported format into a supported format, make sure the conversion:
      • Doesn't lead to significant data loss.
        • Doesn't add unexpected noise to your data.
        This will impact the quality of Azure Cognitive Search and the model response.

    Let us know if that helps or have any further queries.

    0 comments No comments

0 additional answers

Sort by: Most helpful

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.