Adding data source to a GPT model deployment

Ionut Dutescu 60 Reputation points
2024-03-15T10:06:51.5233333+00:00

When adding a new data source to an existing GPT model deployment, I noticed that I am required to link the model to a blob storage. I have also read on the Azure OpenAI blog that it does not support .xlsx files. How can I transform my .xlsx table into data that can be fed into the model via blob storage? Also, is there a preferred storage format, that is, does the model work best with a specific file extension, or does it work the same with all file types?

Thanks!

Azure AI Search
Azure Blob Storage
Azure OpenAI Service
Azure AI services

3 answers

  1. AshokPeddakotla-MSFT 35,971 Reputation points Moderator
    2024-03-15T11:47:52.38+00:00

    Ionut Dutescu, greetings and welcome to the Microsoft Q&A forum!

    When adding a new data source to an existing GPT model deployment, I notice that it requires me to link the model to a blob storage. I have also read on the Azure OpenAI blog that it does not support .xlsx files.

    Yes. Depending on the data source you choose, you need to create the corresponding resource (for example, the blob storage) beforehand so that it can be linked.

    I have also read on the Azure OpenAI blog that it does not support .xlsx files. How can I transform my .xlsx table into data that can be fed into the model via blob storage? Also, is there a preferred storage format, that is, does the model work best with a specific file extension, or does it work the same with all file types?

    Please note that Azure OpenAI On Your Data supports the following file types:

    • .txt
    • .md
    • .html
    • .docx
    • .pptx
    • .pdf

    You can convert your .xlsx table into one of the supported formats listed above and then feed it to the model via blob storage.

    While converting data from an unsupported format into a supported format, make sure the conversion:

    • Doesn't lead to significant data loss.
    • Doesn't add unexpected noise to your data.
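As one way to do such a conversion, here is a minimal sketch that turns spreadsheet rows into a Markdown table (.md is on the supported list). It assumes the third-party openpyxl package for reading the workbook; the function names `load_rows_from_xlsx` and `rows_to_markdown` are made up for this example and are not part of any Azure tooling. Empty cells become empty strings and pipe characters are escaped, so the conversion does not drop rows or add stray columns.

```python
from typing import List, Sequence


def load_rows_from_xlsx(path: str) -> List[tuple]:
    """Read every row of the active worksheet as a tuple of cell values.

    Assumption: the third-party openpyxl package is installed.
    """
    from openpyxl import load_workbook  # imported lazily; only needed here

    workbook = load_workbook(path, read_only=True, data_only=True)
    sheet = workbook.active
    return list(sheet.iter_rows(values_only=True))


def _cell_to_text(value: object) -> str:
    """Render one cell as text that is safe inside a Markdown table."""
    if value is None:
        return ""
    # Escape pipes and flatten line breaks so a cell cannot break the table.
    return str(value).replace("|", "\\|").replace("\n", " ").strip()


def rows_to_markdown(rows: Sequence[Sequence[object]]) -> str:
    """Convert rows (first row = header) into a Markdown table string."""
    if not rows:
        return ""
    width = max(len(row) for row in rows)

    def format_row(row: Sequence[object]) -> str:
        cells = [_cell_to_text(value) for value in row]
        cells += [""] * (width - len(cells))  # pad short rows
        return "| " + " | ".join(cells) + " |"

    separator = "| " + " | ".join(["---"] * width) + " |"
    lines = [format_row(rows[0]), separator]
    lines += [format_row(row) for row in rows[1:]]
    return "\n".join(lines) + "\n"
```

A typical use would be to call `rows_to_markdown(load_rows_from_xlsx("table.xlsx"))`, write the result to a .md file, and upload that file to the blob container linked to the deployment. The file name here is only a placeholder.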

    Please also see the data preparation script, which chunks the data so that the model's responses are more accurate.
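To give a rough idea of what chunking means, the sketch below splits a text into overlapping pieces. It is not the data preparation script itself: the function name `chunk_text`, the character-based sizes, and the default values are assumptions chosen for illustration, and the real script may split on tokens or document structure instead.

```python
from typing import List


def chunk_text(text: str, max_chars: int = 1000, overlap: int = 100) -> List[str]:
    """Split text into chunks of at most max_chars characters.

    Consecutive chunks share `overlap` characters so that a sentence cut at
    a chunk boundary still appears in full in at least one neighbour.
    """
    if max_chars <= 0:
        raise ValueError("max_chars must be positive")
    if not 0 <= overlap < max_chars:
        raise ValueError("overlap must be in the range [0, max_chars)")

    chunks: List[str] = []
    step = max_chars - overlap
    start = 0
    while start < len(text):
        chunks.append(text[start:start + max_chars])
        if start + max_chars >= len(text):
            break  # the last chunk already reaches the end of the text
        start += step
    return chunks
```

With this approach a single document yields several chunks, each of which can be indexed and retrieved separately. Larger chunks keep more context together, while smaller ones tend to make retrieval more precise.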

    I hope this helps. Do let me know if you have any further queries.

    If the response helped, please click "Accept Answer" and select "Yes" for "Was this answer helpful?".


  2. Ionut Dutescu 60 Reputation points
    2024-03-22T11:18:00.92+00:00

    Hello,

    I was wondering if you were able to check my question?

    Thanks,

    Have a nice day!

  3. Ionut Dutescu 60 Reputation points
    2024-03-25T14:03:24.4133333+00:00

    Hello,

    I was wondering how the model accesses the data. In the playground chat, the references show the files used from the blob storage. However, I have only one file in the storage, and the references show it split into multiple files. Why is this? It seems the model cannot read the whole file and misses details from parts of it. How can I avoid this split?

    Thanks!

