What is the max amount of data an Azure OpenAI GPT-4 model can use?

Karl-Johan Olofsson 0 Reputation points
2024-02-27T14:20:57.9433333+00:00

Hello, I am interested in connecting a GPT-4 model to my own data through Azure OpenAI, and I am trying to understand the current limitations in terms of how much data a single GPT-4 model can process/access.

I am aware of the quotas and limits tables but need to understand them better (https://learn.microsoft.com/en-us/azure/ai-services/openai/quotas-limits)

The page says the max size per file upload is 16 MB, the max file size for Assistants is 512 MB, and the Assistants token limit is 2M tokens. Does this mean that one model can access at most 512 MB of data or 2 million tokens (minus the system message and model response), whichever is smaller, regardless of which data source I use (Azure AI Search, Blob storage, manual upload...)? Am I understanding this correctly? I am trying to estimate how much text data one model can process/access.
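For a rough sense of scale, the limits quoted above can be turned into a quick estimate. This is only a minimal sketch, not an official calculation: the figure of about 4 characters per token is a common rule of thumb for English text (an assumption, real tokenization varies), the megabyte is assumed to be binary (1024 * 1024 bytes), and the function names are made up for illustration.

```python
# Limits as quoted in the question (taken from the quotas and limits page).
MB = 1024 * 1024  # assumption: binary megabytes

MAX_UPLOAD_BYTES = 16 * MB            # max size per file upload
MAX_ASSISTANTS_FILE_BYTES = 512 * MB  # max file size for Assistants
MAX_ASSISTANTS_TOKENS = 2_000_000     # Assistants token limit

# Assumption: roughly 4 characters per token for English text.
# A real tokenizer will give different counts, especially for other languages.
CHARS_PER_TOKEN = 4


def estimate_tokens(text, chars_per_token=CHARS_PER_TOKEN):
    """Return a rough token estimate, rounded up."""
    return -(-len(text) // chars_per_token)  # ceiling division


def check_limits(text):
    """Compare a text's size and estimated tokens against the quoted limits."""
    size_bytes = len(text.encode("utf-8"))
    tokens = estimate_tokens(text)
    return {
        "size_bytes": size_bytes,
        "estimated_tokens": tokens,
        "fits_single_upload": size_bytes <= MAX_UPLOAD_BYTES,
        "fits_assistants_file_size": size_bytes <= MAX_ASSISTANTS_FILE_BYTES,
        "fits_assistants_token_limit": tokens <= MAX_ASSISTANTS_TOKENS,
    }


if __name__ == "__main__":
    sample = "word " * 1_000_000  # about 5 million characters of dummy text
    print(check_limits(sample))
```

Under these assumptions, 2 million tokens corresponds to roughly 8 million characters, or about 8 MB of plain ASCII text. So if both limits applied to the same text, the token limit would be reached long before 512 MB.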

Azure OpenAI Service
An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.

1 answer

  1. Charlie Wei 3,335 Reputation points
    2024-02-27T15:26:02.01+00:00

    Hello Karl-Johan Olofsson,

    The information you provided regarding the Assistants API is accurate; it is a new feature that was just released this month.

    If your main concerns are "connecting to my own data" and "how much data", then the On Your Data feature might be a more suitable option for you. Its data is stored in Azure Blob Storage and Azure AI Search, so in practice you can treat it as supporting an effectively unlimited amount of data.

    Best regards,
    Charlie


    If you find my response helpful, please consider accepting this answer and voting 'yes' to support the community. Thank you!

