An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.
Hello Sarvi,
The text-embedding-3-large model has a maximum request size of 8,191 tokens per single call, as referenced in this document.
Additionally, it has a request limit of 350K Tokens-Per-Minute (TPM), as further detailed in this documentation.
Best regards,
Charlie
If you find my response helpful, please consider accepting this answer and voting yes to support the community. Thank you!