Maximum batch size and input size of the Azure OpenAI text-embedding-3-large model

Sarvi 0 Reputation points
2024-05-08T12:32:22.04+00:00

I am using the text-embedding-3-large model from Azure OpenAI. What are the maximum batch size and the token limit per batch for this model?

Azure OpenAI Service
An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.

1 answer

  1. Charlie Wei 3,335 Reputation points
    2024-05-08T15:00:35.0766667+00:00

    Hello Sarvi,

    The text-embedding-3-large model has a maximum request size of 8,191 tokens per single call, as referenced in this document.

    Additionally, it has a request limit of 350K Tokens-Per-Minute (TPM), as further detailed in this documentation.
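    As a rough illustration (not part of the official documentation), here is a minimal Python sketch of how you might batch inputs so each item stays under that per-call token limit. It assumes the `openai` Python SDK (v1.x) and `tiktoken`; the deployment name "text-embedding-3-large", the environment variable names, and the batch size are placeholders for your own setup.

    ```python
    import os
    import tiktoken
    from openai import AzureOpenAI

    MAX_TOKENS_PER_INPUT = 8191  # per-call token limit mentioned above

    # Placeholder endpoint/key environment variables; substitute your own.
    client = AzureOpenAI(
        azure_endpoint=os.environ["AZURE_OPENAI_ENDPOINT"],
        api_key=os.environ["AZURE_OPENAI_API_KEY"],
        api_version="2024-02-01",
    )

    # text-embedding-3 models use the cl100k_base tokenizer.
    encoding = tiktoken.get_encoding("cl100k_base")

    def embed_batched(texts, batch_size=16):
        """Embed texts in small batches, skipping any input over the token limit."""
        vectors = []
        for start in range(0, len(texts), batch_size):
            batch = [
                t for t in texts[start:start + batch_size]
                if len(encoding.encode(t)) <= MAX_TOKENS_PER_INPUT
            ]
            if not batch:
                continue
            response = client.embeddings.create(
                model="text-embedding-3-large",  # your deployment name
                input=batch,
            )
            vectors.extend(item.embedding for item in response.data)
        return vectors
    ```

    Keeping batches small also helps you stay under the tokens-per-minute quota; you can add a short pause or retry-with-backoff between batches if you hit 429 responses.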

    Best regards,
    Charlie


    If you find my response helpful, please consider accepting this answer and voting yes to support the community. Thank you!

