I understand you are experiencing a rate limit issue when trying to utilize ChatCompletions_Create Operation under Azure OpenAI API version 2024-10-01-preview.
6 RPM per 1000 TPM.
Depending on the configuration of your deployment your TPM may be set too low. To address your problem look at increasing your Token per minute in the Azure AI Portal. This will increase the allowed RPM to ensure you hit less rate limits located in Deployments | <select deployment> | Edit.