Hello @Anil Guliyaan Thank you for contacting us on Microsoft Q&A platform. Happy to help!
If the APIs in the below discussion are Query or Forecast, we have been seeing an increase in customers abusing the shared capacity and by that impacting the experience of other customers. Due to that, we revisited our rate limiting.
Here are the response headers that you should look for in case of a 429 response. They will tell you how long to wait before making the next call.
x-ms-ratelimit-microsoft.costmanagement-qpu-retry-after
x-ms-ratelimit-microsoft.costmanagement-entity-retry-after
x-ms-ratelimit-microsoft.costmanagement-tenant-retry-after
x-ms-ratelimit-microsoft.costmanagement-client-retry-after
The rate limits are not specific to a user – and limits are shared with other callers/users in the tenant/account. We can recommend the right APIs if we understand what you are trying to achieve.
Receiving a 429 usually means that we had to protect our service from the caller. Of course, every case can have some recommendations (different calling patterns, different API to use, etc).
I looked at the calling pattern for the tenant id “02cf-f8-48-20-a**b20****5b”, you are not making a lot of calls. You are being rate limited by our ClientType filter (which allows 2000 calls per minute per ClientType). And since you are not providing a ClientType in the request headers, you will share the number with ALL customers that don’t provide a ClientType.
In short, can you please provide a clientType value when making an api call.
Example:
“ClientType” : “AwesomeType”
PS: Please choose one type value for all your calls, that will help us investigate any issue you face in the future.
Hope this helps. Please write back if you need any further assistance!
If the response helped, do "Accept Answer" and up-vote it