@Girish Luckhun (RAPP) Thanks for reaching out. APIM has a built-in request queue: requests that cannot be processed immediately are held in the queue until the gateway is free to handle them; they are not cached. When an APIM instance reaches its physical capacity (capacity = request queue + memory + CPU), it behaves like any overloaded web server that cannot keep up with incoming requests: latency increases, connections get dropped, timeout errors occur, and so on. It is therefore important to monitor the capacity metric of your APIM instance and consider scaling out or upgrading when the value stays above a certain threshold for a sustained period: https://learn.microsoft.com/en-us/azure/api-management/api-management-capacity
The built-in cache and an external cache are separate and do not combine into one larger cache. So if you have a 1 GB built-in cache and add a 5 GB external Redis cache, you end up with two separate caches, one of 1 GB and one of 5 GB, not a single 6 GB cache. You can configure individual APIs and operations in APIM to use either the built-in cache or the external cache as needed: https://learn.microsoft.com/en-us/azure/api-management/api-management-howto-cache-external
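For illustration, here is a minimal policy sketch (applied to an API or operation) that points response caching at the external cache via the caching-type attribute of the cache-lookup policy, whose valid values are internal, external, and prefer-external. The duration and vary-by settings below are just example values:

```xml
<policies>
    <inbound>
        <base />
        <!-- Look up the response in the external (Redis) cache;
             use caching-type="internal" to target the built-in cache instead -->
        <cache-lookup vary-by-developer="false"
                      vary-by-developer-groups="false"
                      caching-type="external"
                      downstream-caching-type="none">
            <vary-by-header>Accept</vary-by-header>
        </cache-lookup>
    </inbound>
    <outbound>
        <base />
        <!-- Store the backend response for 1 hour (3600 seconds) -->
        <cache-store duration="3600" />
    </outbound>
</policies>
```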
APIM also provides throttling, which lets you limit the number of requests made to your APIs. Throttling policies can be applied at the global, product, API, or operation scope, as shown in the sketch below: https://learn.microsoft.com/en-us/azure/api-management/api-management-sample-flexible-throttling
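As a sketch, the flexible-throttling policies described in the article above could look like this in the inbound section at the scope you choose; the call counts, renewal periods, and the choice of caller IP address as the counter key are illustrative:

```xml
<policies>
    <inbound>
        <base />
        <!-- Rate limit: at most 10 calls per 60 seconds per caller IP address -->
        <rate-limit-by-key calls="10"
                           renewal-period="60"
                           counter-key="@(context.Request.IpAddress)" />
        <!-- Quota: at most 1,000,000 calls per week (604,800 seconds) per caller IP address -->
        <quota-by-key calls="1000000"
                      renewal-period="604800"
                      counter-key="@(context.Request.IpAddress)" />
    </inbound>
</policies>
```

The simpler rate-limit and quota policies work the same way but count per subscription, whereas the *-by-key variants let you pick any counter key (IP address, user identity, a request header, and so on).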
I hope this answers your question. Let me know if you have any other questions!