Limiting Number of Requests or Queuing Process on Azure Web App (App Service Plan P2V3)

Niket Kumar Singh 260 Reputation points
2024-06-10T17:12:00.0333333+00:00

I am using an Azure Web App on an App Service Plan P2V3 and am looking to limit the number of incoming requests or manage the queuing process within the web app service.

In Windows IIS, we can set the application pool queue length to control this. Is there a similar feature available for Azure App Services?
User's image

From my understanding of the available documentation: but not sure i'm going in correct direction:

The maximum IP connections are per instance and depend on the instance size:

  • 1,920 per B1/S1/P1V3 instance
    • 3,968 per B2/S2/P2V3 instance
      • 8,064 per B3/S3/P3V3 instance

Additionally, the maxConcurrentRequestsPerCPU setting specifies how many simultaneous requests ASP.NET allows per CPU:

  • Small (A1): 7,500 requests
  • Medium (A2): 7,500 requests (x2 cores = 15,000)
  • Large (A3): 18,750 requests (x4 cores = 75,000)

Given these settings, I have the following questions:

  1. Is it possible to configure a specific limit on the number of incoming requests for an Azure Web App?
  2. Can we limit or control the queuing process in a similar way to the IIS application pool queue length?
  3. What configuration changes are required to achieve this, if possible?
  4. Can scaling up (increasing instance size) help in managing request limits effectively?
  5. Are there any best practices or configurations to achieve this limit.
    I saw arcticle using HttpClientFactory for managing HttpClient instances as suggested in this documentation, can be helpful, i'm not sure whter it is correct or not?

Any insights or recommendations on managing request limits and queuing for an Azure Web App would be greatly appreciated

Azure App Configuration
Azure App Configuration
An Azure service that provides hosted, universal storage for Azure app configurations.
210 questions
Azure App Service
Azure App Service
Azure App Service is a service used to create and deploy scalable, mission-critical web apps.
7,187 questions
{count} votes