Hi anonymous user,
Here are updates which I have received internally -
- There’s not enough detail to be sure if this is present in your scenario, but we had a scenario other user was difficulty scale testing due to Client Affinity being enabled at their load balancer and so all the load testing clients were bound to the first couple of instances and the new scaled-out instances went unused. If you look at the individual CPU usage per app service instance and they’re severely unbalanced, you might be affected by it.
- Yes, it’s a known limitation and there is no ETA or workaround for that right now. How many rows/documents each API call will return? How much data?
I believe for point reads it shouldn’t be a problem and you would adjust the RUs to support the necessary concurrency.
Please let me know if you have any other questions.
Thanks
Saurabh