ADLS Gen 2 ErrorCode=UserErrorDataStoreServiceThrottling

Lokesh 211 Reputation points
2022-10-26T08:05:31.347+00:00

We have noticed that our QA ADLS storage account has intermittent performance issues, one of which involves throttling and timeouts.

We raised a support ticket earlier but did not get a satisfactory explanation.

Could anyone from Microsoft work with us to resolve this?

ErrorCode=UserErrorDataStoreServiceThrottling,'Type=Microsoft.DataTransfer.Common.Shared.DataStoreThrottlingException,Message=Failed with potential throttling error when accessing AzureBlobFS at side. You are suggested to check and increase the allowed request rate for the data store, or reduce the concurrent workload. More details please refer https://learn.microsoft.com/en-us/azure/data-factory/copy-activity-performance-troubleshooting.. Details: Account: 'xxxx'. FileSystem: 'xxx'. ErrorCode: 'OperationTimedOut'. Message: 'Operation could not be completed within the specified time.'. RequestId: 'af91c927-301f-0058-6732-e89460000000'. TimeStamp: 'Tue, 25 Oct 2022 05:28:29 GMT'.. ,Source=Microsoft.DataTransfer.ClientLibrary,''Type=Microsoft.Azure.Storage.Data.Models.ErrorSchemaException,Message=Operation returned an invalid status code 'InternalServerError',Source=Microsoft.DataTransfer.ClientLibrary,'

Azure Data Lake Storage

An Azure service that provides an enterprise-wide hyper-scale repository for big data analytic workloads and is integrated with Azure Blob Storage.

Azure Storage

Globally unique resources that provide access to data management services and serve as the parent namespace for the services.

1 answer

  1. MartinJaffer-MSFT 26,161 Reputation points
    2022-10-27T02:50:31.973+00:00

    Hello and welcome to Microsoft Q&A, @Lokesh. I am sorry your support ticket didn't yield a satisfactory explanation. If you share the ticket number with me, I can take a look.

    Taking apart this error, I see several things:

    Failed with potential throttling error when accessing AzureBlobFS at side

    This suggests that so many read/write requests arrived at once that a limit was hit on the storage service. The default maximum request rate is 20,000 requests per second per storage account for standard Azure Storage accounts, and 500 requests per second for a single blob. You can open the storage account's metrics to see whether you get anywhere near these limits.

    Usually, it would say "at Source side" or "at Sink side". Just "at side" doesn't seem right.

    Operation could not be completed within the specified time

    This one is a little more ambiguous. Does it mean the pipeline activity timed out, or that some underlying service timed out? If it is the pipeline activity, that happens when none of the retries succeeds. How long did this pipeline run? How long did this activity run? The answers would give better hints.

    Operation returned an invalid status code 'InternalServerError'

    An internal server error means something broke server-side and the request could not be meaningfully completed. The cause behind an internal server error is not something I can look into myself. If you tell me your support ticket number, I can ask the support team to look into the logs using that RequestId.

    Microsoft.Azure.Storage.Data.Models.ErrorSchemaException

    This sounds like there is something wrong with the dataset schema.

    So if I had to piece together a story from these errors, I would guess...
    You have a ForEach loop containing many copy activities running in parallel. Something is corrupted in the schema, so the copy fails. The copy activity retries until it times out. Meanwhile, there are so many requests that some get throttled.
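
The request-rate figures quoted in the answer can be turned into a quick headroom check. The following is only a minimal sketch: it assumes you have copied per-minute transaction counts out of the storage account's metrics by hand, and the `headroom` function, the 80% warning threshold, and the sample numbers are all hypothetical. Per-minute totals average out short bursts, so a result of "ok" does not rule out brief throttling.

```python
# Limits as quoted in the answer; confirm them for your own account type.
ACCOUNT_LIMIT_RPS = 20_000  # requests per second per storage account
BLOB_LIMIT_RPS = 500        # requests per second for a single blob


def headroom(transactions_per_minute, limit_rps, warn_ratio=0.8):
    """Return (peak_rps, ratio, status) for a list of per-minute counts.

    Hypothetical helper: the input is assumed to be exported manually
    from the storage account metrics, one total per minute.
    """
    if not transactions_per_minute:
        raise ValueError("need at least one sample")
    # Convert the busiest minute into an average requests-per-second rate.
    peak_rps = max(transactions_per_minute) / 60.0
    ratio = peak_rps / limit_rps
    if ratio >= 1.0:
        status = "over limit"
    elif ratio >= warn_ratio:
        status = "close to limit"
    else:
        status = "ok"
    return peak_rps, ratio, status


if __name__ == "__main__":
    samples = [120_000, 450_000, 1_020_000]  # made-up per-minute totals
    peak, ratio, status = headroom(samples, ACCOUNT_LIMIT_RPS)
    print(f"peak ~{peak:.0f} req/s, {ratio:.0%} of limit: {status}")
```

The same function can be pointed at `BLOB_LIMIT_RPS` if many activities read or write one file.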

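The error message suggests reducing the concurrent workload, and the guess above involves retries piling up. One common way to keep retries from adding to the load is exponential backoff with jitter. This is a generic Python sketch, not Data Factory code: `ThrottledError` and `call_with_backoff` are hypothetical names, and the delays are illustrative. In Data Factory itself, the comparable knobs would be the ForEach batch count and the activity's retry count and retry interval.

```python
import random
import time


class ThrottledError(Exception):
    """Hypothetical stand-in for a throttling or timeout response."""


def call_with_backoff(operation, max_attempts=5, base_delay=1.0,
                      max_delay=30.0, sleep=time.sleep):
    """Call operation(), retrying on ThrottledError with jittered backoff.

    The wait before retry k is drawn uniformly from
    [0, min(max_delay, base_delay * 2**k)], so parallel callers
    do not all retry at the same instant.
    """
    for attempt in range(max_attempts):
        try:
            return operation()
        except ThrottledError:
            if attempt == max_attempts - 1:
                raise  # out of attempts: surface the error to the caller
            cap = min(max_delay, base_delay * 2 ** attempt)
            sleep(random.uniform(0, cap))
```

Passing `sleep` as a parameter is only there so the function can be exercised without real waiting.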