Hi @Sivangnanam, Kuraliniyan (AWM, IRL),
Thanks for reaching out to Microsoft Q&A.
##[error]Agent failed with exception: Task agent exited with exit code 137
##[warning]Received request to deprovision: The request was cancelled by the remote provider.
Possible causes of the error above:
- Exit code 137 is 128 + 9, which means the process received SIGKILL. On Linux this most often happens when a process exceeds the available memory and the operating system (or the container runtime) kills it to reclaim resources. Microsoft-hosted agents have a fixed amount of memory, so memory-heavy tasks are especially prone to this.
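- You mentioned setting a timeout of more than three days in your pipeline, but other limits can still end the job earlier. On Microsoft-hosted agents a job can run for at most 360 minutes (less on the free tier) regardless of the configured timeout; only self-hosted agents can run longer than that.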
- If you're using Azure Databricks or a similar service with cluster autoscaling, the job could be cancelled as a result of a scaling decision. Make sure your cluster settings are suited to long-running jobs.
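To make the first cause concrete: exit codes above 128 conventionally mean 128 plus the number of the signal that ended the process. Here is a minimal Python sketch of that decoding, assuming a Linux agent. The helper name `describe_exit_code` is made up for illustration and is not part of Azure Pipelines.

```python
import signal


def describe_exit_code(code: int) -> str:
    """Explain an exit code using the common shell convention 128 + signal number."""
    if code > 128:
        signum = code - 128
        try:
            # Look up the symbolic name of the signal on this platform.
            name = signal.Signals(signum).name
        except ValueError:
            name = f"unknown signal {signum}"
        return f"exit code {code}: terminated by {name} (signal {signum})"
    return f"exit code {code}: process exited on its own"


# On Linux this prints: exit code 137: terminated by SIGKILL (signal 9)
print(describe_exit_code(137))
```

SIGKILL cannot be caught by the process, so the task has no chance to log anything before it dies. That is why the pipeline log only shows the bare exit code.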
Follow the steps below to resolve the issue:
- If your pipeline runs resource-intensive tasks, consider switching to self-hosted agents, which can be provisioned with more powerful hardware tailored to your project's requirements. The Azure Pipelines documentation has more on self-hosted agents.
- To get a clearer picture of the issue, enable verbose logs for your pipeline, for example by setting the pipeline variable system.debug to true. The extra detail, including resource usage, can help you identify what is causing the termination.
- For long-running jobs, use dedicated clusters to avoid issues with autoscaling that might interrupt your workload.
- Ensure that no external processes or configurations like autoscaling settings are canceling your job unintentionally.
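If the verbose logs are not enough to tell which step is using the memory, one option is to wrap the suspect command and record its peak memory. Below is a rough Python sketch, assuming a Linux agent with Python 3.9 or later. The function name `run_and_report_peak_memory` and the sample command are placeholders, not anything taken from your pipeline.

```python
import resource
import subprocess
import sys


def run_and_report_peak_memory(cmd: list[str]) -> tuple[int, float]:
    """Run cmd and return (exit code, peak resident memory of child processes in MiB)."""
    completed = subprocess.run(cmd)
    # RUSAGE_CHILDREN covers child processes that have finished and been waited for.
    # ru_maxrss is the largest peak among them, in kilobytes on Linux (assumed here).
    peak_kib = resource.getrusage(resource.RUSAGE_CHILDREN).ru_maxrss
    # Note: Python reports a child killed by a signal as a negative return code
    # (for example -9 for SIGKILL), whereas a shell would report 137.
    return completed.returncode, peak_kib / 1024


if __name__ == "__main__":
    # Placeholder workload that allocates roughly 50 MiB; replace with the real command.
    sample = [sys.executable, "-c", "x = b'x' * (50 * 1024 * 1024)"]
    code, peak_mib = run_and_report_peak_memory(sample)
    print(f"exit code {code}, peak memory about {peak_mib:.0f} MiB")
```

Comparing the reported peak with the memory available on the agent should show whether the job is close to the limit, and therefore whether a larger self-hosted agent is likely to help.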
Hope it helps!
Please do not forget to click "Accept the answer" and "Yes" if this helped; it can be beneficial to other community members.
If you have any other questions, let me know in the "comments" and I would be happy to help you.