Azure ML online endpoint suddenly returns timeout

Albert Garriga Porqueras 11 Reputation points
2022-10-17T15:42:17.19+00:00

Hi all,

We have deployed a managed online endpoint in Azure ML, and at first it works fine. However, after a few days, the exact same request takes much longer to process and the endpoint times out (the HTTP status code returned is 504). We don't understand this behavior: we did not modify the endpoint, and the metrics don't show a large increase in CPU or memory usage. If we restart the endpoint, it works again for a few days and then starts failing in the same way. Has anyone faced the same issue? Were you able to solve it?
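
For anyone trying to reproduce this, here is a minimal sketch of a probe that sends the same request on a schedule and records the status code and latency, so the point where responses slow down or turn into 504s can be lined up with the endpoint metrics. It assumes key-based authentication with a Bearer header. The environment variable names `SCORING_URI` and `ENDPOINT_KEY`, the example payload, and the 5-second "slow" threshold are placeholders for illustration, not values from our deployment.

```python
import json
import os
import time
import urllib.error
import urllib.request


def classify(status, elapsed_s, slow_after_s=5.0):
    """Label one probe result. slow_after_s is an arbitrary illustrative threshold."""
    if status is None:
        return "client_timeout_or_network_error"
    if status == 504:
        return "gateway_timeout"
    if status >= 400:
        return "http_error"
    return "slow" if elapsed_s > slow_after_s else "ok"


def probe(scoring_uri, api_key, payload, timeout_s=90.0):
    """Send one scoring request and return (status, elapsed_seconds)."""
    request = urllib.request.Request(
        scoring_uri,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )
    start = time.monotonic()
    try:
        with urllib.request.urlopen(request, timeout=timeout_s) as response:
            response.read()
            status = response.status
    except urllib.error.HTTPError as err:
        status = err.code  # the server answered with an error status, e.g. 504
    except OSError:
        status = None  # no HTTP response at all (client-side timeout, DNS, reset)
    return status, time.monotonic() - start


if __name__ == "__main__":
    # Hypothetical variable names; set them to your endpoint's scoring URI and key.
    uri = os.environ.get("SCORING_URI")
    key = os.environ.get("ENDPOINT_KEY")
    if uri and key:
        status, elapsed = probe(uri, key, {"data": []})  # placeholder payload
        print(time.strftime("%Y-%m-%dT%H:%M:%S"), status, f"{elapsed:.2f}s",
              classify(status, elapsed))
```

Running this every few minutes from a scheduler and keeping the output gives a timeline of when the latency starts to grow, which is easier to compare against the CPU and memory charts than a single failed call.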

Thanks.

Azure Machine Learning

2 answers

  1. Hung Nguyen Thanh 16 Reputation points
    2022-10-29T14:10:52.19+00:00

    @Albert Garriga Porqueras
    I ran into the same error when invoking the model from the public internet. When I retried from a VM in the same VNet/subnet that the Azure Machine Learning workspace, Storage Account, and Container Registry connect to through private endpoints, it worked.
    My scenario is public inbound, and public egress network access is disabled.
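
To check which network settings an endpoint is actually running with, something like the following sketch may help. It assumes the `azure-ai-ml` (SDK v2) and `azure-identity` packages, and that the endpoint and deployment objects expose `public_network_access` and `egress_public_network_access` as in the legacy network isolation setup. The environment variable names are placeholders I made up for illustration.

```python
import os


def summarize_network_access(endpoint, deployment):
    """Collect the inbound and egress flags from an endpoint and one deployment."""
    inbound = getattr(endpoint, "public_network_access", None)
    egress = getattr(deployment, "egress_public_network_access", None)
    return {
        "inbound_public": inbound,
        "egress_public": egress,
        # With public inbound access disabled, callers need a private network path.
        "inbound_requires_private_network": str(inbound).lower() == "disabled",
    }


if __name__ == "__main__":
    # Imported here so the helper above can be used without the Azure packages.
    from azure.ai.ml import MLClient
    from azure.identity import DefaultAzureCredential

    # Hypothetical variable names; fill in your own subscription and resource names.
    ml_client = MLClient(
        DefaultAzureCredential(),
        os.environ["AZ_SUBSCRIPTION_ID"],
        os.environ["AZ_RESOURCE_GROUP"],
        os.environ["AZ_ML_WORKSPACE"],
    )
    endpoint = ml_client.online_endpoints.get(name=os.environ["ENDPOINT_NAME"])
    deployment = ml_client.online_deployments.get(
        name=os.environ["DEPLOYMENT_NAME"],
        endpoint_name=os.environ["ENDPOINT_NAME"],
    )
    print(summarize_network_access(endpoint, deployment))
```

If the inbound flag comes back as disabled, a call from the public internet is not expected to succeed, and testing from a VM inside the VNet (as I did) is the comparison to make.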


  2. Deleted

    This answer has been deleted due to a violation of our Code of Conduct. The answer was manually reported or identified through automated detection before action was taken. Please refer to our Code of Conduct for more information.

