Can't deploy a real time endpoint in azure Auto ML

Question

I've trained a model using Auto ML and I want to try to deploy it to a Kubernetes service, I've got a pretty simple inference cluster. But no matter how low I set the CPU and memory reserve capacity it always says there isn't enough resources to deploy. Do I need to upgrade my cluster or is this a known issue with a current solution? It seems to me to be quite random as other times I was able to deploy other models with no problems at all. And even if I try to deploy it as a container instance it gets stuck in transitioning state indefinetely . Hope someone might clarify this issue and propose some kind of workaround.

Answer

Hello,

Thanks for the details, I just found a same issue for this abnormal computer resource issue. The solution of the customer isrewriting all the code in Python and using the AKSWebService.update_endpoint method to update the endpoint without having to delete it each time (which was happening using the Model.deploy method).

https://learn.microsoft.com/en-us/answers/questions/249335/azure-machine-learning-update-realtime-endpoint.html

Could you please take a look and have a try?

Regards,
Yutong

Share via

Can't deploy a real time endpoint in azure Auto ML

1 answer