An Azure machine learning service for building and deploying models.
Hi Issac,
short answers:
- Yes, billing begins once a managed online endpoint deployment is running - you are billed continuously for the (VM) resources that are provisioned for that deployment.
- You can reduce costs by lowering instance_count, autoscaling, and downsizing - but you can not pause to zero without deleting or switching to a different deployment pattern.
Source: https://learn.microsoft.com/en-us/azure/machine-learning/concept-endpoints?view=azureml-api-2, search for "Scaling compute to zero".
Hope that helps, best regards! :)