How is Managed Compute on Azure AI Studio actually billed?

Jason Carr 0 Reputation points
2024-07-18T09:37:38.2766667+00:00

Can anyone clarify how Managed Compute in Azure AI Studio is billed?

Is managed compute active 24/7 or does it go into a suspended state automatically?

If managed compute is active constantly, this would mean being billed even when there is no usage, and would be very expensive to run for smaller applications.

Azure Machine Learning
Azure Machine Learning
An Azure machine learning service for building and deploying models.
2,880 questions
Azure
Azure
A cloud computing platform and infrastructure for building, deploying and managing applications and services through a worldwide network of Microsoft-managed datacenters.
1,039 questions
Azure AI services
Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.
2,826 questions
0 comments No comments
{count} votes

1 answer

Sort by: Most helpful
  1. Amira Bedhiafi 24,336 Reputation points
    2024-07-20T14:21:17.2+00:00

    Managed Compute in Azure AI Studio is billed based on the resources provisioned and the time they are in use. The billing encompasses several components, including compute resources, storage, and networking. Compute resources such as virtual machines and GPUs are billed according to their type, size, and the duration they remain active. Additionally, storage for datasets, models, and other artifacts incurs charges, covering both the storage space and associated read/write transactions. Data transfer costs may also apply, especially if there is significant ingress or egress traffic related to your AI workloads.

    When it comes to the compute state, managed compute resources are billed while they are active, which includes both task performance time and idle periods if they are not explicitly stopped. However, the service configuration may include options for auto-scaling and auto-shutdown to manage costs. These features help to suspend or deallocate resources during periods of inactivity, potentially reducing the overall cost.

    To manage costs effectively, you can utilize auto-scaling to dynamically match compute resources to workload demands, avoiding over-provisioning and reducing expenses during low activity periods. Setting up auto-shutdown policies can also help by automatically deallocating resources when they are not in use, which is particularly beneficial for development and testing environments that do not require 24/7 operation.

    0 comments No comments

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.