Estimate the cost of archiving data
The archive tier is an offline tier for storing data that is rarely accessed. The archive access tier has the lowest storage cost. However, this tier has higher data retrieval costs with a higher latency as compared to the hot, cool and cold tiers.
This article explains how to calculate the cost of using archive storage and then presents a few example scenarios.
Calculate costs
The cost to archive data is derived from these three components:
- Cost to write data to the archive tier
- Cost to store data in the archive tier
- Cost to rehydrate data from the archive tier
The following sections show you how to calculate each component.
This article uses fictitious prices in all calculations. You can find these sample prices in the Sample prices section at the end of this article. These prices are meant only as examples, and shouldn't be used to calculate your costs.
For official prices, see Azure Blob Storage pricing or Azure Data Lake Storage pricing. For more information about how to choose the correct pricing page, see Understand the full billing model for Azure Blob Storage.
The cost to write
You can calculate the cost of writing to the archive tier by multiplying the number of write operations by the price of each operation. The price of an operation depends on which ones you use to write data to the archive tier.
Put Blob
If you use the Put Blob operation, then the number of operations is the same as the number of blobs. For example, if you plan to write 30,000 blobs to the archive tier, then that will require 30,000 operations. Each operation is charged the price of an archive write operation.
Tip
Operations are billed per 10,000. Therefore, if the price per 10,000 operations is $0.10, then the price of a single operation is $0.10 / 10,000 = $0.00001.
Put Block and Put Block List
If you upload a blob by using the Put Block and Put Block List operations, then an upload will require multiple operations, and each of those operations are charged separately. Each Put Block operation is charged at the price of a write operation for the accounts default access tier. The number of Put Block operations that you need depends on the block size that you specify to upload the data. For example, if the blob size is 100 MiB and you choose block size to 10 MiB when you upload that blob, you would use 10 Put Block operations. Blocks are written (committed) to the archive tier by using the Put Block List operation. That operation is charged the price of an archive write operation. Therefore, to upload a single blob, your cost is (number of blocks * price of a hot write operation) + price of an archive write operation.
Note
If you're not using an SDK or the REST API directly, you might have to investigate which operations your data transfer tool is using to upload files. You might be able to determine this by reaching out the tool provider or by using storage logs.
Set Blob Tier
If you use the Set Blob Tier operation to move a blob from the cool, cold, or hot tier to the archive tier, you're charged the price of an archive write operation.
The cost to store
You can calculate the storage costs by multiplying the size of the data in GB by the price of archive storage.
For example (assuming the sample pricing), if you plan to store 10 TB archived blobs, the capacity cost is $0.00099 * 10 * 1024 = $10.14 per month.
The cost to rehydrate
Blobs in the archive tier are offline and can't be read or modified. To read or modify data in an archived blob, you must first rehydrate the blob to an online tier (either the hot cool, or cold tier).
You can calculate the cost to rehydrate data by adding the cost to retrieve data to the cost of reading the data.
Assuming sample pricing, the cost of retrieving 1 GB of data from the archive tier would be 1 * $0.02 = $0.02.
Read operations are billed per 10,000. Therefore, if the cost per 10,000 operations is $5.00, then the cost of a single operation is $5.00 / 10,000 = $0.0005. The cost of reading 1000 blobs at standard priority is 1000 * $0.0005 = $0.50.
In this example, the total cost to rehydrate (retrieving + reading) would be $0.02 + $0.50 = $0.52.
Note
If you set the rehydration priority to high, then the data retrieval and read rates increase.
If you plan to rehydrate data, you should try to avoid an early deletion fee. To review your options, see Blob rehydration from the archive tier.
Scenario: One-time data backup
This scenario assumes that you plan to remove on-premises tapes or file servers by migrating backup data to cloud storage. If you don't expect users to access that data often, then it might make sense to migrate that data directly to the archive tier. In the first month, you'd assume the cost of writing data to the archive tier. In the remaining months, you'd pay only for the cost to store the data and the cost to rehydrate data as needed for the occasional read operation.
Using the Sample prices that appear in this article, the following table demonstrates three months of spending.
This scenario assumes an initial ingest of 2,000,000 files totaling 102,400 GB in size to archive. It also assumes one-time read each month of about 1% of archived capacity. The operation used this scenario is the Put Blob operation.
Cost factor | January | February | March | Projected annual |
---|---|---|---|---|
Write transactions | 2,000,000 | 0 | 0 | 2,000,000 |
Price of a single write operation | $0.00001 | $0.00001 | $0.00001 | $0.00001 |
Cost to write (transactions * price of a write operation) | $20.00 | $0.00 | $0.00 | $20.00 |
Total file size (GB) | 102,400 | 102,400 | 102,400 | 1,228,800 |
Data prices (pay-as-you-go) | $0.00099 | $0.00099 | $0.00099 | $0.00099 |
Cost to store (file size * data price) | $101.38 | $101.38 | $101.38 | $1,216.51 |
Data retrieval size | 1,024 | 1,024 | 1,024 | 12,288 |
Price of data retrieval | $0.02 | $0.02 | $0.02 | $0.02 |
Number of read transactions (File count * 1%) | 20,000 | 20,000 | 20,000 | 240,000 |
Price of a single read operation | $0.0005 | $0.0005 | $0.0005 | $0.0005 |
Cost to rehydrate (cost to retrieve + cost to read) | $30.48 | $30.48 | $30.48 | $365.76 |
Total cost | $151.86 | $131.86 | $131.86 | $1,602.27 |
Tip
To view these costs over 12 months, open the One-Time Backup tab of this workbook. You can modify the values in that worksheet to estimate your costs.
Scenario: Continuous tiering
This scenario assumes that you plan to periodically move data to the archive tier. Perhaps you're using Blob Storage inventory reports to gauge which blobs are accessed less frequently, and then using lifecycle management policies to automate the archival process.
Each month, you'd assume the cost of writing to the archive tier. The cost to store and then rehydrate data would increase over time as you archive more blobs.
Using the Sample prices that appear in this article, the following table demonstrates three months of spending.
This scenario assumes a monthly ingest of 200,000 files totaling 10,240 GB in size to archive. It also assumes a one-time read each month of about 1% of archived capacity. The operation used this scenario is the Put Blob operation.
Cost factor | January | February | March | Projected annual |
---|---|---|---|---|
Write transactions | 200,000 | 200,000 | 200,000 | 2,400,000 |
Price of a single write operation | $0.00001 | $0.00001 | $0.00001 | $0.00001 |
Cost to write (transactions * price of a write operation)> | $2.00 | $2.00 | $2.00 | $24.00 |
Total file size (GB) | 10,240 | 20,480 | 39,720 | 122,880 |
Data prices (pay-as-you-go) | $0.00099 | $0.00099 | $0.00099 | $0.00099 |
Cost to store (file size * data price) | $10.14 | 20.28> | $30.41> | $790.73 |
Price of data retrieval | $0.02 | $0.02 | $0.02 | $0.02 |
Number of read transactions (File count * 1% storage read) | 2,000 | 4,000 | 6,000 | 156,000 |
Price of a single read operation | $0.0005 | $0.0005 | $0.0005 | $0.0005 |
Cost to rehydrate (cost to retrieve + cost to read) | $3.05 | $6.10 | $9.14 | $237.74 |
Total cost | $15.19 | $28.37 | $41.56 | $1,052.48 |
Tip
To view these costs over 12 months, open the Continuous Tiering tab of this workbook. You can modify the values in that worksheet to estimate your costs.
Archive versus cold and cool
Archive storage is the lowest cost tier. However, it can take up to 15 hours to rehydrate 10 GiB files. To learn more, see Blob rehydration from the archive tier. The archive tier might not be the best fit if your workloads must read data quickly. The cool tier offers a near real-time read latency with a lower price than that the hot tier. Understanding your access requirements will help you to choose between the cool, cold, and archive tiers.
The following table compares the cost of archive storage with the cost of cool and cold storage by using the Sample prices that appear in this article. This scenario assumes a monthly ingest of 200,000 files totaling 10,240 GB in size to archive. It also assumes 1 read each month about 10% of stored capacity (1024 GB), and 10% of total transactions (20,000).
Cost factor | Archive | Cold | Cool |
---|---|---|---|
Write transactions | 200,000 | 200,000 | 200,000 |
Price of a single write operation | $0.00001 | $0.000018 | $0.00001 |
Cost to write (transactions * price of a write operation) | $2.00 | $3.60 | $2.00 |
Total file size (GB) | 10,240 | 10,240 | 10,240 |
Data prices (pay-as-you-go) | $0.00099 | $0.0036 | $0.0152 |
Cost to store (file size * data price) | $10.14 | $36.86 | $155.65 |
Data retrieval size | 1,024 | 1,024 | 1,024 |
Price of data retrieval per GB | $0.02 | $0.03 | $0.01 |
Number of read transactions | 20,000 | 20,000 | 20,000 |
Price of a single read operation | $0.0005 | $0.00001 | $0.000001 |
Cost to rehydrate (cost to retrieve + cost to read) | $30.48 | $30.92 | $10.26 |
Monthly cost | $42.62 | $71.38 | $167.91 |
Tip
To view live formula of these costs, open the Choose Tiers tab of this workbook. You can modify the values in that worksheet to estimate your costs.
The following chart shows the impact on monthly spending given various read percentages. This chart assumes a monthly ingest of 1,000,000 files totaling 10,240 GB in size.
For example, the second pair of bars assumes that workloads read 100,000 files (10% of 1,000,000 files) and 1,024 GB (10% of 10,240 GB). Assuming the sample pricing, the estimated monthly cost of cool storage is $175.99 and the estimated monthly cost of archive storage is $90.62.
This chart shows a break-even point at or around the 25% read level. After that level, the cost of archive storage begins to rise relative to the cost of cool storage.
Sample prices
This article uses the following fictitious prices.
Important
These prices are meant only as examples, and should not be used to calculate your costs.
Price factor | Archive | Cold | Cool |
---|---|---|---|
Price of write transactions (per 10,000) | $0.10 | $0.18 | $0.10 |
Price of a single write operation (cost / 10,000) | $0.00001 | $0.000018 | $0.00001 |
Data prices (pay-as-you-go) | $0.00099 | $0.0036 | $0.0152 |
Price of read transactions (per 10,000) | $5.00 | $0.10 | $0.01 |
Price of a single read operation (cost / 10,000) | $0.0005 | $0.00001 | $0.000001 |
Price of high priority read transactions (per 10,000) | $50.00 | N/A | N/A |
Price of data retrieval (per GB) | $0.02 | $0.03 | $0.01 |
Price of high priority data retrieval (per GB) | $0.10 | N/A | N/A |
For official prices, see Azure Blob Storage pricing or Azure Data Lake Storage pricing.
For more information about how to choose the correct pricing page, see Understand the full billing model for Azure Blob Storage.
Next steps
Feedback
https://aka.ms/ContentUserFeedback.
Coming soon: Throughout 2024 we will be phasing out GitHub Issues as the feedback mechanism for content and replacing it with a new feedback system. For more information see:Submit and view feedback for