Split partition function increased the size of the mdf file.

Question

Split partition function increased the size of the mdf file.

Surendra Adhikari 211

I had two partitions for a table. For one of the partitions I further split it into two partitions. This caused increase in the size of the mdf file. Why would the split of the partition increase the size of the mdf though there is no data added?
After splitting partition, I have first, second and third partition, the first being the oldest data and the third being the latest data. I merged the second partition with the first one which exists in different filegroup. But even after merging the file size of the ndf file of the first partition where now the second is also merged is not increased. I expected this file size to increase.

Dan Guzman 9,401 Reputation points

2020-10-29T09:54:34.29+00:00

Provide CREATE scripts of the partition function and scheme before you split and merged. This will provide details and clarity to better answer your question.
Surendra Adhikari 211 Reputation points

2020-10-29T10:18:43.417+00:00

CREATE PARTITION FUNCTION PartitionFuncCommissionHistoryByRowId (BIGINT)
AS RANGE RIGHT FOR VALUES (1701149);

CREATE PARTITION SCHEME PartitionSchemeCommissionHistoryByRowId
AS PARTITION PartitionFuncCommissionHistoryByRowId
TO (SECONDARY, [primary]);
Dan Guzman 9,401 Reputation points

2020-10-29T10:54:20.417+00:00

Thanks for the DDL (and kudos for semicolon statement terminators). What filegroup did you specify for the NEXT USED scheme prior to SPLIT and what was the boundary value added with SPLIT? Since the mdf file size increased during SPLIT, it seems PRIMARY was the NEXT USED filegroup and rows needed to be moved to the new partition to accommodate the new boundaries.
Surendra Adhikari 211 Reputation points

2020-10-29T11:04:11.32+00:00

ALTER PARTITION SCHEME PartitionSchemeCommissionHistoryByRowId NEXT USED [PRIMARY]
ALTER PARTITION FUNCTION PartitionFuncCommissionHistoryByRowId()SPLIT RANGE(43315513)

ALTER PARTITION SCHEME PartitionSchemeCommissionHistoryByRowId NEXT USED SECONDARY
ALTER PARTITION FUNCTION PartitionFuncCommissionHistoryByRowId()MERGE RANGE(1701149)

Accepted answer

1 additional answer

Your answer

Dan Guzman 9,401 Reputation points

2020-10-29T09:54:34.29+00:00

Provide CREATE scripts of the partition function and scheme before you split and merged. This will provide details and clarity to better answer your question.
Surendra Adhikari 211 Reputation points

2020-10-29T10:18:43.417+00:00

CREATE PARTITION FUNCTION PartitionFuncCommissionHistoryByRowId (BIGINT)
AS RANGE RIGHT FOR VALUES (1701149);

CREATE PARTITION SCHEME PartitionSchemeCommissionHistoryByRowId
AS PARTITION PartitionFuncCommissionHistoryByRowId
TO (SECONDARY, [primary]);
Dan Guzman 9,401 Reputation points

2020-10-29T10:54:20.417+00:00

Thanks for the DDL (and kudos for semicolon statement terminators). What filegroup did you specify for the NEXT USED scheme prior to SPLIT and what was the boundary value added with SPLIT? Since the mdf file size increased during SPLIT, it seems PRIMARY was the NEXT USED filegroup and rows needed to be moved to the new partition to accommodate the new boundaries.
Surendra Adhikari 211 Reputation points

2020-10-29T11:04:11.32+00:00

ALTER PARTITION SCHEME PartitionSchemeCommissionHistoryByRowId NEXT USED [PRIMARY]
ALTER PARTITION FUNCTION PartitionFuncCommissionHistoryByRowId()SPLIT RANGE(43315513)

ALTER PARTITION SCHEME PartitionSchemeCommissionHistoryByRowId NEXT USED SECONDARY
ALTER PARTITION FUNCTION PartitionFuncCommissionHistoryByRowId()MERGE RANGE(1701149)

Answer 1

ALTER PARTITION SCHEME PartitionSchemeCommissionHistoryByRowId NEXT USED [PRIMARY];  
ALTER PARTITION FUNCTION PartitionFuncCommissionHistoryByRowId()SPLIT RANGE(43315513);

The above statements created partition 3 on the PRIMARY filegroup and moved all rows with values >= 43315513 from the second partition, also on PRIMARY, to the new third partition. Additional space was needed because the same rows were present in both second and third partitions until the SPLIT operation completed and committed. The mdf file grew during the operation because the mdf file apparently didn't have enough space to accommodate both sets of rows side-by-side. That extra space became unused after the SPLIT completed.

ALTER PARTITION SCHEME PartitionSchemeCommissionHistoryByRowId NEXT USED SECONDARY;  
ALTER PARTITION FUNCTION PartitionFuncCommissionHistoryByRowId()MERGE RANGE(1701149);

The NEXT USED specification was not used by MERGE because no new partition was created. All rows from the second partition (on PRIMARY) are moved to the existing first partition (on SECONDARY). Since the SECONDARY ndf file didn't increase it size, it seems that filegroup already had enough unused space to accommodate the moved rows without growing.

Additional answers to questions in comments:

Was there a way I could have done to avoid the expansion of file on primary?

The easiest way is to SPLIT the function before inserting rows greater than or equal to the new boundary. No data needs to be moved in this case so the operation will be fast and not require additional space. This requires planning in advance according to your partitioning scenario.

Can I reduce the size of the file since it has expanded to the size which is not needed?

You can release unused space with DBCC SHRINKFILE. If you specify only the file name (e.g. DBCC SHRINKFILE('YourDataFile');), the file size will be reduced to the original size (if possible) by moving used pages from the end of the file as necessary and releasing space of unallocated pages from the end of the file. Be aware that this can introduce fragmentation, which is a concern mostly with spinning media. You can avoid fragmentation by first rebuilding clustered index on the second partition and then shrinking to the desired size with the TRUNCATEONLY option (e.g. DBCC SHRINKFILE('YourDatabase', 10000, TRUNCATEONLY);).

Surendra Adhikari 211 Reputation points

2020-10-29T14:19:22.443+00:00

I think this explanation has properly addressed my situation. Was there a way I could have done to avoid the expansion of file on primary?
Can I reduce the size of the file since it has expanded to the size which is not needed?
Erland Sommarskog 121.4K Reputation points MVP Volunteer Moderator

2020-10-29T23:06:35+00:00

As Dan says, the data has to be in two places for the duration of the operation.

You can shrink a file with DBCC SHRINKFILE, but don't to this. The story is the same if you rebuild the index on this partition.. Again the data has to be in two places for the duration of the operation.
Surendra Adhikari 211 Reputation points

2020-10-30T03:03:58.923+00:00

What if I first merge rows to secondary(old data partition) and then split the latest rows back to primary(active data partition)?
Dan Guzman 9,401 Reputation points

2020-10-30T10:16:59.583+00:00

@Surendra Adhikari , merge and then split will avoid the primary filegroup space but I suggest you generally recommend you plan as to avoid data movement completely during merge and split. This will also reduce logging and improve performance.

Answer 2

Uri Dimant 211

I assume you had non-empty partition, so splitting a non-empty partition is an expensive operation, requiring about 4 times the logging compared to DML.

Surendra Adhikari 211 Reputation points

2020-10-29T06:38:22.333+00:00

Yes, its a non-empty partition. Its non-empty so a split was done. How to split empty partition?
Uri Dimant 211 Reputation points

2020-10-29T07:37:36.507+00:00

Here you go
https://phoenixultd.wordpress.com/2014/08/18/how-to-split-non-empty-partitions-when-a-clustered-columnstore-index-exists-on-the-table/
Surendra Adhikari 211 Reputation points

2020-10-29T10:23:19.713+00:00

There is no column store index. I just split the partition because the data was getting huge and to keep less data in active partition and keep old data in another partition. The split and merge operations were successful but the the size of active partition mdf file increased while the size of the old partition file did not as I have explained in the post.

Share via

Split partition function increased the size of the mdf file.

1 additional answer

Your answer