SharePoint Server 2016 FileStream Implementation, reclaim space after migrate

Ariwibawa 2 Reputation points
2021-05-15T04:21:08.027+00:00

so I have a situation where our SharePoint DB grows very big which is double what shows up in the storage metric. I have re-run $web.RecalculateStorageMetrics() and it still show that much.

This is SharePoint Server 2016 and SQL Server 2016 Enterprise installation, it is used as a document repository.

  1. Most files are pdf and around 1/3 are DOCX.
  2. Just recently we implemented FS so we could reduce content DB size.
  3. After setup FS, we ran migrate command from PowerShell and after 3 weeks it is finally done.

This is where things get very confusing, our content database size currently at 2TB (from shrink database GUI) and 4% free space. Where in storage metric, our repository size is at 909.9GB (99.77% from total size). There is more than 1TB size that we didn't know who used it.

I thought that this is used by ex blob data that has been moved to FS, so I have tried several things:

  • run DBCC CHECKDB (I didn't see any error).
  • run DBCC CLEANTABLE to every 200 FileStream tables.
  • execute sp_filestream_force_garbage_collection (return 0).
  • rebuild index on filestream table.
  • run a full backup every week and run an incremental backup every day.
  • I have also done shrink with the option reorganize files before releasing.
  • Running "Microsoft.Data.SqlRemoteBlobs.Maintainer.exe -ConnectionStringName RBSMaintainerConnection -Operation GarbageCollection ConsistencyCheck ConsistencyCheckForStores -GarbageCollectionPhases rdo -ConsistencyCheckMode r -TimeLimit 1440" (I didn't see anything got deleted).

After much searching using this query "select sum(case when blob_data_inline is null then 1 else 0 end) CountFS, sum(case when blob_data_inline is null then blob_size else 0 end) SizeFS, sum(case when blob_data_inline is not null then 1 else 0 end) CountInline, sum(case when blob_data_inline is not null then blob_size else 0 end) SizeInline from mssqlrbs_filestream_data_1.rbs_filestream_data_x", I have determined that many spaces are being occupied by rbs_filestream_data_x. There are 2 column varbinary(max), one with FILESTREAM column (blob_data_filestream) and another without it (blob_data_inline). Each table has a max of 200k rows with around 50-70k filestream column and the rest go to inline column. That inline column size is around 900GB, while FS size is around the same size with blob folder property.

That filestream table is what keeping this DB so large, is this the correct FS implementation? I thought that all blob data would be moved out from content DB. But in this case, it still holds data in the content DB.

SQL Server
SQL Server
A family of Microsoft relational database management and analysis systems for e-commerce, line-of-business, and data warehousing solutions.
13,361 questions
SharePoint Server Management
SharePoint Server Management
SharePoint Server: A family of Microsoft on-premises document management and storage systems.Management: The act or process of organizing, handling, directing or controlling something.
2,900 questions
0 comments No comments
{count} votes

2 answers

Sort by: Most helpful
  1. Erland Sommarskog 107.1K Reputation points
    2021-05-16T21:54:19.863+00:00

    So I don't know anything about Sharepoint or the lingo it uses, but from an SQL Server perspective I like to point out that Filestream data is just as much part of the database as the other data. It is just stored in a different way.

    So if you move data from being stored in traditional BLOB columns, to be in filestream, the database will not change much in size. But you could expect that the size of the data file where you had the BLOBs to get a lot free space, so that you can shrink the data file in size.

    Then again, shrink has a bad reputation for being very slow when LOB columns are involved. Exactly hos this works here, when the data has been moved, I don't know.

    0 comments No comments

  2. Ariwibawa 2 Reputation points
    2021-08-26T19:08:14.187+00:00

    So I keep Running "Microsoft.Data.SqlRemoteBlobs.Maintainer.exe" for a couple of day, and I started to see some decrease size in content DB. It seems that maintainer is the one that clear blob from this content DB.

    There was an option when maintainer start to delete orphaned data (the default was 30 days).
    A couple of note that I got from this implementation:

    1. FS implementation will just doing add, no update & delete.
    2. Make sure to run maintainer every day and set its timelimit, so it won't run during working hours.
    0 comments No comments