Comparing Disk-Based Table Storage to Memory-Optimized Table Storage
Applies to: SQL Server Azure SQL Database Azure SQL Managed Instance
Categories | Disk-based Table | Durable Memory-Optimized Table |
---|---|---|
DDL | Metadata information is stored in system tables in the primary filegroup of the database and is accessible through catalog views. | Metadata information is stored in system tables in the primary filegroup of the database and is accessible through catalog views. |
Structure | Rows are stored in 8K pages. A page stores only rows from the same table. | Rows are stored as individual rows. There is no page structure. Two consecutive rows in a data file can belong to different memory-optimized tables. |
Indexes | Indexes are stored in a page structure similar to data rows. | Only the index definition is persisted (not index rows). Indexes are maintained in-memory and are regenerated when the memory-optimized table is loaded into memory as part of restarting a database. Since index rows are not persisted, no logging is done for index changes. |
DML operation | The first step is to find the page and then load it into buffer-pool. Insert SQL Server inserts the row on the page accounting for row ordering for clustered index. Delete SQL Server locates the row to be deleted on the page and marks it deleted. Update SQL Server locates the row on the page. The update is done in-place for non-key columns. Key-column update is done by a delete and insert operation. After the DML operation completes, the affected pages are flushed to disk as part of buffer pool policy, checkpoint, or transaction commit for minimally logged operations. Both read/write operations on pages lead to unnecessary I/O. |
For memory-optimized tables, since the data resides in memory, the DML operations are done directly in memory. There is a background thread that reads the log records for memory-optimized tables and persist them into data and delta files. An update generates a new row version. But an update is logged as a delete followed by an insert. |
Data Fragmentation | Data manipulation fragments data leading to partially filled pages and logically consecutive pages that are not contiguous on disk. This degrades data access performance and requires you to defragment data. | Memory-optimized data isn't stored in pages so there's no data fragmentation. However, as rows are updated and deleted, the data and delta files need to be compacted. This is done by a background MERGE thread based on a merge policy. |