Optimal refresh for materialized lake views in a lakehouse

Each time a scheduled refresh runs for your materialized lake views, Fabric determines the best strategy to use—no refresh, incremental, or full—based on what changed in the source data. This behavior is called optimal refresh, and it helps you keep your materialized lake views up to date while minimizing compute costs and refresh time.

This article explains how optimal refresh works, what each strategy does, and how to switch to full refresh mode when needed.

Note

Optimal refresh isn't supported in the following scenarios:

PySpark definitions: Optimal refresh applies only to MLVs defined with Spark SQL. PySpark-defined MLVs always use full refresh.
Non-Delta source tables: Materialized lake views that use non-Delta tables as a source always perform a full refresh. Incremental and no-refresh strategies require Delta table sources.

Benefits of optimal refresh

By analyzing delta commits on source tables, optimal refresh can make smart decisions about how to process your data. Where possible, this can result in:

Lower cost: Less compute and storage are used when Fabric detects that source data didn't change and skips the refresh entirely. No extra fees apply for optimal refresh—you're billed based on compute usage during refresh operations.
Improved efficiency: Faster refresh cycles when only changed data needs to be processed, helping you deliver fresher insights.
Time savings: Reduced refresh duration when incremental processing is applied instead of recomputing the full dataset.

Optimal refresh strategies

The following table describes the refresh strategies that optimal refresh can select:

Refresh Policy	Description
No refresh	If no new delta commits are detected on the source tables, Fabric skips the refresh entirely, avoiding unnecessary compute.
Incremental refresh	Processes only the changed data when new delta commits are detected on the source tables.
Full refresh	Recomputes the entire materialized lake view from the full source dataset. This strategy is used when unsupported expressions are detected, when changes can't be processed incrementally, or when the source dataset is small enough that a full recompute is faster than incremental processing.

Important

Incremental refresh requires the delta change data feed (CDF) property (delta.enableChangeDataFeed=true) on all source tables referenced in the materialized lake view definition. Without CDF enabled, optimal refresh can only choose between no refresh and full refresh. For more information, see Enable incremental refresh.

Set up optimal refresh

The optimal refresh toggle gives you no-refresh and full-refresh strategies without any extra setup. To unlock incremental refresh strategy, you also need to enable change data feed on your source tables.

Turn on optimal refresh mode

By default, optimal refresh mode is enabled for a materialized lake view lineage. If it's not enabled, follow these steps to turn it on:

Go to your lakehouse and select Materialized lake views.
Select Manage, and then select the Optimal refresh toggle to turn it on.

Enable incremental refresh

To use incremental refresh, you need to enable the delta change data feed (CDF) property on all source tables or materialized lake views referenced in the materialized lake view definition. CDF lets Fabric read only the rows that changed since the last refresh, instead of reprocessing the full dataset.

Without CDF enabled, optimal refresh can only choose between no refresh and full refresh.

Incremental refresh is supported for append-only data. If the source data includes deletions or updates, Fabric performs a full refresh.

Note

Enabling CDF on your source tables has no measurable storage or performance effect for append-only workloads, which is the scenario that incremental refresh supports. CDF is a standard Delta Lake table property that other Fabric features can also benefit from. For more information about how CDF works, see Use Delta Lake change data feed.

You can enable CDF at creation time by including TBLPROPERTIES in the CREATE statement:

CREATE OR REPLACE MATERIALIZED LAKE VIEW silver.cleaned_order_data
TBLPROPERTIES (delta.enableChangeDataFeed=true)
AS
SELECT 
    o.order_id,
    o.order_date,
    o.product_id,
    p.product_name,
    o.quantity,
    p.price,
    o.quantity * p.price AS revenue
FROM bronze.orders o
INNER JOIN bronze.products p
ON o.product_id = p.product_id

For existing source tables, use ALTER TABLE to enable CDF:

ALTER TABLE <table-name> SET TBLPROPERTIES (delta.enableChangeDataFeed = true);

For example, to enable CDF on both source tables from the get started guide:

ALTER TABLE bronze.products SET TBLPROPERTIES (delta.enableChangeDataFeed = true);
ALTER TABLE bronze.orders SET TBLPROPERTIES (delta.enableChangeDataFeed = true);

SQL constructs supported by incremental refresh

Incremental refresh works when your materialized lake view definition uses only the SQL constructs described here. If your query includes unsupported constructs—such as window functions or non-deterministic functions—Fabric still refreshes your data, but falls back to a full refresh.

SQL Construct	Remark
SELECT expression	Deterministic built-in functions and expressions are supported. Not supported for incremental refresh: aggregate functions (`SUM()`, `COUNT()`, `AVG()`, `MIN()`, `MAX()`, `STDDEV()`, etc.), `GROUP BY`, `DISTINCT`, window functions, and non-deterministic functions such as `rand()`, `uuid()`, `current_timestamp()`.
FROM	Supports Delta tables and materialized lake views. Subqueries and CTEs work if they use only the supported clauses.
WHERE	Only deterministic built-in functions are supported.
INNER JOIN	Supported.
LEFT OUTER JOIN / LEFT SEMI JOIN	Supported. Incremental refresh works only if the right-side table remains unchanged during the refresh cycle. Any change to the right-side table triggers a full refresh.
UNION ALL	Supported.
WITH	Common table expressions (CTEs) if they use only supported clauses.
Subqueries in expressions	Subqueries within SELECT or WHERE expressions (such as scalar subqueries or `EXISTS`) trigger a full refresh if any referenced table has changes.
Data quality constraints	Only deterministic built-in functions are supported in constraints.

Note

Using unsupported constructs doesn't prevent you from creating a materialized lake view. It only means that Fabric uses a full refresh instead of an incremental refresh.

Full refresh

Optimal refresh automatically falls back to full refresh when needed, so you don't normally need to force it. However, there are cases where you might want to trigger a full refresh manually—for example, to troubleshoot unexpected results or to reprocess data after a correction.

Run a one-time full refresh with SQL

To force a full refresh of a specific materialized lake view, run the following command:

REFRESH MATERIALIZED LAKE VIEW [workspace.lakehouse.schema].MLV_Identifier FULL

Note

If your workspace name contains spaces, enclose it in backticks: `My Workspace`.lakehouse.schema.view_name

Turn off optimal refresh

If you want every scheduled run to perform a full refresh, you can turn off the optimal refresh toggle. This disables both the no-refresh and incremental strategies—every run recomputes the full dataset, even if no source data changed.

Go to your lakehouse and select Materialized lake views.
Click on Manage and turn off the Optimal refresh toggle.

Feedback

Was this page helpful?

Last updated on 2026-04-03

Optimal refresh for materialized lake views in a lakehouse

Benefits of optimal refresh

Optimal refresh strategies

Set up optimal refresh

Turn on optimal refresh mode

Enable incremental refresh

SQL constructs supported by incremental refresh

Full refresh

Run a one-time full refresh with SQL

Turn off optimal refresh

Related content

Feedback

Additional resources