Microsoft Fabric provides a powerful data retention framework for its Warehouse workloads, enabling organisations to balance cost control, compliance requirements, and historical data access. The retention system determines how long data changes are preserved, how far back you can query historical versions, and how storage is managed over time in OneLake.
This article breaks down how data retention works in Fabric Warehouse, why it matters, and how it impacts features like time travel, recovery, and storage costs.
What is Data Retention in Fabric Warehouse?
In Microsoft Fabric Warehouse, data retention defines how long historical versions of data are kept before being automatically removed.
Every time data is inserted, updated, or deleted, Fabric does not immediately discard the old version. Instead, it keeps historical snapshots of data changes so you can:
By default, the system retains this historical data for 30 calendar days, but this can be configured between 1 and 120 days.
![what-is-microsoft-fabric-data-warehouse]()
How Data Retention Works Under the Hood
Fabric Warehouse is built on Delta Lake transaction logs stored in OneLake. This means every change to a table generates versioned data files rather than overwriting existing data.
When a change occurs:
This enables Fabric to reconstruct historical states of the data within the retention window.
Once data exceeds the configured retention period:
Default Retention Period and Configuration Options
By default:
You can configure:
Minimum: 1 day
Maximum: 120 days
Configuration is applied at the warehouse level, not per table.
Changing retention settings
Retention is managed using SQL configuration (e.g., ALTER DATABASE commands in Fabric Warehouse environments).
Key behaviours when changing retention:
Increasing retention
Decreasing retention
Triggers background cleanup of older data
Irreversible in terms of historical recovery
May permanently remove previously accessible versions
Key Features Enabled by Data Retention
Retention directly powers several important Fabric capabilities:
1. Time Travel Queries
You can query data as it existed at a previous point in time using syntax like:
This allows debugging, auditing, and historical analysis within the retention window.
2. Table Cloning
You can create a clone of a table at a specific historical state.
However:
3. Restore Points
Fabric automatically creates restore points:
These are retained only within the configured retention window and are automatically deleted afterward.
4. Warehouse Snapshots
Snapshots allow you to reference a point-in-time version of the warehouse, also limited by the retention period.
Storage and Cost Implications
One of the most important aspects of data retention is its impact on OneLake storage costs.
Because Fabric stores historical versions of data:
Cost trade-offs:
| Retention Strategy | Benefit | Trade-off |
|---|
| Longer retention (e.g. 90–120 days) | Better recovery & auditing | Higher storage costs |
| Shorter retention (e.g. 7–30 days) | Lower storage cost | Reduced historical access |
Retention is therefore a direct cost-control lever in Fabric architectures.
Retention and Data Recovery
Retention is closely tied to disaster recovery and rollback scenarios:
Helps recover from accidental updates or deletions
Supports restoring earlier versions of datasets
Enables investigation of data changes over time
However:
Once data falls outside the retention window, it is permanently deleted
Increasing retention later does not restore previously removed history
Dropped Item Retention (Extra Safety Layer)
Fabric also provides a separate mechanism called dropped item retention.
This allows:
Recovery of deleted warehouses or items
Retention of metadata, tables, and snapshots for a limited period
Protection against accidental deletion
This operates independently of warehouse data retention and is primarily a safety and governance feature.
Best Practices for Data Retention
1. Align retention with business needs
2. Balance cost vs. recovery
More history = better recovery but higher storage costs.
3. Monitor storage usage
Track OneLake consumption after retention changes to avoid unexpected cost spikes.
4. Separate workloads when needed
Different retention requirements may require:
Separate warehouses
Separate environments
Conclusion
Data retention in Microsoft Fabric Warehouse is a foundational capability that controls how long historical data is preserved and how far back users can travel in time.
It directly influences:
By carefully configuring retention periods, organisations can achieve the right balance between cost efficiency, performance, and data governance in Fabric environments.