Undertanding Medallion Architecture in Microsoft Fabric Lakehouse

In Microsoft Fabric, the Medallion Architecture Layers is a design pattern employed to logically organize data in a lakehouse. The architecture comprises three distinct layers (Bronze, Silver and Gold), each indicating the quality of data stored in the lakehouse, with higher levels representing higher quality. The multi-layered approach helps Fabric Engineers to build a single source of truth for enterprise data products.  

Medallion Architecture layers

Bronze Layer 

All data for the lakehouse begins with the bronze layer of the medallion architecture. This layer stores data in its raw format, regardless of whether it is structured, semi-structured, or unstructured. No modifications are done to the data in this layer. 

Silver Layer

The silver layer of the medallion architecture is where fabric data engineers and users process and refine their data which includes performing operations such as appending, merging data, and applying data validation rules like removing nulls and deduplication (removing redundant data). The silver layer is a central repository for Fabric-powered organizations to store their data in a consistent format and seamlessly share it with multiple teams. In the silver layer, data learning is undertaken so that everything is in one place and ready to be modeled and analyzed in the gold layer.

Gold Layer

The gold layer of the medallion architecture is where users and Fabric engineers enrich their data with additional information and analysis. The layer allows engineers or users to aggregate data to a specific level of detail, such as daily or hourly, or add external data sources to their data. The gold layer of the medallion architecture is where you'll enrich your data with additional information and analysis. This layer allows you to aggregate data to a specific level of detail, such as daily or hourly, or add external data sources to your data. Once the data reaches the gold stage, it's ready for use by downstream teams, including analytics, data science, or Machine Learning operations


Similar Articles