What is Microsoft Fabric ?

Pinjari Akbar
7 min readJan 22, 2024

--

Microsoft Fabric provides a unified analytics solution tailored for enterprises, encompassing data movement, data science, real-time analytics, and business intelligence. This all-in-one platform delivers a comprehensive suite of services, including data lake management, data engineering, and seamless data integration, streamlining your analytics workflow.

Fabric eliminates the need to cobble together disparate services from various vendors. Instead, it offers a seamlessly integrated, end-to-end solution designed for simplicity and ease of use in meeting your analytics requirements.

Leveraging a Software as a Service (SaaS) foundation, Fabric takes simplicity and integration to new heights, providing a user-friendly experience for all your analytics needs.

SaaS foundation

SaaS foundation

Microsoft Fabric consolidates both new and existing components from Power BI, Azure Synapse, and Azure Data Factory into a unified and integrated environment. This cohesive platform offers tailored user experiences, combining functionalities such as Data Engineering, Data Factory, Data Science, Data Warehouse, Real-Time Analytics, and Power BI on a shared Software as a Service (SaaS) foundation. The integration of these components provides several key advantages:

1.Extensive and Integrated Analytics:

Fabric delivers a broad range of deeply integrated analytics capabilities, positioning itself as an industry leader.

2.Shared and Familiar Experiences:

Users benefit from shared experiences across different functionalities, ensuring familiarity and ease of learning.

3.Efficient Asset Access and Reuse:

Developers can easily access and reuse assets across the platform, streamlining the development process.

4. Unified Data Lake:

Fabric enables the use of a unified data lake, allowing users to retain data in its original location while utilizing their preferred analytics tools.

5. Centralized Administration and Governance:

The platform facilitates centralized administration and governance across all integrated experiences, ensuring consistent management practices.

6. Seamless SaaS Integration:

Microsoft Fabric provides a seamless integration of data and services within a SaaS experience. IT teams can centrally configure core enterprise capabilities, and permissions are automatically applied across all underlying services.

7. Automatic Inheritance of Data Sensitivity Labels:

Data sensitivity labels are inherited automatically across all items in the suite, enhancing security and compliance.

By adopting Microsoft Fabric, creators can focus on generating their best work without the burden of integrating, managing, or understanding the underlying infrastructure supporting the diverse set of experiences within the platform. This approach enhances productivity and allows teams to concentrate on delivering high-quality outcomes.

Components of Microsoft Fabric

Microsoft Fabric provides a cohesive suite of analytics experiences that seamlessly collaborate to meet diverse analytical needs. Each tailored experience is designed for specific user roles and tasks, encompassing industry-leading functionalities in key categories to address end-to-end analytical requirements.

Microsoft Fabric

1 Data Engineering

The Data Engineering experience within Microsoft Fabric delivers a cutting-edge Spark platform, offering exceptional authoring capabilities. This empowers data engineers to execute large-scale data transformations and democratize data access through the lakehouse architecture. Microsoft Fabric Spark seamlessly integrates with Data Factory, allowing the scheduling and orchestration of notebooks and Spark jobs.

2.Data Factory

Azure Data Factory merges the user-friendly simplicity of Power Query with the robust scale and capabilities of Azure Data Factory. With over 200 native connectors, it facilitates seamless connections to data sources both on-premises and in the cloud. This powerful combination empowers users to efficiently manage and integrate data from diverse sources.

3.Data Science

The Data Science experience in Microsoft Fabric facilitates the effortless creation, deployment, and operationalization of machine learning models directly within the Fabric environment. By integrating with Azure Machine Learning, it offers built-in features for experiment tracking and model registry. This empowers data scientists to enhance organizational data with predictive insights. Furthermore, business analysts can seamlessly incorporate these predictions into their BI reports, transitioning from descriptive to predictive insights within the analytical workflow.

4.Data Warehouse

The Data Warehouse experience within Microsoft Fabric offers industry-leading SQL performance and scalability. Notably, it adopts a model that separates compute from storage, allowing independent scaling of each component. Furthermore, it natively stores data in the open Delta Lake format, enhancing data management and accessibility within the analytical ecosystem.

5.Real-Time Analytics

Real-Time Analytics in Microsoft Fabric excels at handling observational data sourced from diverse channels like applications, IoT devices, and human interactions. This category of data, marked by rapid growth, is often semi-structured in formats like JSON or Text, and it arrives in high volumes with dynamic schemas. Traditional data warehousing platforms face challenges with such data characteristics. Microsoft Fabric’s Real-Time Analytics stands out as a best-in-class engine specifically designed for the analytics of observational data, addressing the complexities associated with its volume, structure, and real-time nature.

6.Power BI

Power BI stands as the world’s premier Business Intelligence platform within Microsoft Fabric. It empowers business owners to swiftly and intuitively access all the data present in Fabric. This accessibility enables informed decision-making, leveraging the power of data for better business outcomes.

Fabric seamlessly consolidates diverse analytical experiences, creating a unified platform that stands as the most comprehensive big data analytics solution in the industry. Microsoft Fabric empowers organizations and individuals to transform large and intricate data repositories into actionable workloads and insights. It represents an implementation of the data mesh architecture, providing a sophisticated and integrated approach to harnessing the power of big data for effective analytics and decision-making.

OneLake and lakehouse-the unification of lakehouses

The Microsoft Fabric platform harmonizes the OneLake and lakehouse architecture across enterprises. This integration brings together the capabilities of OneLake and the lakehouse architecture, offering a unified and cohesive solution for managing and analyzing data within the enterprise environment. This approach streamlines data operations, enhancing the efficiency and effectiveness of data-related processes across the organization.

OneLake

The data lake serves as the foundational core for all services within the Microsoft Fabric platform, known as OneLake. This unified storage system is seamlessly integrated into the Fabric service, offering a singular location to store organizational data for all operational experiences.

OneLake, built on Azure Data Lake Storage (ADLS) Gen2, presents a unified Software as a Service (SaaS) experience. It serves both professional and citizen developers, streamlining the user experience. OneLake’s SaaS model simplifies interactions by eliminating the need for users to grapple with infrastructure concepts such as resource groups, Role-Based Access Control (RBAC), Azure Resource Manager, redundancy considerations, or regional complexities. Remarkably, users are not even required to possess an Azure account.

By doing so, OneLake eradicates the prevalent and disordered data silos created by individual developers when provisioning and configuring isolated storage accounts. Instead, it establishes a centralized and uniform storage system for all developers. This not only simplifies data discovery and sharing but also ensures consistent compliance with policy and security settings across the organization.

Organizational structure of OneLake & lakehouse

OneLake adopts a hierarchical structure within Microsoft Fabric, offering simplified management capabilities throughout the organization. This built-in feature eliminates the need for upfront provisioning and ensures a unified experience. There exists only one OneLake per tenant, providing a single-pane-of-glass file-system namespace that spans across users, regions, and even multiple clouds. The data within OneLake is organized into manageable containers for ease of handling.

At the top level of the hierarchy, the tenant serves as the root of OneLake. Within a tenant, users can create multiple workspaces, akin to folders, to organize and structure their data.

In the hierarchy, you can create any number of workspaces within a tenant, with each workspace acting as a container for data. Within each workspace, users have the flexibility to create multiple lakehouses. A lakehouse, in this context, is a collection of files, folders, and tables that collectively represent a database over a data lake. This hierarchical arrangement simplifies the organization, management, and access to data within Microsoft Fabric’s OneLake structure. The visual representation of various Fabric items storing data inside OneLake showcases the flexibility and scalability of this hierarchical model.

In Microsoft Fabric, every developer and business unit within a tenant gains the instant ability to create their own workspaces in OneLake. This enables them to seamlessly ingest data into their designated lakehouses, initiating data processing, analysis, and collaboration — a functionality reminiscent of OneDrive in Office applications.

All Microsoft Fabric compute experiences are inherently connected to OneLake, similar to how Office applications are preconfigured to utilize the organizational OneDrive. Experiences such as Data Engineering, Data Warehouse, Data Factory, Power BI, and Real-Time Analytics seamlessly integrate with OneLake as their native storage, requiring no additional configuration. OneLake is engineered to support the immediate mounting of existing Platform as a Service (PaaS) storage accounts into its framework using the Shortcut feature. This eliminates the need for data migration or relocation.

Through shortcuts, users can access data stored in Azure Data Lake Storage without the necessity of moving or duplicating existing data. Moreover, the shortcut functionality facilitates effortless data sharing between users and applications. This capability extends beyond Azure Data Lake Storage to other storage systems, enabling the composition and analysis of data across multiple clouds. The transparent and intelligent caching of data reduces egress costs and optimally positions data in close proximity to compute resources. This enhances the efficiency of data utilization and collaboration across diverse storage environments within the Microsoft Fabric ecosystem.

--

--

No responses yet