Glossary · Data & Analytics

Microsoft Fabric Data Engineer (DP-700) Glossary

intermediate

A free Microsoft Fabric Data Engineer (DP-700) glossary: core terms (Lakehouse, Warehouse, OneLake, Dataflows Gen2, KQL, Direct Lake) defined in plain English for study.

By The Exam Atlas Editorial Team · Verified 2026-06-06

Plain-English definitions of the core Microsoft Fabric terms for DP-700 study. Simplified for learning; Microsoft documentation is authoritative.

TermDefinition
Microsoft FabricThe unified analytics platform that holds all the items below over one data lake.
OneLakeThe single, tenant-wide data lake every Fabric workspace and item shares.
WorkspaceA container for Fabric items where a team collaborates and sets permissions.
LakehouseA Fabric item storing files and Delta tables; loaded with notebooks, read via a SQL endpoint.
WarehouseA Fabric item with a full T-SQL engine for set-based transformation and serving.
Delta (Delta Lake)The open table format Fabric uses, adding transactions and versioning over Parquet files.
NotebookA code-first item (PySpark, Spark SQL) for transforming data at scale.
SparkThe distributed engine behind Fabric notebooks for large-scale transformation.
Dataflows Gen2The low-code, Power Query-based transform item for ingesting and shaping data.
Data pipelineAn orchestration item that copies data and runs activities in sequence.
OneLake shortcutA pointer to data in another location, reused without copying it.
MirroringContinuously replicating an external database into OneLake as Delta tables.
Medallion architectureBronze (raw), silver (cleaned), gold (business-ready) data layering.
Dimensional modelFact and dimension tables shaped for analytics and reporting.
Full vs incremental loadReloading all data versus loading only new or changed rows.
KQLKusto Query Language, used to query high-volume event and telemetry data.
EventhouseReal-Time Intelligence storage that holds KQL databases.
EventstreamA no-code item for capturing, transforming and routing streaming data.
Spark structured streamingSpark’s API for processing streaming data in notebooks.
Windowing functionA time-based grouping (such as tumbling or hopping) over a stream.
Direct LakeA semantic-model mode that reads OneLake Delta tables directly, at import-like speed.
Semantic modelThe published data model (dataset) that Power BI reports connect to.
Deployment pipelineA tool to promote content across development, test and production stages.
Row-level security (RLS)Restricting which rows a user can see, by role.
Dynamic data maskingHiding sensitive column values from unauthorised users at query time.
Sensitivity labelA governance tag that classifies and protects an item’s data.

Sources