What does the Databricks Data Engineer Associate exam cover?

Five topic areas: the Databricks Data Intelligence Platform, development and ingestion (Delta Lake, Auto Loader), data processing and transformations (Spark SQL, PySpark, streaming), productionising data pipelines (Lakeflow Declarative Pipelines and Jobs), and data governance and quality (Unity Catalog and expectations).

Does Databricks publish a weight for each topic area?

No. The official exam guide lists the topic areas but does not publish a percentage weight for each, so prepare across all five rather than optimising for one. Third-party figures are not from Databricks.

Syllabus · Data & Analytics

Databricks Certified Data Engineer Associate Topics Explained

intermediate

The Databricks Data Engineer Associate topics explained in plain English: the platform, ingestion, transformations, pipelines and governance. No invented weights.

By The Exam Atlas Editorial Team · Verified 2026-06-06

The Databricks Data Engineer Associate is organised into five topic areas. This is a plain-English summary; the official Databricks exam guide is authoritative. Databricks does not publish a percentage weight for each area, so the table below lists them without invented numbers - prepare across all five.

#	Topic area	Official weight
1	Databricks Data Intelligence Platform	Not published
2	Development and Ingestion	Not published
3	Data Processing and Transformations	Not published
4	Productionizing Data Pipelines	Not published
5	Data Governance and Quality	Not published

1 - Databricks Data Intelligence Platform

The lakehouse foundations: the workspace, clusters and compute, notebooks, Databricks SQL, and how data is organised with the medallion (bronze → silver → gold) design. This is the mental model the other areas build on.

2 - Development and Ingestion

Getting data in and creating tables. Delta Lake tables (ACID, schema enforcement, time travel), the difference between managed and external tables, basic reads and writes, and incremental file ingestion with Auto Loader.

3 - Data Processing and Transformations

Transforming data with Spark SQL and PySpark, and the basics of Structured Streaming for incremental processing - understanding what a streaming query does differently from a one-off batch run.

4 - Productionizing Data Pipelines

Building declarative pipelines with Lakeflow Declarative Pipelines (formerly Delta Live Tables), including data-quality expectations, and orchestrating and scheduling work with Databricks Jobs/Workflows.

5 - Data Governance and Quality

Governing data with Unity Catalog - the catalog.schema.table namespace, permissions and lineage - and applying data-quality checks within pipelines so bad data is caught early.

FAQ

What does the Databricks Data Engineer Associate exam cover?: Five topic areas: the Databricks Data Intelligence Platform, development and ingestion (Delta Lake, Auto Loader), data processing and transformations (Spark SQL, PySpark, streaming), productionising data pipelines (Lakeflow Declarative Pipelines and Jobs), and data governance and quality (Unity Catalog and expectations).
Does Databricks publish a weight for each topic area?: No. The official exam guide lists the topic areas but does not publish a percentage weight for each, so prepare across all five rather than optimising for one. Third-party figures are not from Databricks.

Sources

Databricks - Certified Data Engineer Associate ↗

Quick facts

Provider: Databricks
Exam code: -
Level: intermediate
Format: 45 scored multiple-choice questions (plus a few unscored items), delivered online and proctored through the Databricks testing partner. Tests entry-level data-engineering tasks on the Databricks Data Intelligence Platform with Spark SQL and Python.
Questions: 45 scored questions (plus a small number of unscored items)
Duration: 1h 30m
Passing score: Not published. Databricks does not publish the passing line for this exam, so aim for broad competence across all sections rather than a target percentage.
Exam fee: $200 (US$200 per attempt (plus any applicable tax). Confirm the current fee and your local tax on the Databricks certification page before booking.)
Validity: 2 years. You recertify by retaking the current exam before the credential expires.
Languages: EN