Databricks Certified - Data Engineer Associate

LEARNING PATHWAY 1: ASSOCIATE DATA ENGINEERING

1. Data Ingestion with Lakeflow Connect

From Cloud Storage

  • CREATE TABLE AS (CTAS)
  • COPY INTO
  • Auto Loader (all three are sketched below)

Data Ingestion from Cloud Storage
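
A minimal sketch of the three ingestion methods listed above, assuming a hypothetical S3 path and Unity Catalog table names that are not from the course; options vary by file format and source:

```python
# Sketch of the three cloud-storage ingestion patterns.
# Paths and table names are illustrative.

# 1. CREATE TABLE AS (CTAS): one-time load from files into a Delta table.
spark.sql("""
    CREATE TABLE IF NOT EXISTS main.bronze.events AS
    SELECT * FROM read_files('s3://my-bucket/events/', format => 'json')
""")

# 2. COPY INTO: idempotent, incremental batch loads
#    (files already loaded are skipped on re-run).
spark.sql("""
    COPY INTO main.bronze.events
    FROM 's3://my-bucket/events/'
    FILEFORMAT = JSON
""")

# 3. Auto Loader: incremental ingestion with the cloudFiles source.
(spark.readStream
    .format("cloudFiles")
    .option("cloudFiles.format", "json")
    .option("cloudFiles.schemaLocation", "s3://my-bucket/_schemas/events")
    .load("s3://my-bucket/events/")
    .writeStream
    .option("checkpointLocation", "s3://my-bucket/_checkpoints/events")
    .trigger(availableNow=True)
    .toTable("main.bronze.events"))
```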

From Databases

Lakeflow Connect Managed Connectors: Database Ingestion

From Enterprise Applications

Lakeflow Connect Managed Connectors: SaaS Ingestion

2. Deploy Workloads with Lakeflow Jobs

3. Build Data Pipelines with Lakeflow Declarative Pipelines

4. Data Management and Governance with Unity Catalog

Exam Guide

Section 1: Databricks Intelligence Platform

  • Enable features that simplify data layout decisions and optimize query performance (see the sketch after this list).
  • Explain the value of the Data Intelligence Platform.
  • Identify the applicable compute to use for a specific use case.
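
The first bullet commonly maps to features such as liquid clustering and predictive optimization; that mapping is an assumption, and the table name below is hypothetical:

```python
# Illustrative only: layout/optimization features on a Delta table.

# Liquid clustering: Delta manages data layout by the chosen keys,
# replacing manual partitioning and ZORDER decisions.
spark.sql("""
    CREATE TABLE IF NOT EXISTS main.sales.orders (
        order_id BIGINT,
        order_date DATE,
        region STRING
    )
    CLUSTER BY (order_date, region)
""")

# Predictive optimization (an account/schema-level setting) runs
# maintenance like OPTIMIZE automatically; a manual run still works:
spark.sql("OPTIMIZE main.sales.orders")
```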

Section 2: Development and Ingestion

  • Use Databricks Connect in a data engineering workflow (see the sketch after this list).
  • Describe the capabilities of Databricks Notebooks.
  • Classify valid Auto Loader sources and use cases.
  • Demonstrate knowledge of Auto Loader syntax.
  • Use Databricks' built-in debugging tools to troubleshoot a given issue.
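
A minimal Databricks Connect sketch, assuming the databricks-connect package is installed and workspace authentication is already configured; the Auto Loader bullets are covered by the cloud-storage sketch shown earlier:

```python
# Databricks Connect: run PySpark code from a local IDE against a
# Databricks cluster. Assumes `pip install databricks-connect` and
# credentials configured (e.g., via `databricks auth login`).
from databricks.connect import DatabricksSession

spark = DatabricksSession.builder.getOrCreate()

# From here, standard PySpark executes remotely on the workspace.
df = spark.read.table("samples.nyctaxi.trips")
df.groupBy("pickup_zip").count().show(5)
```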

Section 3: Data Processing & Transformations

  • Describe the three layers of the Medallion Architecture and explain the purpose of each layer in a data processing pipeline.
  • Classify the type of cluster and configuration for optimal performance based on the scenario in which the cluster is used.
  • Explain the advantages of Lakeflow Declarative Pipelines (LDP) for ETL processes in Databricks.
  • Implement data pipelines using LDP (see the sketch after this list).
  • Identify DDL/DML features.
  • Compute complex aggregations and metrics with PySpark DataFrames.
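
A minimal LDP sketch of a bronze-silver-gold medallion flow using the Python pipeline API historically exposed as the dlt module (LDP is the successor to Delta Live Tables); the source path and column names are illustrative:

```python
# Sketch of a medallion pipeline in Lakeflow Declarative Pipelines.
import dlt
from pyspark.sql import functions as F

@dlt.table(comment="Bronze: raw events ingested with Auto Loader")
def events_bronze():
    return (spark.readStream
            .format("cloudFiles")
            .option("cloudFiles.format", "json")
            .load("s3://my-bucket/events/"))

@dlt.table(comment="Silver: cleaned and validated events")
@dlt.expect_or_drop("valid_user", "user_id IS NOT NULL")
def events_silver():
    return (dlt.read_stream("events_bronze")
            .withColumn("event_date", F.to_date("event_ts")))

@dlt.table(comment="Gold: daily event counts per user")
def events_gold():
    return (dlt.read("events_silver")
            .groupBy("event_date", "user_id")
            .agg(F.count("*").alias("events")))
```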

Section 4: Productionizing Data Pipelines

  • Identify the differences between Databricks Asset Bundles (DABs) and traditional deployment methods.
  • Identify the structure of a DAB (see the databricks.yml sketch after this list).
  • Deploy a workflow, then repair and rerun a task in case of failure.
  • Use serverless for hands-off, auto-optimized compute managed by Databricks.
  • Analyze the Spark UI to optimize queries.
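
A minimal databricks.yml sketch showing how a bundle is structured: bundle metadata, resources (here a job), and deployment targets. All names and paths are illustrative:

```yaml
# databricks.yml — minimal Databricks Asset Bundle sketch.
bundle:
  name: my_etl_bundle

resources:
  jobs:
    daily_etl:
      name: daily_etl
      tasks:
        - task_key: ingest
          notebook_task:
            notebook_path: ./notebooks/ingest.py

targets:
  dev:
    mode: development
    default: true
  prod:
    mode: production
```

Validate with databricks bundle validate, deploy with databricks bundle deploy -t dev, and run with databricks bundle run daily_etl.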

Section 5: Data Governance & Quality

  • Explain the difference between managed and external tables (see the sketch after this list).
  • Identify how permissions are granted to users and groups within Unity Catalog (UC).
  • Identify key roles in UC.
  • Identify how audit logs are stored.
  • Use lineage features in UC.
  • Use the Delta Sharing feature available with UC to share data.
  • Identify the advantages and limitations of Delta Sharing.
  • Identify the types of Delta Sharing: Databricks-to-Databricks vs. sharing with external systems.
  • Analyze the cost considerations of data sharing across clouds.
  • Identify use cases for Lakehouse Federation when connecting to external sources.
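
A minimal sketch of the table-type, permission, and sharing commands above. All catalog, table, group, and recipient names are illustrative, and the external table assumes an external location has already been configured in UC:

```python
# Managed table: Unity Catalog owns both the metadata and the data files.
spark.sql("CREATE TABLE main.sales.orders_managed (id BIGINT, amount DOUBLE)")

# External table: UC owns only the metadata; data stays at the given
# location, so DROP TABLE does not delete the underlying files.
spark.sql("""
    CREATE TABLE main.sales.orders_external (id BIGINT, amount DOUBLE)
    LOCATION 's3://my-bucket/tables/orders'
""")

# Grant privileges to a group.
spark.sql("GRANT USE CATALOG ON CATALOG main TO `data_analysts`")
spark.sql("GRANT SELECT ON TABLE main.sales.orders_managed TO `data_analysts`")

# Delta Sharing (provider side): create a share, add a table,
# and grant it to a recipient.
spark.sql("CREATE SHARE IF NOT EXISTS sales_share")
spark.sql("ALTER SHARE sales_share ADD TABLE main.sales.orders_managed")
spark.sql("CREATE RECIPIENT IF NOT EXISTS partner_org")
spark.sql("GRANT SELECT ON SHARE sales_share TO RECIPIENT partner_org")
```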