DATABRICKS CERTIFIED DATA ENGINEER ASSOCIATE PRACTICE QUESTIONS AND ANSWERS 2023 WITH COMPLETE

A data analyst has noticed that their Databricks SQL queries are running too slowly. They claim that this issue is affecting all of their sequentially run queries. They ask the data engineering team for help. The data engineering team notices that each of the queries uses the same SQL endpoint, but the SQL endpoint is not used by any other user.

Which of the following approaches can the data engineering team use to improve the latency of the data analyst's queries?

They can increase the cluster size of the SQL endpoint.

A data architect is designing a data model that works for both video-based machine learning workloads and highly audited batch ETL/ELT workloads.

Which of the following describes how using a data lakehouse can help the data architect meet the needs of both workloads?

A data lakehouse stores unstructured data and is ACID-compliant.

A data engineer has a Job with multiple tasks that runs nightly. One of the tasks unexpectedly fails during 10 percent of the runs.

Which of the following actions can the data engineer perform to ensure the Job completes each night while minimizing compute costs?

They can institute a retry policy for the task that periodically fails.
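
As a rough illustration of what a task-level retry policy can look like, the sketch below attaches retry settings to a single task definition in the shape used by the Databricks Jobs API. The field names `max_retries`, `min_retry_interval_millis`, and `retry_on_timeout` are assumed to match Jobs API 2.1; the task key, notebook path, and the helper `with_retry_policy` are hypothetical and not part of the original question.

```python
import json


def with_retry_policy(task, max_retries=2, min_retry_interval_ms=60_000):
    """Return a copy of a task definition with retry settings attached.

    Only the flaky task is retried, so the rest of the Job does not
    have to be re-run when this one task fails.
    """
    updated = dict(task)  # shallow copy so the caller's dict is untouched
    updated["max_retries"] = max_retries
    updated["min_retry_interval_millis"] = min_retry_interval_ms
    updated["retry_on_timeout"] = False
    return updated


# Hypothetical task definition; the key and path are placeholders.
flaky_task = {
    "task_key": "load_orders",
    "notebook_task": {"notebook_path": "/Repos/etl/load_orders"},
}

retried_task = with_retry_policy(flaky_task)
print(json.dumps(retried_task, indent=2))
```

Scoping the retry to the one unreliable task, rather than re-running the whole Job, is what keeps the extra compute cost small.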

A data engineer has created a Delta table as part of a data pipeline. Downstream data analysts now need SELECT permission on the Delta table.

Assuming the data engineer is the Delta table owner, which part of the Databricks Lakehouse Platform can the data engineer use to grant the data analysts the appropriate access?

Data Explorer
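
Data Explorer is a point-and-click interface, but the same permission can also be expressed as a SQL `GRANT` statement. The sketch below only builds that statement as a string; the table name `sales.orders`, the group name `data-analysts`, and both helper functions are hypothetical, and in a Databricks notebook the result would be passed to `spark.sql`.

```python
def quote_ident(name):
    """Wrap one identifier part in backticks, escaping embedded backticks."""
    return "`" + name.replace("`", "``") + "`"


def grant_select_statement(table, principal):
    """Build a GRANT SELECT statement for a (possibly qualified) table name."""
    quoted_table = ".".join(quote_ident(part) for part in table.split("."))
    return f"GRANT SELECT ON TABLE {quoted_table} TO {quote_ident(principal)}"


# Hypothetical table and group names.
statement = grant_select_statement("sales.orders", "data-analysts")
print(statement)

# In a Databricks notebook, the table owner could then run:
#   spark.sql(statement)
```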

A data engineer has ingested data from an external source into a PySpark DataFrame raw_df. They need to briefly make this data available in SQL for a data analyst to perform a quality assurance check on the data.

Which of the following commands should the data engineer run to make this data available in SQL for only the remainder of the Spark session?

raw_df.createOrReplaceTempView("raw_df")
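
A temporary view is scoped to the Spark session that created it and disappears when that session ends, which is why it fits the "remainder of the session" requirement. The sketch below wraps the call in a small helper; `expose_for_session` and the sample query are hypothetical, and `raw_df` and `spark` are assumed to already exist in the notebook.

```python
def expose_for_session(df, view_name):
    """Register df as a session-scoped temporary view.

    Returns a sample query the analyst could run against the view.
    """
    # Replaces any existing temp view of the same name in this session.
    df.createOrReplaceTempView(view_name)
    return f"SELECT * FROM {view_name} LIMIT 10"


# In a notebook where `spark` and `raw_df` already exist:
#   query = expose_for_session(raw_df, "raw_df")
#   spark.sql(query).show()
```

Nothing is written to the metastore or to storage here, so there is no cleanup step once the quality check is finished.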

A data engineer has set up a notebook to run automatically as a Job. The data engineer's manager wants to version control the schedule due to its complexity.

Which of the following approaches can the data engineer use to obtain a version-controllable configuration of the Job's schedule?

They can download the JSON description of the Job from the Job's page.
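
Once the JSON description has been downloaded, it helps to store it in a stable form so that diffs in version control show only real changes. The sketch below is one possible way to do that; the helper `save_job_definition`, the file name, and the trimmed sample JSON are hypothetical, and the schedule field names `quartz_cron_expression` and `timezone_id` are assumed to match the Jobs API.

```python
import json
import tempfile
from pathlib import Path


def save_job_definition(job_json_text, destination):
    """Rewrite a Job's JSON description in a stable, diff-friendly form."""
    job = json.loads(job_json_text)
    # Sorted keys and fixed indentation keep commits free of ordering noise.
    normalized = json.dumps(job, indent=2, sort_keys=True) + "\n"
    Path(destination).write_text(normalized, encoding="utf-8")
    return job


# Hypothetical, heavily trimmed example of a downloaded description.
downloaded = (
    '{"settings": {"schedule": {"timezone_id": "UTC", '
    '"quartz_cron_expression": "0 0 2 * * ?"}, "name": "nightly_etl"}}'
)

with tempfile.TemporaryDirectory() as tmp:
    path = Path(tmp) / "nightly_etl.job.json"
    job = save_job_definition(downloaded, path)
    print(path.read_text(encoding="utf-8"))
```

In practice the destination would be a path inside the team's repository rather than a temporary directory.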
