Databricks Certified Associate Developer for Apache Spark 3.0 (DATABRICKS-CERTIFIED-ASSOCIATE-DEVELOPER-FOR-APACHE-SPARK-3.0) - DataBricks Actual Exam Questions
Last updated on May 14, 2026
Which of the following code blocks sorts DataFrame transactionsDf both by column storeId in ascending and by column productId in descending order, in this priority?
transactionsDf.sort("storeId", asc("productId"))
transactionsDf.sort(col(storeId)).desc(col(productId))
transactionsDf.order_by(col(storeId), desc(col(productId)))
transactionsDf.sort("storeId", desc("productId"))
transactionsDf.sort("storeId").sort(desc("productId"))
to join the discussion
No discussions yet. Be the first to ask!
Delete Comment
Are you sure? This action cannot be undone.
Which of the following code blocks displays various aggregated statistics of all columns in DataFrame transactionsDf, including the standard deviation and minimum of values in each column?
transactionsDf.summary()
transactionsDf.agg("count", "mean", "stddev", "25%", "50%", "75%", "min")
transactionsDf.summary("count", "mean", "stddev", "25%", "50%", "75%", "max").show()
transactionsDf.agg("count", "mean", "stddev", "25%", "50%", "75%", "min").show()
transactionsDf.summary().show()
to join the discussion
No discussions yet. Be the first to ask!
Delete Comment
Are you sure? This action cannot be undone.
Which of the following are valid execution modes?
Kubernetes, Local, Client
Client, Cluster, Local
Server, Standalone, Client
Cluster, Server, Local
Standalone, Client, Cluster
to join the discussion
No discussions yet. Be the first to ask!
Delete Comment
Are you sure? This action cannot be undone.
Which of the following describes slots?
Slots are dynamically created and destroyed in accordance with an executor's workload.
To optimize I/O performance, Spark stores data on disk in multiple slots.
A Java Virtual Machine (JVM) working as an executor can be considered as a pool of slots for task execution.
A slot is always limited to a single core. Slots are the communication interface for executors and are used for receiving commands and sending results to the driver.
to join the discussion
No discussions yet. Be the first to ask!
Delete Comment
Are you sure? This action cannot be undone.
Which of the following describes characteristics of the Dataset API?
The Dataset API does not support unstructured data.
In Python, the Dataset API mainly resembles Pandas' DataFrame API.
In Python, the Dataset API's schema is constructed via type hints.
The Dataset API is available in Scala, but it is not available in Python.
The Dataset API does not provide compile-time type safety.
to join the discussion
No discussions yet. Be the first to ask!
Delete Comment
Are you sure? This action cannot be undone.
Finish Practice?
Are you sure you want to finish? This will end your practice session.