Databricks Generative AI Engineer Associate Practice Exams
Last updated on Apr 14, 2025
- Exam Code: Databricks Generative AI Engineer Associate
- Exam Name: Databricks Certified Generative AI Engineer Associate
- Certification Provider: Databricks
A Generative AI Engineer has already trained an LLM on Databricks and it is now ready to be deployed.
Which of the following steps correctly outlines the easiest process for deploying a model on Databricks?
- A . Log the model as a pickle object, upload the object to Unity Catalog Volume, register it to Unity Catalog using MLflow, and start a serving endpoint
- B . Log the model using MLflow during training, directly register the model to Unity Catalog using the MLflow API, and start a serving endpoint
- C . Save the model along with its dependencies in a local directory, build the Docker image, and run the Docker container
- D . Wrap the LLM’s prediction function into a Flask application and serve using Gunicorn
A Generative AI Engineer is designing an LLM-powered live sports commentary platform. The platform provides real-time updates and LLM-generated analyses for any users who would like to have live summaries, rather than reading a series of potentially outdated news articles.
Which tool below will give the platform access to real-time data for generating game analyses based on the latest game scores?
- A . DatabricksIQ
- B . Foundation Model APIs
- C . Feature Serving
- D . AutoML
A Generative AI Engineer has developed an LLM application to answer questions about internal company policies. The Generative AI Engineer must ensure that the application doesn’t hallucinate or leak confidential data.
Which approach should NOT be used to mitigate hallucination or confidential data leakage?
- A . Add guardrails to filter outputs from the LLM before it is shown to the user
- B . Fine-tune the model on your data, hoping it will learn what is appropriate and not
- C . Limit the data available based on the user’s access level
- D . Use a strong system prompt to ensure the model aligns with your needs.
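Two of the mitigations listed above, filtering LLM output through a guardrail and limiting data by the user's access level, can be sketched in a few lines. This is only an illustrative sketch: the regular expressions, the `min_level` field, and the function names are hypothetical and not part of any Databricks API.

```python
import re

# Hypothetical patterns for confidential strings; a real deployment would use
# its own rules or a dedicated classifier.
CONFIDENTIAL_PATTERNS = [
    re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),             # SSN-like numbers
    re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+"),      # email addresses
]


def filter_documents(documents, user_level):
    """Keep only documents whose required level is at or below the user's level."""
    return [doc for doc in documents if doc["min_level"] <= user_level]


def apply_guardrail(llm_output):
    """Redact text matching any confidential pattern before it reaches the user."""
    redacted = llm_output
    for pattern in CONFIDENTIAL_PATTERNS:
        redacted = pattern.sub("[REDACTED]", redacted)
    return redacted
```

The access filter runs before retrieval so restricted text never reaches the prompt, while the guardrail runs on the model's output as a last check.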
A Generative AI Engineer is designing a chatbot for a gaming company that aims to engage users on its platform while its users play online video games.
Which metric would help them increase user engagement and retention for their platform?
- A . Randomness
- B . Diversity of responses
- C . Lack of relevance
- D . Repetition of responses
A Generative AI Engineer has successfully ingested unstructured documents and chunked them by document sections. They would like to store the chunks in a Vector Search index. The current format of the dataframe has two columns: (i) the original document file name and (ii) an array of text chunks for each document.
What is the most performant way to store this dataframe?
- A . Split the data into train and test set, create a unique identifier for each document, then save to a Delta table
- B . Flatten the dataframe to one chunk per row, create a unique identifier for each row, and save to a Delta table
- C . First create a unique identifier for each document, then save to a Delta table
- D . Store each chunk as an independent JSON file in Unity Catalog Volume. For each JSON file, the key is the document section name and the value is the array of text chunks for that section
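The flattening step described in option B can be illustrated in plain Python. This is a sketch of the data shape only: on Databricks the same reshaping would typically be done on a Spark dataframe (for example with the `explode` function) before writing to a Delta table. The column names and the hash-based identifier are assumptions made for illustration.

```python
import hashlib


def flatten_chunks(rows):
    """Turn (file_name, [chunk, ...]) rows into one record per chunk with a unique id."""
    flat = []
    for file_name, chunks in rows:
        for position, text in enumerate(chunks):
            # Deterministic id built from file name and position, so re-running
            # the job yields the same key for the same chunk.
            key = f"{file_name}:{position}".encode("utf-8")
            chunk_id = hashlib.sha256(key).hexdigest()[:16]
            flat.append(
                {
                    "chunk_id": chunk_id,
                    "file_name": file_name,
                    "chunk_index": position,
                    "chunk_text": text,
                }
            )
    return flat
```

Each output record holds a single chunk, so every chunk can be embedded and retrieved on its own and the identifier can serve as the row's key.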
What is the most suitable library for building a multi-step LLM-based workflow?
- A . Pandas
- B . TensorFlow
- C . PySpark
- D . LangChain
A Generative AI Engineer interfaces with an LLM with prompt/response behavior that has been trained on customer calls inquiring about product availability. The LLM is designed to output only the term “In Stock” if the product is available or only the term “Out of Stock” if not.
Which prompt will allow the engineer to obtain the correct call classification labels?
- A . Respond with “In Stock” if the customer asks for a product.
- B . You will be given a customer call transcript where the customer asks about product availability. The outputs are either “In Stock” or “Out of Stock”. Format the output in JSON, for example: {“call_id”: “123”, “label”: “In Stock”}.
- C . Respond with “Out of Stock” if the customer asks for a product.
- D . You will be given a customer call transcript where the customer inquires about product availability. Respond with “In Stock” if the product is available or “Out of Stock” if not.
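A prompt of the kind shown in option D can be paired with a small check that the model returned one of the two allowed labels. The sketch below is illustrative: the `Transcript:` suffix, the function names, and the validation step are assumptions and are not part of the question.

```python
ALLOWED_LABELS = ("In Stock", "Out of Stock")

# Instruction text taken from option D; the "Transcript:" section is an
# assumed way of attaching the call transcript.
PROMPT_TEMPLATE = (
    "You will be given a customer call transcript where the customer inquires "
    "about product availability. Respond with \"In Stock\" if the product is "
    "available or \"Out of Stock\" if not.\n\nTranscript:\n{transcript}"
)


def build_prompt(transcript):
    """Insert the call transcript into the classification prompt."""
    return PROMPT_TEMPLATE.format(transcript=transcript)


def parse_label(response):
    """Strip whitespace and quotes, then reject anything outside the two labels."""
    cleaned = response.strip().strip('"“”')
    if cleaned not in ALLOWED_LABELS:
        raise ValueError(f"Unexpected label: {response!r}")
    return cleaned
```

Validating the response keeps free-form text from being stored as a classification label.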
A Generative AI Engineer is tasked with improving the RAG quality by addressing its inflammatory outputs.
Which action would be most effective in mitigating the problem of offensive text outputs?
- A . Increase the frequency of upstream data updates
- B . Inform the user of the expected RAG behavior
- C . Restrict access to the data sources to a limited number of users
- D . Curate upstream data properly that includes manual review before it is fed into the RAG system
A Generative AI Engineer is tasked with developing a RAG application that will help a small internal group of experts at their company answer specific questions, augmented by an internal knowledge base. They want the best possible quality in the answers, and neither latency nor throughput is a huge concern given that the user group is small and they’re willing to wait for the best answer. The topics are sensitive in nature and the data is highly confidential and so, due to regulatory requirements, none of the information is allowed to be transmitted to third parties.
Which model meets all the Generative AI Engineer’s needs in this situation?
- A . Dolly 1.5B
- B . OpenAI GPT-4
- C . BGE-large
- D . Llama2-70B
A Generative AI Engineer is developing a patient-facing, healthcare-focused chatbot. If the patient’s question is not a medical emergency, the chatbot should solicit more information from the patient to pass to the doctor’s office and suggest a few relevant pre-approved medical articles for reading. If the patient’s question is urgent, the chatbot should direct the patient to call their local emergency services.
Given the following user input:
“I have been experiencing severe headaches and dizziness for the past two days.”
Which response is most appropriate for the chatbot to generate?
- A . Here are a few relevant articles for your browsing. Let me know if you have questions after reading them.
- B . Please call your local emergency services.
- C . Headaches can be tough. Hope you feel better soon!
- D . Please provide your age, recent activities, and any other symptoms you have noticed along with your headaches and dizziness.