Choose the best response for each of the questions below.
What is the primary purpose of evaluating generative AI applications?
To increase the speed of AI model training.
To validate the performance, reliability, and efficacy of AI systems.
To reduce the cost of AI development.
To replace human evaluators with automated systems.
Which of the following isn't a characteristic of good evaluation data?
Diversity.
Representativeness.
High quality.
Homogeneity.
What is a key reason why strong AI performance in controlled evaluations might not translate to real-world success?
The AI's metrics are too complex to understand.
Real-world scenarios have fewer data variations than controlled environments.
Simulations often use idealized conditions that differ from real-world environments.
Edge cases are always included in simulations.