Module assessment

1.

What is the primary purpose of evaluating a Large Language Model (LLM)?

To improve its computational efficiency.

To assess its accuracy and performance on specific tasks.

To increase its training data size.

2.

In the context of evaluating language models, what does perplexity measure?

The size of the training dataset.

The diversity of generated text.

The uncertainty of the model in predicting the next word.

3.

When you evaluate a large language model (LLM) for bias, what is a common approach?

Measuring the model's training time

Analyzing the model's outputs for harmful stereotypes

Counting the number of model parameters

Feedback