Knowledge check

1.

What is the primary reason for organizing agent optimization experiments into separate Git branches?

To enable parallel development by multiple team members simultaneously

To isolate specific changes and attribute performance differences to individual modifications

To comply with regulatory requirements for version control

2.

Adventure Works evaluates their Trail Guide Agent using Intent Resolution scores. Three evaluators score the same response as 5, 3, and 2. What does this indicate?

The response quality is inconsistent and needs improvement

The evaluation rubric lacks sufficient detail and evaluators need calibration training

The average score of 3.3 indicates the response meets the minimum quality threshold

3.

Your experiment shows GPT-4 mini reduces costs by 75% while maintaining an average quality score of 4.1 (threshold: 4.2) and improving response time from 32 to 18 seconds. What should you do?

Immediately deploy GPT-4 mini to production since it meets two of three success criteria

Document the trade-offs and seek business stakeholder input on whether cost savings justify the quality reduction

Reject GPT-4 mini and continue using GPT-4 without further testing

Check your knowledge

Feedback