EvaluatorGenerationInputs interface
Caller-supplied inputs for an evaluator generation job.
Properties
| evaluator_description | Optional human-friendly description for the resulting evaluator. Surfaced as |
| evaluator_display_name | Optional human-friendly display name for the resulting evaluator. Surfaced as |
| evaluator_name | The evaluator name (immutable identifier). 1-256 characters; allowed characters are ASCII letters, digits, underscore ( |
| model | The LLM model to use for rubric generation (e.g., 'gpt-4o'). Required — users must provide their own model rather than relying on service-owned capacity. |
| sources | Source materials for generation — agent descriptions, prompts, traces, or datasets. Each entry is an |
Property Details
evaluator_description
Optional human-friendly description for the resulting evaluator. Surfaced as EvaluatorVersion.description on the persisted evaluator. Typically collected from the UI alongside evaluator_display_name. The evaluator_ prefix disambiguates this from any other description fields on related models.
evaluator_description?: string
Property Value
string
evaluator_display_name
Optional human-friendly display name for the resulting evaluator. Surfaced as EvaluatorVersion.display_name on the persisted evaluator. When omitted, the service uses evaluator_name as the display name. The evaluator_ prefix disambiguates this from the immutable evaluator_name identifier.
evaluator_display_name?: string
Property Value
string
evaluator_name
The evaluator name (immutable identifier). 1-256 characters; allowed characters are ASCII letters, digits, underscore (_), period (.), tilde (~), and hyphen (-). The prefix builtin. is reserved for system-managed evaluators and is rejected by the service. If an evaluator with this name already exists in the project (and is rubric-subtype), the service creates a new version under the same name and uses the prior version's dimensions as context for incremental improvement (foundation of the post-//build adaptive loop). Old versions remain queryable via get_version(name, version). If the existing evaluator is not a rubric-subtype evaluator (built-in, prompt-based, code-based), the request is rejected with 400 Bad Request.
evaluator_name: string
Property Value
string
model
The LLM model to use for rubric generation (e.g., 'gpt-4o'). Required — users must provide their own model rather than relying on service-owned capacity.
model: string
Property Value
string
sources
Source materials for generation — agent descriptions, prompts, traces, or datasets. Each entry is an EvaluatorGenerationJobSource variant discriminated by type.
sources: EvaluatorGenerationJobSourceUnion[]