Defines an early termination policy based on slack criteria, together with the evaluation frequency and the delay before the first evaluation.
BanditPolicy(*, delay_evaluation: int = 0, evaluation_interval: int = 0, slack_amount: float = 0, slack_factor: float = 0)
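delay_evaluation: Number of intervals by which to delay the first evaluation. Defaults to 0.
evaluation_interval: Interval (number of runs) between policy evaluations. Defaults to 0.
slack_amount: Absolute distance allowed from the best performing run. Defaults to 0.
slack_factor: Ratio of the allowed distance from the best performing run. Defaults to 0.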
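The two slack parameters can be easier to compare with a small worked example. The sketch below is not part of the SDK: `slack_threshold` and `should_terminate` are hypothetical helpers that illustrate how a cutoff could be derived from the best run's metric. It assumes the primary metric is being maximized, that a ratio-based cutoff is computed as best / (1 + slack_factor), and that only one of the two slack values is supplied at a time.

```python
def slack_threshold(best_metric, slack_factor=0.0, slack_amount=0.0):
    """Return the cutoff below which a run would be cancelled.

    Hypothetical helper, assuming a metric that is being maximized.
    """
    # Assumption: the two slack settings are alternatives, not combined.
    if slack_factor and slack_amount:
        raise ValueError("specify only one of slack_factor or slack_amount")
    if slack_factor:
        # Ratio-based slack: cutoff scales with the best metric.
        return best_metric / (1 + slack_factor)
    # Absolute slack: cutoff is a fixed distance below the best metric.
    return best_metric - slack_amount


def should_terminate(run_metric, best_metric, **slack):
    """True if the run falls outside the allowed slack from the best run."""
    return run_metric < slack_threshold(best_metric, **slack)


best = 0.8  # best primary metric reported so far (illustrative value)
print(should_terminate(0.65, best, slack_factor=0.2))
print(should_terminate(0.65, best, slack_amount=0.2))
```

With a best metric of 0.8, a slack_factor of 0.2 puts the cutoff near 0.667, while a slack_amount of 0.2 puts it near 0.6, so the same run at 0.65 is cancelled under the first setting and kept under the second.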