macop.callbacks.policies
UCB policy Checkpoint class implementation
Classes
UCBCheckpoint: UCB checkpoint used to load previously saved Upper Confidence Bound (UCB) data and resume from that checkpoint. The operators must be the same as those used during the previous run (see macop.policies.reinforcement.UCBPolicy for more details).
class macop.callbacks.policies.UCBCheckpoint(every, filepath)
UCB checkpoint used to load previously saved Upper Confidence Bound (UCB) data and resume from that checkpoint. The operators must be the same as those used during the previous run (see macop.policies.reinforcement.UCBPolicy for more details).
algo: {Algorithm} – main algorithm instance reference
every: {int} – checkpoint frequency used (based on number of evaluations)
filepath: {str} – file path where checkpoints will be saved
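As a rough illustration of how the `every` and `filepath` attributes might interact, here is a minimal, self-contained sketch. It does not import macop: the `SketchUCBCheckpoint` class, its `run` and `load` methods, and the two-row CSV layout are all assumptions made for this example, not macop's actual implementation.

```python
import csv
import os
import tempfile


class SketchUCBCheckpoint:
    """Hypothetical stand-in for a UCB checkpoint (not macop's class)."""

    def __init__(self, every, filepath):
        self.every = every          # save once every `every` evaluations
        self.filepath = filepath    # where the checkpoint file is written

    def run(self, n_evaluations, rewards, occurrences):
        """Write UCB statistics if n_evaluations is a multiple of `every`."""
        if n_evaluations % self.every != 0:
            return False
        with open(self.filepath, "w", newline="") as f:
            writer = csv.writer(f)
            writer.writerow(rewards)       # one reward per operator
            writer.writerow(occurrences)   # one usage count per operator
        return True

    def load(self, n_operators):
        """Return (rewards, occurrences), or None if no checkpoint exists."""
        if not os.path.exists(self.filepath):
            return None
        with open(self.filepath, newline="") as f:
            rows = list(csv.reader(f))
        rewards = [float(x) for x in rows[0]]
        occurrences = [int(x) for x in rows[1]]
        # Only the operator count is checked here; the sketch cannot verify
        # that the operators themselves are the same as in the previous run.
        if len(rewards) != n_operators or len(occurrences) != n_operators:
            raise ValueError("checkpoint does not match the current operators")
        return rewards, occurrences


# Usage with two hypothetical operators and made-up statistics.
path = os.path.join(tempfile.mkdtemp(), "ucb_checkpoint.csv")
checkpoint = SketchUCBCheckpoint(every=5, filepath=path)

saved_early = checkpoint.run(3, [0.1, 0.4], [2, 1])   # 3 is not a multiple of 5
saved = checkpoint.run(5, [0.25, 0.5], [3, 2])        # written to `path`
restored = checkpoint.load(n_operators=2)
```

The length check in `load` reflects the requirement that the operators match the previous run: statistics stored per operator are only meaningful if the same operators are supplied again.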