macop.operators.policies.UCBPolicy¶
Policy class implementation which is used for selecting operator using Upper Confidence Bound
Classes
|
UCB policy class which is used for applying UCB strategy when selecting and applying operator |
-
class
macop.operators.policies.UCBPolicy.
UCBPolicy
(operators, C=100.0, exp_rate=0.5)[source]¶ UCB policy class which is used for applying UCB strategy when selecting and applying operator
-
operators
¶ {[Operator]} – list of selected operators for the algorithm
-
C
¶ {float} – tradeoff between EvE parameter for UCB
-
exp_rate
¶ {float} – exploration rate (probability to choose randomly next operator)
-
rewards
¶ {[float]} – list of summed rewards obtained for each operator
-
occurrences
¶ {[int]} – number of use (selected) of each operator
-