Commit 57a325e

changed learner type for clarity (#83)
Co-authored-by: HU Xinqiao <[email protected]>
1 parent 0c88fd1 commit 57a325e

notebooks/Chapter02_Ten_Armed_Testbed.jl

Lines changed: 2 additions & 4 deletions
@@ -98,16 +98,14 @@ function bandit_testbed(
     env = MultiArmBanditsEnv(;true_reward=true_reward)
     agent = Agent(
         policy=QBasedPolicy(
-            learner = TDLearner(
+            learner = MonteCarloLearner(
                 approximator = TabularQApproximator(
                     n_state=length(state_space(env)),
                     n_action=length(action_space(env)),
                     init=init,
                     opt = opt
                 ),
-                γ = 1.0,
-                method=:SARSA,
-                n = 0,
+                γ = 1.0
             ),
             explorer = explorer
         ),
