Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

some questions about code in the frozen lake experiment #1

Open
hlhang9527 opened this issue Mar 23, 2021 · 0 comments
Open

some questions about code in the frozen lake experiment #1

hlhang9527 opened this issue Mar 23, 2021 · 0 comments

Comments

@hlhang9527
Copy link

hlhang9527 commented Mar 23, 2021

Hello Professor,
Recently, I've been studying your paper and reproducing your code, and I have some question as follows:

  • In the evaluate_single_switch_policy(policy, teacher_env,
    student_final_env, timesteps=10000)
    function of teacher_learning, does the params timesteps means the N_s interaction units in paper?
  • In the evaluate_single_switch_policy() function of frozen_single_switch_utils, does the lines 52-58 implement the get_student() of ALgorithm 1 CISR?
  • The threshold is pre-defined in the teacher_learning function, how do we set the range of domain as(-0.5, 5.5) if we don't know the final reward in advance?

I hope you can give me some advice, thank you very much!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant