Hello #241
Open
simplephi wants to merge 91 commits into revert-97-patch-1 from master
Revert "fix markdown links with spaces"
…dated with cpu/mem usage
DQN copy_model_parameters memory leak
env.P[s][a] is not a tuple but rather a list of tuples (as probability in that tuple wouldn't make sense otherwise).
Update description of env.P[s][a]
bind worker within lambda to avoid running worker twice
… 10-19 will clash in scope with worker 1, and so on.
worker name scope should have trailing slash
…s. Pointed out in #117
Fixes for issues and for compatibility with TensorFlow v 1.0+
…description of env.nS and env.nA
… to Gamma as in the book.
Markdown doesn't allow spaces in links to files, so I replaced the spaces with "%20". Now the links correctly display on GitHub.
Function description update for clarity.
Fix links in all the `README.md`s
Lambda to Gamma. Updated Readme.
Updated link to Sutton's book. Changed Lambda to Gamma in FA
…nk with cached version.
Added CS885 Reinforcement Learning course from University of Waterloo. One of the most comprehensive RL courses.
added link to CS885
* Added UC Berkeley class resources * Added OpenAI Spinning Up resources
updates to README.md
cleaning up lib/envs/gridworld.py
…on YouTube. Lectures were recorded at UCL in 2018. The RL lecturer is Hado van Hasselt from DeepMind
fixed shape descriptions for neural network input layer
Add link to Advanced Deep Learning & Reinforcement Learning lectures.
Updated links to new version of Sutton and Barto's book
It was crashing on my Win 10 PC; I found a fix in openai/gym#1056 and applied it.
Q-Learning docstring improvements.
Update CliffWalk REINFORCE with Baseline Solution.ipynb
Fix rendering crash on Win 10
A test pull request (corrected a few typos)
Update README.md
Fixed compatibility with current version of OpenAI gym without DiscreteEnv class
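The commit "bind worker within lambda to avoid running worker twice" points at a general Python behaviour: a lambda created inside a loop looks up the loop variable when it is called, not when it is defined. The following is a minimal sketch of that pitfall and the default-argument fix. The `Worker` class and `run_all` helper are hypothetical stand-ins, not the repository's code, and the threads are started only after the loop here so that the effect is deterministic.

```python
import threading


class Worker:
    """Hypothetical stand-in for a worker; not the repository's class."""

    def __init__(self, name, log):
        self.name = name
        self.log = log

    def run(self):
        # Record which worker actually ran.
        self.log.append(self.name)


def run_all(bind):
    """Run three workers in threads; return the sorted names that ran."""
    log = []
    workers = [Worker(f"worker_{i}", log) for i in range(3)]
    threads = []
    for worker in workers:
        if bind:
            # The default argument captures the current worker at definition time.
            target = lambda worker=worker: worker.run()
        else:
            # Late binding: `worker` is looked up when the thread body executes.
            target = lambda: worker.run()
        threads.append(threading.Thread(target=target))
    # Starting after the loop means unbound lambdas all see the last worker.
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return sorted(log)
```

With `bind=True` each worker runs once. With `bind=False` every thread resolves `worker` to the last element, so one worker runs repeatedly while the others never run.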
I want to try to optimize and learn the code.