Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Not working? #4

Open
pmarcin92 opened this issue Dec 29, 2016 · 9 comments
Open

Not working? #4

pmarcin92 opened this issue Dec 29, 2016 · 9 comments

Comments

@pmarcin92
Copy link

pmarcin92 commented Dec 29, 2016

Did anyone try to learn it so the agent really can play pong? I tried to learn it for over 30h on Tesla K80 and it doesn't look good at all.
I have also once concern about saving and restoring the learned weights. I modified the code to save the session once every 100000 iterations and I restore it like that:

    saver = tf.train.Saver()
    sess.run(tf.initialize_all_variables())
    saver = tf.train.import_meta_graph('pong-dqn-1300000.meta')
    saver.restore(sess, tf.train.latest_checkpoint('./'))

Is it me doing something wrong or there is a bug somewhere in the code preventing it from learn the pong?

@addisonhuddy
Copy link

@piorunm I had similar results when running locally. After about 5min pygame gets really slow. After letter it train for another couple of hours, no improvement.

@anthdm
Copy link

anthdm commented Apr 29, 2017

This is not working at all.

@wh33ler
Copy link

wh33ler commented May 2, 2017

The original code has some issues... have a look at a working version here
https://github.com/wh33ler/QNet_Pong

@j0el
Copy link

j0el commented May 2, 2017

Thanks, this is very helpful. Any chance you could push your model? 700K iterations take a while on my machine.

@wh33ler
Copy link

wh33ler commented May 2, 2017

sure why not. A added my current checkpoint of 975k steps and added a USE_MODEL Mode which ignores the training aspect.

@to-bee
Copy link

to-bee commented Nov 9, 2017

Thanks @wh33ler Very nice improvement!

@john-theo
Copy link

Hi, @wh33ler is it normal that at 300k timesteps, the ai player moves almost the same as the first 3k steps? I cloned your repo, make a few irrelevant changes(such as rename variables), and I got stupid results. You disabled issue functionality in your repo so I have to question here, LOL.
image

@wh33ler
Copy link

wh33ler commented Aug 2, 2018

I am not sure what exactly you mean. It has been a while since I looked at it. But it might take some time until the AI gets it :)

@Traven16
Copy link

I think on line 22 you have to set USE_MODEL = False for the net to actually train.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

8 participants