Not working? #4

pmarcin92 · 2016-12-29T07:32:20Z

Did anyone try to learn it so the agent really can play pong? I tried to learn it for over 30h on Tesla K80 and it doesn't look good at all.
I have also once concern about saving and restoring the learned weights. I modified the code to save the session once every 100000 iterations and I restore it like that:

    saver = tf.train.Saver()
    sess.run(tf.initialize_all_variables())
    saver = tf.train.import_meta_graph('pong-dqn-1300000.meta')
    saver.restore(sess, tf.train.latest_checkpoint('./'))

Is it me doing something wrong or there is a bug somewhere in the code preventing it from learn the pong?

addisonhuddy · 2017-01-05T14:00:55Z

@piorunm I had similar results when running locally. After about 5min pygame gets really slow. After letter it train for another couple of hours, no improvement.

anthdm · 2017-04-29T22:02:23Z

This is not working at all.

wh33ler · 2017-05-02T06:41:57Z

The original code has some issues... have a look at a working version here
https://github.com/wh33ler/QNet_Pong

j0el · 2017-05-02T15:13:51Z

Thanks, this is very helpful. Any chance you could push your model? 700K iterations take a while on my machine.

wh33ler · 2017-05-02T17:25:23Z

sure why not. A added my current checkpoint of 975k steps and added a USE_MODEL Mode which ignores the training aspect.

to-bee · 2017-11-09T14:11:35Z

Thanks @wh33ler Very nice improvement!

john-theo · 2018-08-01T11:07:59Z

Hi, @wh33ler is it normal that at 300k timesteps, the ai player moves almost the same as the first 3k steps? I cloned your repo, make a few irrelevant changes(such as rename variables), and I got stupid results. You disabled issue functionality in your repo so I have to question here, LOL.

wh33ler · 2018-08-02T16:15:37Z

I am not sure what exactly you mean. It has been a while since I looked at it. But it might take some time until the AI gets it :)

Traven16 · 2018-09-22T17:34:35Z

I think on line 22 you have to set USE_MODEL = False for the net to actually train.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Not working? #4

Not working? #4

pmarcin92 commented Dec 29, 2016 •

edited

Loading

addisonhuddy commented Jan 5, 2017

anthdm commented Apr 29, 2017

wh33ler commented May 2, 2017 •

edited

Loading

j0el commented May 2, 2017

wh33ler commented May 2, 2017

to-bee commented Nov 9, 2017

john-theo commented Aug 1, 2018

wh33ler commented Aug 2, 2018

Traven16 commented Sep 22, 2018

Not working? #4

Not working? #4

Comments

pmarcin92 commented Dec 29, 2016 • edited Loading

addisonhuddy commented Jan 5, 2017

anthdm commented Apr 29, 2017

wh33ler commented May 2, 2017 • edited Loading

j0el commented May 2, 2017

wh33ler commented May 2, 2017

to-bee commented Nov 9, 2017

john-theo commented Aug 1, 2018

wh33ler commented Aug 2, 2018

Traven16 commented Sep 22, 2018

pmarcin92 commented Dec 29, 2016 •

edited

Loading

wh33ler commented May 2, 2017 •

edited

Loading