45a1297 | Matthias Plappert | 27 July 2017, 02:13:45 UTC | Final finishing touches | 27 July 2017, 02:13:45 UTC |
6964729 | Matthias Plappert | 25 July 2017, 23:54:32 UTC | Update documentation | 25 July 2017, 23:54:32 UTC |
13c4107 | Matthias Plappert | 25 July 2017, 20:42:15 UTC | Export param noise | 25 July 2017, 20:42:54 UTC |
df82a15 | Peter Welinder | 23 July 2017, 16:58:10 UTC | Fix broken links in DQN readme | 23 July 2017, 16:58:10 UTC |
5dc0062 | Jonas Schneider | 20 July 2017, 16:00:24 UTC | readme fiddling | 20 July 2017, 16:00:24 UTC |
79b4a8a | John Schulman | 20 July 2017, 15:55:43 UTC | Merge pull request #60 from openai/ppo-trpo ppo and trpo | 20 July 2017, 15:55:43 UTC |
da99706 | John Schulman | 20 July 2017, 15:52:35 UTC | ppo and trpo | 20 July 2017, 15:52:35 UTC |
80f94f8 | Szymon Sidor | 12 July 2017, 21:48:00 UTC | bump version | 12 July 2017, 21:48:05 UTC |
2b1b437 | Szymon Sidor | 12 July 2017, 21:42:36 UTC | Update simple.py | 12 July 2017, 21:42:36 UTC |
04cd0dc | Szymon Sidor | 12 July 2017, 21:37:28 UTC | Merge pull request #52 from farbeiza/patch-1 Effectively apply weights from the replay buffer | 12 July 2017, 21:37:28 UTC |
248aad1 | Szymon Sidor | 12 July 2017, 21:32:24 UTC | Merge pull request #39 from mirceamironenco/master Fix TF graph variables deprecation | 12 July 2017, 21:32:24 UTC |
d76cd12 | Fernando Arbeiza | 11 July 2017, 09:09:51 UTC | Effectively apply weights from the replay buffer. It seems that the weights retrieved from the replay buffer are not applied when training the model. Is there any reason for that or am I missing something? In any case, I have added a parameter in order for them to be used; just in case it is useful. | 11 July 2017, 09:09:51 UTC |
91b1085 | MironencoMircea | 28 June 2017, 13:48:45 UTC | Fixed TF graph variables deprecation | 28 June 2017, 13:48:45 UTC |
0778e9f | Szymon Sidor | 24 June 2017, 00:05:25 UTC | Merge pull request #28 from zach-nervana/patch-1 remove unnecessary initialization of variable resized_screen | 24 June 2017, 00:05:25 UTC |
59c7887 | Szymon Sidor | 24 June 2017, 00:02:05 UTC | Merge pull request #26 from LinZichuan/master Update setup.py | 24 June 2017, 00:02:05 UTC |
3d235ae | Szymon Sidor | 23 June 2017, 23:59:55 UTC | Merge pull request #33 from cxxgtxy/master Fix README since BreakOut pretrained model doesn't match the correct … | 23 June 2017, 23:59:55 UTC |
5e73387 | cxx | 16 June 2017, 07:38:42 UTC | Fix README since BreakOut pretrained model doesn't match the correct tensor shape. Therefore, Pong is used instead. | 16 June 2017, 07:38:42 UTC |
ec38bf4 | Zach Dwiel | 09 June 2017, 12:53:10 UTC | remove unnecessary initialization of variable resized_screen | 09 June 2017, 12:53:10 UTC |
ef1a240 | Zichuan Lin | 07 June 2017, 09:29:38 UTC | Update setup.py | 07 June 2017, 09:29:38 UTC |
184440f | Szymon Sidor | 04 June 2017, 07:41:34 UTC | Merge pull request #22 from ngc92/doc_fixes docstring and comment fixes | 04 June 2017, 07:41:34 UTC |
fba0ac3 | Szymon Sidor | 04 June 2017, 07:40:58 UTC | Merge pull request #15 from tiagosgc/patch-1 Update README.md | 04 June 2017, 07:40:58 UTC |
584261a | Szymon Sidor | 04 June 2017, 07:40:42 UTC | Merge pull request #14 from quanvuong/master Consistent initial type (float) for episode_rewards | 04 June 2017, 07:40:42 UTC |
9c10c2f | Szymon Sidor | 04 June 2017, 07:40:31 UTC | Merge pull request #13 from ppwwyyxx/patch-1 Update setup.py | 04 June 2017, 07:40:31 UTC |
0291948 | ngc92 | 01 June 2017, 23:43:51 UTC | docstring and comment fixes | 01 June 2017, 23:43:51 UTC |
1f3c3e3 | Tiago Carvalho | 31 May 2017, 11:14:28 UTC | Update README.md | 31 May 2017, 11:14:28 UTC |
86054f7 | Quan Vuong | 30 May 2017, 03:49:25 UTC | Consistent initial type (float) for episode_rewards | 30 May 2017, 03:49:25 UTC |
709c327 | Yuxin Wu | 30 May 2017, 02:39:25 UTC | Update setup.py: `PongNoFrameskip-v4` seems to require `gym>=0.9.1` | 30 May 2017, 02:39:25 UTC |
fc2bbed | Szymon Sidor | 28 May 2017, 19:56:46 UTC | Merge pull request #11 from yenchenlin/fix-typo Fix typos | 28 May 2017, 19:56:46 UTC |
4fd1d21 | YenChenLin | 28 May 2017, 17:13:47 UTC | Fix typo | 28 May 2017, 17:13:47 UTC |
5c80997 | Szymon Sidor | 26 May 2017, 23:24:57 UTC | Merge pull request #9 from AAorris/patch-1 tf_util: spelling / docstring updates | 26 May 2017, 23:24:57 UTC |
2361220 | Aaron Morris | 26 May 2017, 05:51:14 UTC | tf_util: spelling / docstring updates | 26 May 2017, 05:53:09 UTC |
7327a15 | Szymon Sidor | 25 May 2017, 21:40:26 UTC | update Atari envs to v4 and warn Python 2 users. | 25 May 2017, 21:53:29 UTC |
0071b85 | Szymon Sidor | 24 May 2017, 21:55:54 UTC | Merge pull request #2 from omoindrot/patch-1 Correct path to script "download_model" | 24 May 2017, 21:55:54 UTC |
d2c51f5 | Olivier Moindrot | 24 May 2017, 20:13:30 UTC | Correct path to script "download_model": `python -m baselines.deepq.experiments.download_model` becomes `python -m baselines.deepq.experiments.atari.download_model` | 24 May 2017, 20:13:30 UTC |
958810e | Szymon Sidor | 17 May 2017, 21:41:46 UTC | Initial commit | 24 May 2017, 09:34:20 UTC |