ea68f3b | Peter Zhokhov | 10 August 2018, 16:46:43 UTC | dummy commit to RUN BENCHMARKS | 10 August 2018, 16:46:43 UTC |
ca721a4 | Peter Zhokhov | 10 August 2018, 16:45:50 UTC | Merge branch 'observation-dtype' of github.com:openai/baselines into peterz_benchmarks | 10 August 2018, 16:45:50 UTC |
72f3572 | Peter Zhokhov | 08 August 2018, 23:24:39 UTC | fixed syntax in conv_only RUN BENCHMARKS | 08 August 2018, 23:24:39 UTC |
b9cd941 | Peter Zhokhov | 08 August 2018, 22:59:59 UTC | dummy commit to RUN BENCHMARKS | 08 August 2018, 22:59:59 UTC |
0899b71 | Peter Zhokhov | 08 August 2018, 22:15:03 UTC | scale the images in conv_only RUN BENCHMARKS | 08 August 2018, 22:15:03 UTC |
cc8c954 | Peter Zhokhov | 08 August 2018, 22:10:39 UTC | dummy commit to RUN BENCHMARKS | 08 August 2018, 22:10:39 UTC |
cb32522 | Peter Zhokhov | 08 August 2018, 22:10:00 UTC | enable all benchmarks | 08 August 2018, 22:10:00 UTC |
1e40ec2 | Peter Zhokhov | 08 August 2018, 17:45:18 UTC | dummy commit to RUN BENCHMARKS | 08 August 2018, 17:45:18 UTC |
701a36c | Peter Zhokhov | 08 August 2018, 17:44:58 UTC | added a note in README about TfRunningMeanStd and serialization of VecNormalize | 08 August 2018, 17:44:58 UTC |
5a7f984 | Peter Zhokhov | 03 August 2018, 20:59:58 UTC | flake8 complaints | 03 August 2018, 20:59:58 UTC |
b63134e | Peter Zhokhov | 03 August 2018, 20:31:37 UTC | added acer runner (missing import) | 03 August 2018, 20:31:37 UTC |
db314cd | Peter Zhokhov | 03 August 2018, 18:47:36 UTC | Merge branch 'peterz_profile_vec_normalize' into peterz_migrate_rlalgs | 03 August 2018, 18:47:36 UTC |
b08c083 | Peter Zhokhov | 03 August 2018, 18:44:12 UTC | use VecNormalize with regular RunningMeanStd | 03 August 2018, 18:44:12 UTC |
bfbbe66 | Peter Zhokhov | 02 August 2018, 18:23:12 UTC | profiling wip | 02 August 2018, 18:23:12 UTC |
1c5c656 | Peter Zhokhov | 02 August 2018, 17:55:09 UTC | reverted VecNormalize to use RunningMeanStd (no tf) | 02 August 2018, 17:55:09 UTC |
1fa8c58 | Peter Zhokhov | 02 August 2018, 17:54:07 UTC | reverted VecNormalize to use RunningMeanStd (no tf) | 02 August 2018, 17:54:07 UTC |
f6d1115 | Peter Zhokhov | 02 August 2018, 17:32:22 UTC | reverted running_mean_std to use property decorators for mean, var, count | 02 August 2018, 17:32:22 UTC |
f6d5a47 | Peter Zhokhov | 02 August 2018, 17:24:21 UTC | use ncpu=1 for mujoco sessions - gives a bit of a performance speedup | 02 August 2018, 17:24:21 UTC |
c2df27b | Peter Zhokhov | 02 August 2018, 16:41:41 UTC | non-tf normalization benchmark RUN BENCHMARKS | 02 August 2018, 16:41:41 UTC |
974c157 | Peter Zhokhov | 01 August 2018, 23:24:44 UTC | changed default ppo2 lr schedule to linear RUN BENCHMARKS | 01 August 2018, 23:24:44 UTC |
ad43fd9 | Peter Zhokhov | 01 August 2018, 23:15:59 UTC | add defaults | 01 August 2018, 23:15:59 UTC |
72c357c | Peter Zhokhov | 01 August 2018, 22:18:59 UTC | hardcode names of retro environments | 01 August 2018, 22:18:59 UTC |
e00e5ca | Peter Zhokhov | 01 August 2018, 21:56:08 UTC | run ppo_mpi benchmarks only RUN BENCHMARKS | 01 August 2018, 21:56:08 UTC |
705797f | Peter Zhokhov | 01 August 2018, 21:46:40 UTC | Merge branch 'peterz_migrate_rlalgs' into peterz_benchmarks | 01 August 2018, 21:46:40 UTC |
fcd84aa | Peter Zhokhov | 01 August 2018, 21:46:18 UTC | make_atari_env compatible with mpi | 01 August 2018, 21:46:18 UTC |
390b515 | Peter Zhokhov | 01 August 2018, 18:01:50 UTC | benchmarks on ppo2 only RUN BENCHMARKS | 01 August 2018, 18:01:50 UTC |
95104a3 | Peter Zhokhov | 01 August 2018, 17:50:29 UTC | Merge branch 'peterz_migrate_rlalgs' into peterz_benchmarks | 01 August 2018, 17:50:29 UTC |
3528f7b | Peter Zhokhov | 01 August 2018, 17:12:19 UTC | save all variables to make sure we save the vec_normalize normalization | 01 August 2018, 17:12:19 UTC |
151e480 | Peter Zhokhov | 31 July 2018, 23:25:12 UTC | flake8 complaints | 31 July 2018, 23:25:12 UTC |
92f3333 | Peter Zhokhov | 31 July 2018, 22:53:18 UTC | dummy commit to RUN BENCHMARKS | 31 July 2018, 22:53:18 UTC |
af729cf | Peter Zhokhov | 31 July 2018, 22:37:00 UTC | dummy commit to RUN BENCHMARKS | 31 July 2018, 22:37:00 UTC |
10f815f | Peter Zhokhov | 31 July 2018, 22:19:43 UTC | fixed import in vec_normalize | 31 July 2018, 22:19:43 UTC |
8c4adac | Peter Zhokhov | 31 July 2018, 21:45:55 UTC | running_mean_std uses tensorflow variables | 31 July 2018, 21:45:55 UTC |
2a93ea8 | Peter Zhokhov | 31 July 2018, 18:13:31 UTC | serialize variables as a dict, not as a list | 31 July 2018, 18:13:31 UTC |
9c48f9f | Peter Zhokhov | 31 July 2018, 17:23:43 UTC | very dummy commit to RUN BENCHMARKS | 31 July 2018, 17:23:43 UTC |
348cbb4 | Peter Zhokhov | 31 July 2018, 16:42:23 UTC | dummy commit to RUN BENCHMARKS | 31 July 2018, 16:42:23 UTC |
a1602ab | Peter Zhokhov | 31 July 2018, 00:51:16 UTC | dummy commit to RUN BENCHMARKS | 31 July 2018, 00:51:16 UTC |
e63e69b | Peter Zhokhov | 31 July 2018, 00:39:22 UTC | dummy commit to RUN BENCHMARKS | 31 July 2018, 00:39:22 UTC |
385e7e5 | Peter Zhokhov | 31 July 2018, 00:21:05 UTC | dummy commit to RUN BENCHMARKS | 31 July 2018, 00:21:05 UTC |
d112a2e | Peter Zhokhov | 30 July 2018, 23:58:31 UTC | added approximate humanoid reward with ppo2 into the README for reference | 30 July 2018, 23:58:31 UTC |
e662dd6 | Peter Zhokhov | 30 July 2018, 23:09:48 UTC | run.py can run algos from both baselines and rl_algs | 30 July 2018, 23:09:48 UTC |
efc6bff | Peter Zhokhov | 30 July 2018, 22:58:56 UTC | replaced atari_arg_parser with common_arg_parser | 30 July 2018, 22:58:56 UTC |
872181d | Peter Zhokhov | 30 July 2018, 22:49:48 UTC | re-exported rl_algs - fixed problems with serialization test and test_cartpole | 30 July 2018, 22:49:48 UTC |
628ddec | Peter Zhokhov | 30 July 2018, 19:15:46 UTC | re-exported rl_algs | 30 July 2018, 19:15:46 UTC |
83a4a4b | peter | 26 July 2018, 21:39:25 UTC | run slow tests | 26 July 2018, 21:39:25 UTC |
7edac38 | peter | 26 July 2018, 21:26:57 UTC | more stuff from rl-algs | 26 July 2018, 21:26:57 UTC |
a6dca44 | peter | 26 July 2018, 21:02:04 UTC | exported rl-algs | 26 July 2018, 21:02:04 UTC |
f272969 | Adam Gleave | 06 July 2018, 23:12:14 UTC | GAIL: bugfix in dataset loading (#447) * Fix silly typo * Replace ad-hoc function with NumPy code | 06 July 2018, 23:12:14 UTC |
622915c | Karl Cobbe | 12 June 2018, 21:48:39 UTC | fix dtype for wrapper observation spaces | 12 June 2018, 21:48:39 UTC |
a1d3c18 | Karl Cobbe | 11 June 2018, 20:35:47 UTC | fix dtype for wrapper observation spaces | 11 June 2018, 20:35:47 UTC |
a6b1bc7 | pzhokhov | 08 June 2018, 16:41:45 UTC | re-import internal; fix missing tile_images.py (#427) * import rl-algs from 2e3a166 commit * extra import of the baselines badge * exported commit with identity test * proper rng seeding in the test_identity * import internal * adding missing tile_images.py | 08 June 2018, 16:41:45 UTC |
36ee5d1 | pzhokhov | 06 June 2018, 18:39:13 UTC | Import internal changes (#422) * import rl-algs from 2e3a166 commit * extra import of the baselines badge * exported commit with identity test * proper rng seeding in the test_identity * import internal | 06 June 2018, 18:39:13 UTC |
24fe3d6 | pzhokhov | 21 May 2018, 22:24:00 UTC | Import internal repo (#409) * import rl-algs from 2e3a166 commit * extra import of the baselines badge * exported commit with identity test * proper rng seeding in the test_identity | 21 May 2018, 22:24:00 UTC |
9cb7ece | pzhokhov | 14 May 2018, 17:52:19 UTC | add opencv-python to the dependencies (#407) | 14 May 2018, 17:52:19 UTC |
9cf95a0 | pzhokhov | 03 May 2018, 16:43:28 UTC | setup travis ci build (#388) * simple .travis.yml file * added static syntax checks of common to .travis.yml * dockerizing the build * fix Dockerfile, adding build shield * cleaning up workdir in Dockerfile and .travis.yml * .travis.yml fixed common -> baselines/common for style check | 03 May 2018, 16:43:28 UTC |
8b78103 | pzhokhov | 03 May 2018, 01:42:48 UTC | put filters and running_stat files in common instead of acktr (#389) | 03 May 2018, 01:42:48 UTC |
69f25c6 | pzhokhov | 01 May 2018, 23:54:04 UTC | import internal repo (#385) | 01 May 2018, 23:54:04 UTC |
2b0283b | pzhokhov | 26 April 2018, 00:40:48 UTC | Readme.md detailed installation instructions (#377) * changes to README.md files with more detailed installation instructions * md-fying the changes better * link on the word homebrew in readme.md * typos in README.md * README.md * removed extra comma sign * removed sudo from brew command | 26 April 2018, 00:40:48 UTC |
1f8a03f | Matthias Plappert | 26 March 2018, 14:50:22 UTC | Update README | 26 March 2018, 14:50:22 UTC |
3cc7df0 | Matthias Plappert | 05 March 2018, 10:06:17 UTC | Minor fixes to HER release (#319) * Fix plotting script * Add warning if num_cpu = 1 | 05 March 2018, 10:06:17 UTC |
8b3a6c2 | Alex Nichol | 03 March 2018, 01:18:07 UTC | fix DummyVecEnv reusing buffers | 03 March 2018, 01:18:07 UTC |
569bd42 | Alex Nichol | 01 March 2018, 18:45:04 UTC | Merge pull request #308 from araffin/master Bug fix in saving ACER model | 01 March 2018, 18:45:04 UTC |
f49a9c3 | Daniel Ziegler | 01 March 2018, 17:00:34 UTC | Fix bug in DDPG parameter space noise adaptation (#306) The training loop used the rollout step variable `t` rather than the training step variable `t_train` to decide when to adapt the scale of the parameter space noise. | 01 March 2018, 17:00:34 UTC |
14f2f93 | Antonin RAFFIN | 01 March 2018, 09:24:14 UTC | Bug fix in saving ACER model | 01 March 2018, 09:24:14 UTC |
6bdf2f5 | Alex Nichol | 28 February 2018, 03:00:37 UTC | Merge pull request #132 from bhatiaabhinav/bug_fixes Bug fix in saving a2c model. | 28 February 2018, 03:00:37 UTC |
97be70d | Alex Nichol | 28 February 2018, 02:55:10 UTC | fixes for DummyVecEnv Fixes various problems running MuJoCo tasks. | 28 February 2018, 02:55:10 UTC |
b71152e | Matthias Plappert | 26 February 2018, 16:40:16 UTC | Adds support for Hindsight Experience Replay (HER) (#299) * Add Hindsight Experience Replay (HER) * Minor improvements | 26 February 2018, 16:40:16 UTC |
df2e846 | Christopher Hesse | 15 February 2018, 06:01:16 UTC | export: fix accidental rename | 15 February 2018, 06:01:16 UTC |
edb52c2 | Christopher Hesse | 15 February 2018, 05:42:22 UTC | export: Fix deepq param noise refactoring, remove atari experiments and azure dependency | 15 February 2018, 05:42:22 UTC |
98257ef | Andrei Kashin | 02 October 2017, 22:29:07 UTC | Flush temporary file before compressing it. We need to flush the buffer after `pickle.dump`, otherwise the resulting zip archive might be incomplete (reproducible, if the state consists of a single integer). | 06 February 2018, 15:04:44 UTC |
d9b3660 | Oleg Klimov | 05 February 2018, 20:25:05 UTC | comment about loading weights in ppo2 | 05 February 2018, 20:25:05 UTC |
2793971 | Oleg Klimov | 05 February 2018, 15:51:27 UTC | fix gail tf_util usage | 05 February 2018, 15:51:27 UTC |
16d7d23 | John Schulman | 03 February 2018, 07:04:53 UTC | Merge pull request #271 from simontudo/add-requirement-cloudpickle added cloudpickle to requirements | 03 February 2018, 07:04:53 UTC |
9175b77 | John Schulman | 03 February 2018, 07:03:51 UTC | Merge pull request #273 from simontudo/videorecorder-import updated videorecorder import | 03 February 2018, 07:03:51 UTC |
615870a | simontudo | 01 February 2018, 11:09:08 UTC | updated videorecorder import | 01 February 2018, 11:09:08 UTC |
7bd264e | simontudo | 31 January 2018, 09:43:17 UTC | added cloudpickle to requirements | 31 January 2018, 09:43:17 UTC |
8d03102 | John Schulman | 29 January 2018, 08:54:51 UTC | Merge pull request #265 from 20chase/patch-1 fix logger error for trpo_mpi | 29 January 2018, 08:54:51 UTC |
4a77855 | 20chase | 29 January 2018, 08:52:01 UTC | using mujoco_arg_parser as args remove origin parser | 29 January 2018, 08:52:01 UTC |
2e29b41 | John Schulman | 28 January 2018, 02:42:31 UTC | Merge pull request #268 from ei-grad/master Fix fc call in AcerLstmPolicy | 28 January 2018, 02:42:31 UTC |
634e37c | Andrew Grigorev | 27 January 2018, 20:18:02 UTC | Fix fc call in AcerLstmPolicy The `act` keyword was removed from baselines.a2c.utils.fc in commit 9fa8e1b. | 27 January 2018, 20:18:02 UTC |
452b548 | 20chase | 26 January 2018, 06:34:01 UTC | Merge branch 'master' into patch-1 | 26 January 2018, 06:34:01 UTC |
ebb8aff | John Schulman | 26 January 2018, 05:17:40 UTC | fix trpo_mpi bug where logstd wasn’t included | 26 January 2018, 05:17:40 UTC |
c9613b2 | John Schulman | 26 January 2018, 04:54:34 UTC | Merge pull request #259 from andrewliao11/openai_gail Add gail maintainer list | 26 January 2018, 04:54:34 UTC |
459f007 | John Schulman | 26 January 2018, 04:54:20 UTC | Merge pull request #260 from uidilr/master Add GAIL | 26 January 2018, 04:54:20 UTC |
9fa8e1b | John Schulman | 26 January 2018, 02:33:48 UTC | Lots of cleanups Fixes for new gym version Add @olegklimov and @unixpickle to authors list | 26 January 2018, 02:54:24 UTC |
ac2ea4f | 20chase | 25 January 2018, 14:09:00 UTC | fix logger error for MPI Can't run logger.configure() if rank != 0 | 25 January 2018, 14:09:00 UTC |
d8cce23 | Yusuke Nakata | 23 January 2018, 03:00:00 UTC | Add GAIL | 23 January 2018, 03:02:03 UTC |
0c207f0 | andrew | 22 January 2018, 06:13:01 UTC | fix typo | 22 January 2018, 06:13:01 UTC |
41d41fa | andrew | 22 January 2018, 06:12:03 UTC | add gail maintainer list | 22 January 2018, 06:12:03 UTC |
b5be53d | John Schulman | 22 January 2018, 01:30:20 UTC | Merge pull request #229 from andrewliao11/gail GAIL implementation | 22 January 2018, 01:30:20 UTC |
49c1a8e | Matthias Plappert | 16 January 2018, 18:24:30 UTC | Fix bug in parameter space noise DQN | 16 January 2018, 18:24:30 UTC |
e5a714b | andrew | 12 January 2018, 23:12:45 UTC | fix relative import | 12 January 2018, 23:12:45 UTC |
f9d1d33 | John Schulman | 12 January 2018, 19:05:29 UTC | remove mpirun from ppo2 instructions | 12 January 2018, 19:05:29 UTC |
8c90f67 | Alex Nichol | 15 December 2017, 23:54:43 UTC | don't list TensorFlow as a requirement fixes #146 A better (more involved) solution might be to check for a TensorFlow installation manually in setup.py and deal with that accordingly. | 15 December 2017, 23:54:43 UTC |
f22bee0 | Andrew | 13 December 2017, 03:03:42 UTC | Add files via upload | 13 December 2017, 03:03:42 UTC |
4acc71f | andrew | 13 December 2017, 02:58:57 UTC | add x, y, axis name | 13 December 2017, 02:58:57 UTC |
2f1b629 | andrew | 13 December 2017, 02:56:00 UTC | Merge branch 'gail' of https://github.com/andrewliao11/baselines into gail | 13 December 2017, 02:56:00 UTC |
00573cf | andrew | 13 December 2017, 02:54:03 UTC | add x, y axis name | 13 December 2017, 02:54:03 UTC |
cfa1236 | Andrew | 12 December 2017, 05:21:56 UTC | Update README.md | 12 December 2017, 05:21:56 UTC |
64288f9 | Andrew | 12 December 2017, 05:19:47 UTC | Update gail-result.md | 12 December 2017, 05:19:47 UTC |