Revision history - refs/heads/old_acktr_cont - origin: https://github.com/openai/baselines

visit type:

Revision	Author	Date	Message	Commit Date
0a40206	uronce-cc	31 August 2018, 16:02:18 UTC	ncpu needs to be an integer. (#558)	31 August 2018, 16:02:18 UTC
1937826	Alfredo Canziani	31 August 2018, 00:21:25 UTC	Fix alien syntax and apply PEP 8 style (#554)	31 August 2018, 00:21:25 UTC
b29c802	pzhokhov	30 August 2018, 20:40:40 UTC	remove saving model as a pickle file in ppo2 (tries to pull environment in; bad idea - may need to use constructor argument pickling or somesuch if at all necessary) (#69)	30 August 2018, 20:41:38 UTC
4ec308a	Peter Zhokhov	30 August 2018, 17:27:18 UTC	fixed syntax	30 August 2018, 20:41:38 UTC
3bbf3f3	Peter Zhokhov	30 August 2018, 16:40:42 UTC	allow_early_resets=True in create_vec_env	30 August 2018, 20:41:38 UTC
e5de29a	Joshua Meier	29 August 2018, 22:25:47 UTC	instructions for tensorboard (#61)	30 August 2018, 20:41:37 UTC
2507d33	Joshua Meier	29 August 2018, 22:17:43 UTC	Tensorboard util (#60) * separate_validation_set was not imported * launching tensorboard automatically	30 August 2018, 20:41:37 UTC
bdd4d38	Damien Lancry	29 August 2018, 00:48:56 UTC	Fix result_plotters in vectorized mujoco environments (#533) * I investigated a bit about running a training in a vectorized monitored mujoco env and found out that the 0.monitor.csv file could not be plotted using baselines.results_plotter.py functions. Moreover the seed is the same in every parallel environments due to the particular behaviour of lambda. this fixes both issues without breaking the function in other files (baselines.acktr.run_mujoco still works) * unifies make_atari_env and make_mujoco_env * redefine make_mujoco_env because of run_mujoco in acktr not compatible with DummyVecEnv and SubprocVecEnv * fix if else * Update run.py	29 August 2018, 00:48:56 UTC
0961f5d	Peter Zhokhov	27 August 2018, 23:39:51 UTC	git subrepo pull (merge) baselines subrepo: subdir: "baselines" merged: "95a81e86" upstream: origin: "git@github.com:openai/baselines.git" branch: "master" commit: "c6c0f45c" git-subrepo: version: "0.4.0" origin: "git@github.com:ingydotnet/git-subrepo.git" commit: "74339e8"	27 August 2018, 23:40:14 UTC
337d913	Christopher Hesse	27 August 2018, 19:48:05 UTC	remove reset_task from subproc vec env (#45)	27 August 2018, 23:40:14 UTC
34af61a	Karl Cobbe	27 August 2018, 03:54:38 UTC	baselines: fix dummy vec env render mode (#42)	27 August 2018, 23:40:14 UTC
1ea5ec6	Christopher Hesse	24 August 2018, 22:44:56 UTC	export SimpleEnv and assert_envs_equal, fix minor bug in action space (#46)	27 August 2018, 23:40:14 UTC
2fc7a1c	pzhokhov	23 August 2018, 20:20:01 UTC	Trigger benchmarks from buildkite (#40) * rig buildkite pipeline to run benchmarks when commit ends with RUN BENCHMARKS * fix the buildkite pipeline file * fix the buildkite pipeline file * fix the buildkite pipeline file * fix the buildkite pipeline file * fix the buildkite pipeline file * fix the buildkite pipeline file * fix the buildkite pipeline file - merge test and benchmark steps * fix the buildkite pipeline file - merge test and benchmark steps * fix buildkite pipeline file * fix buildkite pipeline file * dry RUN BENCHMARKS * dry RUN BENCHMARKS * dry not run BENCHMARKS * not run benchmarks * not running benchmarks * no running benchmarks * no running benchmarks * still not running benchmarks * dummy commit to RUN BENCHMARKS * trigger benchmarks from buildkite RUN BENCHMARKS * specifying RCALL_KUBE_CLUSTER RUN BENCHMARKS * remove rl-algs/run-benchmarks-new.py (moved to ci), merged baselines/common/console_util and baselines/common/util.py * added missing imports in console_util * clone subrepo over https	27 August 2018, 23:40:14 UTC
14c1d69	John Schulman	22 August 2018, 20:54:34 UTC	Reduce duplication in VecEnv subclasses. (#38) * Reduce duplication in VecEnv subclasses. Now VecEnv base class handles rendering and closing; subclasses should provide get_images and (optionally) close_extras. * fix tests * minor docstring change * raise NotImplementedError	27 August 2018, 23:40:13 UTC
c8f6d8b	pzhokhov	22 August 2018, 03:51:09 UTC	address rl-algs issue #169 (missing util functions from rcall) (#30) * copied parts of util.py to baselines.common from rcall * merged fix for baselines.logger, resolved conflicts * copied ccap to baselines/baselines/common/util.py	27 August 2018, 23:40:13 UTC
3a006ba	pzhokhov	22 August 2018, 00:41:25 UTC	flake8 fixes (#35) * flake8 fixes * added baselines/setup.cfg * style checks using setup.cfg in baselines	27 August 2018, 23:40:13 UTC
c6c0f45	Tom	27 August 2018, 19:36:43 UTC	fix 'async' is a reserved word in Python >= 3.7 (#495) (#542)	27 August 2018, 19:36:43 UTC
e92a6ad	wangjksjtu	27 August 2018, 19:35:48 UTC	Update README.md (#537) １. Delete repetitive section 2. Align the commands	27 August 2018, 19:35:48 UTC
92b9a37	HelgeS	23 August 2018, 22:58:27 UTC	Updated example commands to run ppo2 (#534) The headline mentions PPO, but the command was for A2C	23 August 2018, 22:58:27 UTC
cb14da9	Armin Primadi	23 August 2018, 22:56:13 UTC	Fix typo on policies documentation (#535)	23 August 2018, 22:56:13 UTC
3900f2a	pzhokhov	21 August 2018, 23:51:37 UTC	baselines issue 146 (remove tensorflow from setup.py) (#34) * baselines does not reinstall tensorflow * fix the version check in baselines/setup.py * replace print and assert with assert, str (thanks @csh)	21 August 2018, 23:59:05 UTC
20d22a5	pzhokhov	21 August 2018, 02:15:17 UTC	Fix baselines build (fails due to lack of mujoco in public baselines container) (#29) * make nminibatces = min(nminibatches, nenv) * clarify the usage of lstm policy, add an example and a test * cleaned up example, added assert to the test * remove nminibatches -> min(nminibatches, num_env) * removed code snippet from the docstring, pointing to the file * add _mujoco_present flag to skip the tests that require mujoco if mujoco is not present * re-format skip message in test_doc_examples * flake8 complaints	21 August 2018, 17:08:24 UTC
caf7b08	pzhokhov	20 August 2018, 20:54:56 UTC	Baselines issue #525 (lack of docs for recurrent policies) (#27) * make nminibatces = min(nminibatches, nenv) * clarify the usage of lstm policy, add an example and a test * cleaned up example, added assert to the test * remove nminibatches -> min(nminibatches, num_env) * removed code snippet from the docstring, pointing to the file	20 August 2018, 20:55:35 UTC
ca0165c	Peter Zhokhov	18 August 2018, 01:09:13 UTC	flake8 complaints	18 August 2018, 01:11:00 UTC
eb5b605	pzhokhov	18 August 2018, 00:01:40 UTC	restore subrepo conftest.py files (#22) * restore conftest.py in subrepos * remove conftest files from subrepos in the docker image * remove runslow flag from baselines .travis.yml and rl-algs ci/runtests.sh * move import of rendering module into the code to fix tests that don't require a display * restore the dockerfile	18 August 2018, 00:02:39 UTC
a89bee3	Peter Zhokhov	17 August 2018, 20:55:27 UTC	Merge commit 'refs/subrepo/baselines/fetch' into subrepo/baselines	17 August 2018, 20:55:27 UTC
353bb15	pzhokhov	17 August 2018, 16:40:35 UTC	deduplicate algorithms in rl-algs and baselines (#18) * move vec_env * cleaning up rl_common * tests are passing (but mosts tests are deleted as moved to baselines) * add benchmark runner for smoke tests * removed duplicated algos * route references to rl_algs.a2c to baselines.a2c * route references to rl_algs.a2c to baselines.a2c * unify conftest.py * removing references to duplicated algs from codegen * removing references to duplicated algs from codegen * alex's changes to dummy_vec_env * fixed test_carpole[deepq] testcase by decreasing number of training steps... alex's changes seemed to have fixed the bug and make it train better, but at seed=0 there is a dip in the training curve at 30k steps that fails the test * codegen tests with atol=1e-6 seem to be unstable * rl_common.vec_env -> baselines.common.vec_env mass replace * fixed reference in trpo_mpi * a2c.util references * restored rl_algs.bench in sonic_prob * fix reference in ci/runtests.sh * simplifed expression in baselines/common/cmd_util * further increased rtol to 1e-3 in codegen tests * switched vecenvs to use SimpleImageViewer from gym instead of cv2 * make run.py --play option work with num_envs > 1 * make rosenbrock test reproducible * git subrepo pull (merge) baselines subrepo: subdir: "baselines" merged: "e23524a5" upstream: origin: "git@github.com:openai/baselines.git" branch: "master" commit: "bcde04e7" git-subrepo: version: "0.4.0" origin: "git@github.com:ingydotnet/git-subrepo.git" commit: "74339e8" * updated baselines README (num-timesteps --> num_timesteps) * typo in deepq/README.md	17 August 2018, 20:54:11 UTC
64c0c0a	pzhokhov	15 August 2018, 17:34:38 UTC	Setup travis (#12) * re-setting up travis * re-setting up travis * resolved merge conflicts, added missing dependency for codegen * removed parallel tests (workers are failing for some reason) * try test baselines only * added language options - some weirdness in rcall image that requires them? * added verbosity to tests * try tests in baselines only * ci/runtests.sh tests codegen (some failure on baselines specifically on travis, trying to narrow down the problem) * removed render from codegen test - maybe that's the problem? * trying even simpler command within the image to figure out the problem * print out system info in ci/runtests.sh * print system info outside of docker as well * trying single test file in codegen * install graphviz in the docker image * git subrepo pull baselines subrepo: subdir: "baselines" merged: "8c2aea2" upstream: origin: "git@github.com:openai/baselines.git" branch: "master" commit: "8c2aea2" git-subrepo: version: "0.4.0" origin: "git@github.com:ingydotnet/git-subrepo.git" commit: "74339e8" * added graphviz to the dockerfile (need both graphviz-dev and graphviz) * only tests in codegen/algo/test_algo_builder.py * run baselines tests only. still no clue why collection of codegen tests fails * update baselines setup to install filelock for tests * run slow tests * skip slow tests in baselines * single test file in baselines * try reinstalling tensorflow * running slow tests * try full baselines and codegen test suite * in the test Dockerfile, reinstall tensorflow * using fake display for codegen render tests * fixed display-related failures by adding a custom entrpoint to the docker image * set LC_ALL and LANG env variables in docker image * try sequential tests * include psutil in requirements; increase relative tolerance in test_low_level_algo_distr * trying to fix codegen failures on travis * git subrepo commit (merge) baselines subrepo: subdir: "baselines" merged: "9ce84da" upstream: origin: "git@github.com:openai/baselines.git" branch: "master" commit: "b222dd0" git-subrepo: version: "0.4.0" origin: "git@github.com:ingydotnet/git-subrepo.git" commit: "74339e8" * syntax in install.py * changing the order of package installation * removed supervised-reptile from installation list * cron uses the full games repo in rcall * flake8 complaints * rewrite all extras logic in baselines, install.py always uses [all]	17 August 2018, 20:54:10 UTC
5fee99e	pzhokhov	15 August 2018, 17:34:38 UTC	Setup travis (#12) * re-setting up travis * re-setting up travis * resolved merge conflicts, added missing dependency for codegen * removed parallel tests (workers are failing for some reason) * try test baselines only * added language options - some weirdness in rcall image that requires them? * added verbosity to tests * try tests in baselines only * ci/runtests.sh tests codegen (some failure on baselines specifically on travis, trying to narrow down the problem) * removed render from codegen test - maybe that's the problem? * trying even simpler command within the image to figure out the problem * print out system info in ci/runtests.sh * print system info outside of docker as well * trying single test file in codegen * install graphviz in the docker image * git subrepo pull baselines subrepo: subdir: "baselines" merged: "8c2aea2" upstream: origin: "git@github.com:openai/baselines.git" branch: "master" commit: "8c2aea2" git-subrepo: version: "0.4.0" origin: "git@github.com:ingydotnet/git-subrepo.git" commit: "74339e8" * added graphviz to the dockerfile (need both graphviz-dev and graphviz) * only tests in codegen/algo/test_algo_builder.py * run baselines tests only. still no clue why collection of codegen tests fails * update baselines setup to install filelock for tests * run slow tests * skip slow tests in baselines * single test file in baselines * try reinstalling tensorflow * running slow tests * try full baselines and codegen test suite * in the test Dockerfile, reinstall tensorflow * using fake display for codegen render tests * fixed display-related failures by adding a custom entrpoint to the docker image * set LC_ALL and LANG env variables in docker image * try sequential tests * include psutil in requirements; increase relative tolerance in test_low_level_algo_distr * trying to fix codegen failures on travis * git subrepo commit (merge) baselines subrepo: subdir: "baselines" merged: "9ce84da" upstream: origin: "git@github.com:openai/baselines.git" branch: "master" commit: "b222dd0" git-subrepo: version: "0.4.0" origin: "git@github.com:ingydotnet/git-subrepo.git" commit: "74339e8" * syntax in install.py * changing the order of package installation * removed supervised-reptile from installation list * cron uses the full games repo in rcall * flake8 complaints * rewrite all extras logic in baselines, install.py always uses [all]	17 August 2018, 20:40:02 UTC
5edcd68	Youngjin Kim	16 August 2018, 21:55:57 UTC	Fix argument error in deepq (#508) * Fix argment error in deepq * Fix argment error in deepq	16 August 2018, 21:55:57 UTC
bcde04e	Youngjin Kim	16 August 2018, 21:55:57 UTC	Fix argument error in deepq (#508) * Fix argment error in deepq * Fix argment error in deepq	16 August 2018, 21:55:57 UTC
5622a09	pzhokhov	16 August 2018, 21:53:49 UTC	update readmes (#514) * update per-algorithm READMEs to reflect new way of running algorithms * adding a link to repo-wide README * updated README files and deepq.train_cartpole example	16 August 2018, 21:53:49 UTC
cd375ab	pzhokhov	16 August 2018, 21:53:49 UTC	update readmes (#514) * update per-algorithm READMEs to reflect new way of running algorithms * adding a link to repo-wide README * updated README files and deepq.train_cartpole example	16 August 2018, 21:53:49 UTC
e2da7cd	Pim de Haan	16 August 2018, 19:08:53 UTC	Several bugfixes for #504, #505, #506 related to Classic Control and deepq (#507) * Several bugfixes * Fixed ActWrapper.step bug	16 August 2018, 19:08:53 UTC
b222dd0	Peter Zhokhov	13 August 2018, 23:01:24 UTC	updated links in README to point to master	13 August 2018, 23:01:24 UTC
1870685	pzhokhov	13 August 2018, 22:59:43 UTC	Publish benchmark results (#502) * updated benchmark pages with final rewards * use htmlpreview to render pages * use htmlpreview to render pages * use htmlpreview to render pages * updated README to reflect ppo1 being obsolete * removed navbars from published benchmark pages * fixed link in README	13 August 2018, 22:59:43 UTC
8c2aea2	pzhokhov	13 August 2018, 16:56:44 UTC	refactor a2c, acer, acktr, ppo2, deepq, and trpo_mpi (#490) * exported rl-algs * more stuff from rl-algs * run slow tests * re-exported rl_algs * re-exported rl_algs - fixed problems with serialization test and test_cartpole * replaced atari_arg_parser with common_arg_parser * run.py can run algos from both baselines and rl_algs * added approximate humanoid reward with ppo2 into the README for reference * dummy commit to RUN BENCHMARKS * dummy commit to RUN BENCHMARKS * dummy commit to RUN BENCHMARKS * dummy commit to RUN BENCHMARKS * very dummy commit to RUN BENCHMARKS * serialize variables as a dict, not as a list * running_mean_std uses tensorflow variables * fixed import in vec_normalize * dummy commit to RUN BENCHMARKS * dummy commit to RUN BENCHMARKS * flake8 complaints * save all variables to make sure we save the vec_normalize normalization * benchmarks on ppo2 only RUN BENCHMARKS * make_atari_env compatible with mpi * run ppo_mpi benchmarks only RUN BENCHMARKS * hardcode names of retro environments * add defaults * changed default ppo2 lr schedule to linear RUN BENCHMARKS * non-tf normalization benchmark RUN BENCHMARKS * use ncpu=1 for mujoco sessions - gives a bit of a performance speedup * reverted running_mean_std to user property decorators for mean, var, count * reverted VecNormalize to use RunningMeanStd (no tf) * reverted VecNormalize to use RunningMeanStd (no tf) * profiling wip * use VecNormalize with regular RunningMeanStd * added acer runner (missing import) * flake8 complaints * added a note in README about TfRunningMeanStd and serialization of VecNormalize * dummy commit to RUN BENCHMARKS * merged benchmarks branch	13 August 2018, 16:56:44 UTC
366f486	Tony Yu Cao	08 August 2018, 17:42:10 UTC	Update README.md (#416) Update Atari example	08 August 2018, 17:42:10 UTC
f272969	Adam Gleave	06 July 2018, 23:12:14 UTC	GAIL: bugfix in dataset loading (#447) * Fix silly typo * Replace ad-hoc function with NumPy code	06 July 2018, 23:12:14 UTC
a6b1bc7	pzhokhov	08 June 2018, 16:41:45 UTC	re-import internal; fix missing tile_images.py (#427) * import rl-algs from 2e3a166 commit * extra import of the baselines badge * exported commit with identity test * proper rng seeding in the test_identity * import internal * adding missing tile_images.py	08 June 2018, 16:41:45 UTC
36ee5d1	pzhokhov	06 June 2018, 18:39:13 UTC	Import internal changes (#422) * import rl-algs from 2e3a166 commit * extra import of the baselines badge * exported commit with identity test * proper rng seeding in the test_identity * import internal	06 June 2018, 18:39:13 UTC
24fe3d6	pzhokhov	21 May 2018, 22:24:00 UTC	Import internal repo (#409) * import rl-algs from 2e3a166 commit * extra import of the baselines badge * exported commit with identity test * proper rng seeding in the test_identity	21 May 2018, 22:24:00 UTC
9cb7ece	pzhokhov	14 May 2018, 17:52:19 UTC	add opencv-python to the dependencies (#407)	14 May 2018, 17:52:19 UTC
9cf95a0	pzhokhov	03 May 2018, 16:43:28 UTC	setup travis ci build (#388) * simple .travis.yml file * added static syntax checks of common to .travis.yml * dockerizing the build * fix Dockerfile, adding build shield * cleaning up workdir in Dockerfile and .travis.yml * .travis.yml fixed common -> baselines/common for style check	03 May 2018, 16:43:28 UTC
8b78103	pzhokhov	03 May 2018, 01:42:48 UTC	put filters and running_stat files in common instead of acktr (#389)	03 May 2018, 01:42:48 UTC
69f25c6	pzhokhov	01 May 2018, 23:54:04 UTC	import internal repo (#385)	01 May 2018, 23:54:04 UTC
2b0283b	pzhokhov	26 April 2018, 00:40:48 UTC	Readme.md detailed installation instructions (#377) * changes to README.md files with more detailed installation instructions * md-fying the changes better * link on the word homebrew in readme.md * typos in README.md * README.md * removed extra comma sign * removed sudo from brew command	26 April 2018, 00:40:48 UTC
1f8a03f	Matthias Plappert	26 March 2018, 14:50:22 UTC	Update README	26 March 2018, 14:50:22 UTC
3cc7df0	Matthias Plappert	05 March 2018, 10:06:17 UTC	Minor fixes to HER release (#319) * Fix plotting script * Add warning if num_cpu = 1	05 March 2018, 10:06:17 UTC
8b3a6c2	Alex Nichol	03 March 2018, 01:18:07 UTC	fix DummyVecEnv reusing buffers	03 March 2018, 01:18:07 UTC
569bd42	Alex Nichol	01 March 2018, 18:45:04 UTC	Merge pull request #308 from araffin/master Bug fix in saving ACER model	01 March 2018, 18:45:04 UTC
f49a9c3	Daniel Ziegler	01 March 2018, 17:00:34 UTC	Fix bug in DDPG parameter space noise adaptation (#306) The training loop used the rollout step variable `t` rather than the training step variable `t_train` to decide when to adapt the scale of the parameter space noise.	01 March 2018, 17:00:34 UTC
14f2f93	Antonin RAFFIN	01 March 2018, 09:24:14 UTC	Bug fix in saving ACER model	01 March 2018, 09:24:14 UTC
6bdf2f5	Alex Nichol	28 February 2018, 03:00:37 UTC	Merge pull request #132 from bhatiaabhinav/bug_fixes Bug fix in saving a2c model.	28 February 2018, 03:00:37 UTC
97be70d	Alex Nichol	28 February 2018, 02:55:10 UTC	fixes for DummyVecEnv Fixes various problems running MuJoCo tasks.	28 February 2018, 02:55:10 UTC
b71152e	Matthias Plappert	26 February 2018, 16:40:16 UTC	Adds support for Hindsight Experience Replay (HER) (#299) * Add Hindsight Experience Replay (HER) * Minor improvements	26 February 2018, 16:40:16 UTC
df2e846	Christopher Hesse	15 February 2018, 06:01:16 UTC	export: fix accidental rename	15 February 2018, 06:01:16 UTC
edb52c2	Christopher Hesse	15 February 2018, 05:42:22 UTC	export: Fix deepq param noise refactoring, remove atari experiments and azure dependency	15 February 2018, 05:42:22 UTC
98257ef	Andrei Kashin	02 October 2017, 22:29:07 UTC	Flush temporary file before compressing it. We need to flush the buffer after `pickle.dump`, otherwise the resulting zip archive might be incomplete (reproducible, if the state consists of a single integer).	06 February 2018, 15:04:44 UTC
d9b3660	Oleg Klimov	05 February 2018, 20:25:05 UTC	comment about loading weights in ppo2	05 February 2018, 20:25:05 UTC
2793971	Oleg Klimov	05 February 2018, 15:51:27 UTC	fix gail tf_util usage	05 February 2018, 15:51:27 UTC
16d7d23	John Schulman	03 February 2018, 07:04:53 UTC	Merge pull request #271 from simontudo/add-requirement-cloudpickle added cloudpickle to requirements	03 February 2018, 07:04:53 UTC
9175b77	John Schulman	03 February 2018, 07:03:51 UTC	Merge pull request #273 from simontudo/videorecorder-import updated videorecorder import	03 February 2018, 07:03:51 UTC
615870a	simontudo	01 February 2018, 11:09:08 UTC	updated videorecorder import	01 February 2018, 11:09:08 UTC
7bd264e	simontudo	31 January 2018, 09:43:17 UTC	added cloudpickle to requirements	31 January 2018, 09:43:17 UTC
8d03102	John Schulman	29 January 2018, 08:54:51 UTC	Merge pull request #265 from 20chase/patch-1 fix logger error for trpo_mpi	29 January 2018, 08:54:51 UTC
4a77855	20chase	29 January 2018, 08:52:01 UTC	using mujoco_arg_parser as args remove origin parser	29 January 2018, 08:52:01 UTC
2e29b41	John Schulman	28 January 2018, 02:42:31 UTC	Merge pull request #268 from ei-grad/master Fix fc call in AcerLstmPolicy	28 January 2018, 02:42:31 UTC
634e37c	Andrew Grigorev	27 January 2018, 20:18:02 UTC	Fix fc call in AcerLstmPolicy The `act` keyword was removed from baselines.a2c.utils.fc in commit 9fa8e1b.	27 January 2018, 20:18:02 UTC
452b548	20chase	26 January 2018, 06:34:01 UTC	Merge branch 'master' into patch-1	26 January 2018, 06:34:01 UTC
ebb8aff	John Schulman	26 January 2018, 05:17:40 UTC	fix trpo_mpi bug where logstd wasn’t included	26 January 2018, 05:17:40 UTC
c9613b2	John Schulman	26 January 2018, 04:54:34 UTC	Merge pull request #259 from andrewliao11/openai_gail Add gail maintainer list	26 January 2018, 04:54:34 UTC
459f007	John Schulman	26 January 2018, 04:54:20 UTC	Merge pull request #260 from uidilr/master Add GAIL	26 January 2018, 04:54:20 UTC
9fa8e1b	John Schulman	26 January 2018, 02:33:48 UTC	Lots of cleanups Fixes for new gym version Add @olegklimov and @unixpickle to authors list	26 January 2018, 02:54:24 UTC
ac2ea4f	20chase	25 January 2018, 14:09:00 UTC	fix logger error for MPI Can't run logger.configure() if rank != 0	25 January 2018, 14:09:00 UTC
d8cce23	Yusuke Nakata	23 January 2018, 03:00:00 UTC	Add GAIL	23 January 2018, 03:02:03 UTC
0c207f0	andrew	22 January 2018, 06:13:01 UTC	fix typo	22 January 2018, 06:13:01 UTC
41d41fa	andrew	22 January 2018, 06:12:03 UTC	add gail maintainer list	22 January 2018, 06:12:03 UTC
b5be53d	John Schulman	22 January 2018, 01:30:20 UTC	Merge pull request #229 from andrewliao11/gail GAIL implementation	22 January 2018, 01:30:20 UTC
49c1a8e	Matthias Plappert	16 January 2018, 18:24:30 UTC	Fix bug in parameter space noise DQN	16 January 2018, 18:24:30 UTC
e5a714b	andrew	12 January 2018, 23:12:45 UTC	fix relative import	12 January 2018, 23:12:45 UTC
f9d1d33	John Schulman	12 January 2018, 19:05:29 UTC	remove mpirun from ppo2 instructions	12 January 2018, 19:05:29 UTC
8c90f67	Alex Nichol	15 December 2017, 23:54:43 UTC	don't list TensorFlow as a requirement fixes #146 A better (more involved) solution might be to check for a TensorFlow installation manually in setup.py and deal with that accordingly.	15 December 2017, 23:54:43 UTC
f22bee0	Andrew	13 December 2017, 03:03:42 UTC	Add files via upload	13 December 2017, 03:03:42 UTC
4acc71f	andrew	13 December 2017, 02:58:57 UTC	add x, y, axis name	13 December 2017, 02:58:57 UTC
2f1b629	andrew	13 December 2017, 02:56:00 UTC	Merge branch 'gail' of https://github.com/andrewliao11/baselines into gail	13 December 2017, 02:56:00 UTC
00573cf	andrew	13 December 2017, 02:54:03 UTC	add x, y axis name	13 December 2017, 02:54:03 UTC
cfa1236	Andrew	12 December 2017, 05:21:56 UTC	Update README.md	12 December 2017, 05:21:56 UTC
64288f9	Andrew	12 December 2017, 05:19:47 UTC	Update gail-result.md	12 December 2017, 05:19:47 UTC
5f647d4	Andrew	12 December 2017, 05:18:05 UTC	Update README.md	12 December 2017, 05:18:05 UTC
6723455	Andrew	12 December 2017, 05:15:30 UTC	Update gail-result.md	12 December 2017, 05:15:30 UTC
45a93cf	Andrew	12 December 2017, 05:06:04 UTC	add training curve from tensorboard	12 December 2017, 05:06:04 UTC
11604f7	andrew	07 December 2017, 20:08:20 UTC	add download link to readme and add description to python file	07 December 2017, 20:08:20 UTC
2444034	John Schulman	04 December 2017, 22:07:01 UTC	Merge pull request #194 from ryanjulian/env_lines Force shebang lines to Python 3	04 December 2017, 22:07:01 UTC
041b6b7	John Schulman	04 December 2017, 22:02:49 UTC	Merge pull request #215 from chris-chris/feature/typo-2017-11-19 fix misspellings	04 December 2017, 22:02:49 UTC
5d62b5b	John Schulman	04 December 2017, 22:01:38 UTC	Merge pull request #221 from jvmancuso/patch-1 Docstring fix	04 December 2017, 22:01:38 UTC
2fcc9b9	John Schulman	04 December 2017, 22:01:12 UTC	Merge pull request #226 from definitelyuncertain/master Call ppo2 and not ppo1 in ppo2 README.md	04 December 2017, 22:01:12 UTC
0000339	Andrew	03 December 2017, 23:50:24 UTC	Update gail-result.md	03 December 2017, 23:50:24 UTC
6090ee8	andrew	03 December 2017, 23:46:52 UTC	add comparison for expert/BC/gail	03 December 2017, 23:46:52 UTC
7954327	andrew	03 December 2017, 21:55:44 UTC	add behavior cloning learn/eval code	03 December 2017, 21:55:44 UTC

Newer
Older