Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

update master #2

Merged
merged 22 commits into from
Jul 20, 2019
Merged

update master #2

merged 22 commits into from
Jul 20, 2019

Conversation

XMaster96
Copy link
Owner

No description provided.

kantneel and others added 22 commits May 8, 2019 13:55
* fixed bug in VecEnvWrapper.__getattr__ where inherited methods were inaccessible

* improved test for VecEnvWrapper.__getattr__ to be more comprehensive

* changed test function to satisfy code checks

* updated changelog and simplified declaration of self.class_attributes

* modified getattr_depth_check for consistency and added helper method for getting all attributes
* Fixed path splitting in _get_latest_run_id() on Windows machines

* Returned to previous split method, replaced split delimiter with os.sep in _get_latest_run_id function. Wrote test for saving tensorboard data twice with the same logname

* Fixed tests for saving tensorboard twice with same logname

* Updated tensorboard tests

* Updated tensorboard tests. Added name and fix to changelog

* Update test_tensorboard.py

* Update test_tensorboard.py
* added example project

* format

* some textual improvements

* Update docs/misc/projects.rst

Co-Authored-By: Antonin RAFFIN <[email protected]>
* Issue #317 [feature request] filter_size can be a array instead of one value

* Issues #326 [Feature] filter_size can be a array

* Issue #326 [Feature] filter_size can be a array

* Issues #326 [Feature] filter_size can be a array: Line too long

* Update changelog.rst

* Issue #326 [Feature] filter_size can be a array, the added test code is test_a2c_conv.py

* Issues #326 [Feature] filter_size can be a array, remove the unused variables

* Issues #326 [Feature] filter_size can be a array, remove the unused library

* Issue #326, [Feature] filter_size can be a array. Clean up the test code
* Minor typo correction

* Minor typo correction
* fix timestep and learning rate recording

* fix issue related to reset_num_timesteps

* update changelog

* Update changelog.rst
* Update doc: hyperparam tuning for rl zoo

* Add colab notebook link
* * Add `get_parameters` function (returns all loadable/saveable tensorflow Variables)
* Add `load_parameters` function (loads model parameters from file/file-like/list of ndarrays)
* Update A2C, ACER, ACKTR, DDPG, DQN, PPOs, SAC and TRPO to use `get_parameters` to define
  parameters necessary for correctly loading/saving models.

* * Switch from using lists of parameters to dicts of
  variable name -> ndarray.
  * Includes support for loading from older .pkl files
    with a list of parameters

* Renamed `get_parameters` to `_get_parameter_list`
  * `get_parameters` dictionary of variable name -> ndarrays
  * `_get_parameter_list` returns list of tensorflow Variables
    that should be saved/loaded

* Updated changelog for  changes

* Clarified name of  function parameter

* Updated contributor's list

* Fix few PEP8 errors

* Update docs to reflect variable name

* Fix PEP8/style in test_load_parameters

* Requested small typo/doc changes and removed unused parameter in tests

* Add  parameter for  with tests

* Add tests for  from a file/file-like objects

* Use format-function and small line-length change

* Add warning about not updating trainer parameters upon 'load_parameters'

* Add an example of using load/get parameters with a simple ES example

* Use OrderedDict for get_params rather than normal dict

* Make _get_variable_list a public function

* Update load_parameters example with A2C+ES hybrid and only mutating specific parameters
* Add bit flipping env

* HER reloaded (WIP)

* DQN + HER

* Add support for SAC and DDPG

* Add tests for SAC and DDPG + HER
+ add comments

* Bug fix + add comments

* Add action noise for SAC

* Add note about pop-art normalization

* Add saving/loading
+ begin support for VecEnv

* Add success rate

* Fix HER learning method

* Add support for VecEnv
+ improve comments
+ add properties to ReplayBuffer

* Update documentation

* Add HER example

* Removed unused dependencies (tdqm, dill, progressbar2, seaborn, glob2, click)

* Remove note on the replay buffer

* Update doc + add a check for VecEnvWrapper with HER

* Update examples + add notebook for HER

* Add random exploration for SAC and DDPG

* Typo in docstring

* Doc update: add fix for DDPG saved models

* Test with reward offset

* Add GoalEnvNormalize draft

* Remove GoalEnvNormalize

* Fix typo

* Bug fix for HER + VecEnv

* Fix HER test env

* Fixed key order

* Add support for discrete obs space

* Update doc about reproducing experiments

* Update doc: DDPG supports multiprocessing with MPI

* Fix for new abstract method

* Update changelog

* Fix custom policy example

* Add replay_wrapper to base OffPolicy class

* Fix reimport
* Bug fix when not enough samples in the replay buffer

* Correct typo
* Remove buggy `find_trainable_variables` and replace it with `tf_util.get_trainable_vars`

* Patch loading of old DDPG models

* Fix indentation
* improved venenv doc

* updated dummy_vec_env doc + improved vec_env doc

* added VecCheckNan

* added checking nan guide

* added test

* added hyperparam warning to doc

* clean up and typos

* codacy fixes + cleanup + changelog

* hotfix

* fix test

* Update docs/guide/checking_nan.rst

Co-Authored-By: Antonin RAFFIN <[email protected]>

* Update docs/guide/checking_nan.rst

Co-Authored-By: Antonin RAFFIN <[email protected]>

* Update docs/guide/checking_nan.rst

Co-Authored-By: Antonin RAFFIN <[email protected]>

* Update docs/guide/checking_nan.rst

Co-Authored-By: Antonin RAFFIN <[email protected]>

* Update docs/guide/checking_nan.rst

Co-Authored-By: Antonin RAFFIN <[email protected]>

* Update docs/guide/checking_nan.rst

Co-Authored-By: Antonin RAFFIN <[email protected]>

* Update docs/guide/checking_nan.rst

Co-Authored-By: Antonin RAFFIN <[email protected]>

* fixed VecCheckNan exception only called once

* add tf NaN debugging options to the NaN guide
* Clarify recommended SB usage
+ no tech support

* Doc fix

* Bump version

* Update issue template

* Update issue-template.md
* Fix GAE bug for TRPO/PPO1/GAIL

* Update changelog.rst

* Retrieve episode infos when using Monitor

* Add comment about subsampling fisher vector product
…ning log probabilities (#397)

* Support Gaussian probabilities and logp calculation

* Fix linting + missing normalizer

* Add & fix tests

* Fix Gaussian PDF calculation

* Bugfix in Gaussian probability calculation

* Address review comments
* allow logging lists

* Update docs/misc/changelog.rst

Co-Authored-By: Antonin RAFFIN <[email protected]>

* Move imports to top level

* Add return type
@XMaster96 XMaster96 merged commit b9e0fc0 into XMaster96:master Jul 20, 2019
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.