update master #2

XMaster96 · 2019-07-20T18:02:29Z

No description provided.

* fixed bug in VecEnvWrapper.__getattr__ where inherited methods were inaccessible
* improved test for VecEnvWrapper.__getattr__ to be more comprehensive
* changed test function to satisfy code checks
* updated changelog and simplified declaration of self.class_attributes
* modified getattr_depth_check for consistency and added helper method for getting all attributes

* Fixed path splitting in _get_latest_run_id() on Windows machines
* Returned to previous split method, replaced split delimiter with os.sep in _get_latest_run_id function. Wrote test for saving tensorboard data twice with the same logname
* Fixed tests for saving tensorboard twice with same logname
* Updated tensorboard tests
* Updated tensorboard tests. Added name and fix to changelog
* Update test_tensorboard.py
* Update test_tensorboard.py
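
The path-splitting fix can be illustrated with a small sketch. This is not the library's code: `latest_run_id` and its `sep` parameter are hypothetical names, and the function takes a list of paths rather than scanning a directory so that it runs on any platform. It assumes run directories are named `<logname>_<number>`.

```python
import os


def latest_run_id(paths, log_name, sep=os.sep):
    """Return the highest run number among paths named '<log_name>_<number>'.

    Hypothetical helper for illustration only. `sep` defaults to the
    platform separator (os.sep), which is the point of the fix.
    """
    max_run_id = 0
    for path in paths:
        # Take the last path component using the given separator.
        dir_name = path.split(sep)[-1]
        # Split 'PPO2_3' into the prefix 'PPO2' and the suffix '3'.
        prefix, _, suffix = dir_name.rpartition("_")
        if prefix == log_name and suffix.isdecimal():
            max_run_id = max(max_run_id, int(suffix))
    return max_run_id


windows_paths = ["logs\\PPO2_1", "logs\\PPO2_3"]
# Splitting on the Windows separator finds the existing runs.
found_with_backslash = latest_run_id(windows_paths, "PPO2", sep="\\")
# Splitting on a hard-coded "/" leaves the whole path as one component,
# so no directory name matches and the run counter never advances.
found_with_slash = latest_run_id(windows_paths, "PPO2", sep="/")
```

With a hard-coded `/`, a second run on Windows would reuse the same run number, which is the situation the "saving tensorboard data twice with the same logname" test guards against.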

* Issue #317 [feature request] filter_size can be an array instead of one value
* Issues #326 [Feature] filter_size can be an array
* Issue #326 [Feature] filter_size can be an array
* Issues #326 [Feature] filter_size can be an array: Line too long
* Update changelog.rst
* Issue #326 [Feature] filter_size can be an array, the added test code is test_a2c_conv.py
* Issues #326 [Feature] filter_size can be an array, remove the unused variables
* Issues #326 [Feature] filter_size can be an array, remove the unused library
* Issue #326, [Feature] filter_size can be an array. Clean up the test code
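
As a rough sketch of what accepting either a single value or an array for `filter_size` could look like: `normalize_filter_size` is a hypothetical helper, not a function from the library, and it assumes that an array means a (height, width) pair while a single value means a square filter.

```python
def normalize_filter_size(filter_size):
    """Return a (height, width) tuple from an int or a 2-element list/tuple.

    Hypothetical helper: the (height, width) interpretation of an array
    is an assumption made for this example.
    """
    if isinstance(filter_size, (list, tuple)):
        if len(filter_size) != 2:
            raise ValueError("filter_size must have exactly two elements")
        height, width = filter_size
        return int(height), int(width)
    # A single value describes a square filter.
    return int(filter_size), int(filter_size)


square = normalize_filter_size(3)
rectangular = normalize_filter_size([3, 5])
```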

* Add `get_parameters` function (returns all loadable/saveable tensorflow Variables)
* Add `load_parameters` function (loads model parameters from file/file-like/list of ndarrays)
* Update A2C, ACER, ACKTR, DDPG, DQN, PPOs, SAC and TRPO to use `get_parameters` to define parameters necessary for correctly loading/saving models.
* Switch from using lists of parameters to dicts of variable name -> ndarray.
* Includes support for loading from older .pkl files with a list of parameters
* Renamed `get_parameters` to `_get_parameter_list`
* `get_parameters` dictionary of variable name -> ndarrays
* `_get_parameter_list` returns list of tensorflow Variables that should be saved/loaded
* Updated changelog for changes
* Clarified name of function parameter
* Updated contributor's list
* Fix few PEP8 errors
* Update docs to reflect variable name
* Fix PEP8/style in test_load_parameters
* Requested small typo/doc changes and removed unused parameter in tests
* Add parameter for with tests
* Add tests for from a file/file-like objects
* Use format-function and small line-length change
* Add warning about not updating trainer parameters upon 'load_parameters'
* Add an example of using load/get parameters with a simple ES example
* Use OrderedDict for get_params rather than normal dict
* Make _get_variable_list a public function
* Update load_parameters example with A2C+ES hybrid and only mutating specific parameters
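
The switch from parameter lists to name -> value dictionaries, while still accepting the older list format, could be sketched as below. `to_param_dict` and the variable names are hypothetical and plain Python lists stand in for ndarrays; this shows the idea only and is not the library's implementation.

```python
from collections import OrderedDict


def to_param_dict(loaded, variable_names):
    """Convert loaded parameters to an OrderedDict of name -> value.

    Hypothetical helper. `loaded` is either a dict (newer format, possibly
    covering only some variables) or a list in variable order (older format).
    """
    if isinstance(loaded, dict):
        unknown = [name for name in loaded if name not in variable_names]
        if unknown:
            raise KeyError("unknown parameter names: {}".format(unknown))
        # Keep the model's variable order, skipping names that were not given.
        return OrderedDict(
            (name, loaded[name]) for name in variable_names if name in loaded
        )
    # Older format: a bare list, matched to variables by position.
    loaded = list(loaded)
    if len(loaded) != len(variable_names):
        raise ValueError("expected {} parameters, got {}".format(
            len(variable_names), len(loaded)))
    return OrderedDict(zip(variable_names, loaded))


names = ["model/pi/w", "model/pi/b"]  # made-up variable names
from_old_list = to_param_dict([[0.1, 0.2], [0.0]], names)
only_bias = to_param_dict({"model/pi/b": [1.0]}, names)
```

An ordered mapping keeps the positional meaning of the old list format recoverable, and passing a partial dict corresponds to the "only mutating specific parameters" case mentioned in the last bullet.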

* Add bit flipping env
* HER reloaded (WIP)
* DQN + HER
* Add support for SAC and DDPG
* Add tests for SAC and DDPG + HER + add comments
* Bug fix + add comments
* Add action noise for SAC
* Add note about pop-art normalization
* Add saving/loading + begin support for VecEnv
* Add success rate
* Fix HER learning method
* Add support for VecEnv + improve comments + add properties to ReplayBuffer
* Update documentation
* Add HER example
* Removed unused dependencies (tqdm, dill, progressbar2, seaborn, glob2, click)
* Remove note on the replay buffer
* Update doc + add a check for VecEnvWrapper with HER
* Update examples + add notebook for HER
* Add random exploration for SAC and DDPG
* Typo in docstring
* Doc update: add fix for DDPG saved models
* Test with reward offset
* Add GoalEnvNormalize draft
* Remove GoalEnvNormalize
* Fix typo
* Bug fix for HER + VecEnv
* Fix HER test env
* Fixed key order
* Add support for discrete obs space
* Update doc about reproducing experiments
* Update doc: DDPG supports multiprocessing with MPI
* Fix for new abstract method
* Update changelog
* Fix custom policy example
* Add replay_wrapper to base OffPolicy class
* Fix reimport

* improved VecEnv doc
* updated dummy_vec_env doc + improved vec_env doc
* added VecCheckNan
* added checking nan guide
* added test
* added hyperparam warning to doc
* clean up and typos
* codacy fixes + cleanup + changelog
* hotfix
* fix test
* Update docs/guide/checking_nan.rst Co-Authored-By: Antonin RAFFIN
* Update docs/guide/checking_nan.rst Co-Authored-By: Antonin RAFFIN
* Update docs/guide/checking_nan.rst Co-Authored-By: Antonin RAFFIN
* Update docs/guide/checking_nan.rst Co-Authored-By: Antonin RAFFIN
* Update docs/guide/checking_nan.rst Co-Authored-By: Antonin RAFFIN
* Update docs/guide/checking_nan.rst Co-Authored-By: Antonin RAFFIN
* Update docs/guide/checking_nan.rst Co-Authored-By: Antonin RAFFIN
* fixed VecCheckNan exception only called once
* add tf NaN debugging options to the NaN guide

…ning log probabilities (#397)

* Support Gaussian probabilities and logp calculation
* Fix linting + missing normalizer
* Add & fix tests
* Fix Gaussian PDF calculation
* Bugfix in Gaussian probability calculation
* Address review comments

kantneel and others added 22 commits May 8, 2019 13:55

Remove get_available_gpus() which was unused (#295)

bea2eed

Add ROS local planner project (#327)

c9be8dc

* added example project
* format
* some textual improvements
* Update docs/misc/projects.rst Co-Authored-By: Antonin RAFFIN

Minor typo correction (#332)

a37f1eb

* Minor typo correction
* Minor typo correction

Update README.md (#336)

4bb82b5

Update custom_env.rst (#337)

f3c5897

fix timestep and learning rate recording (#338)

90ab67a

* fix timestep and learning rate recording
* fix issue related to reset_num_timesteps
* update changelog
* Update changelog.rst

Update doc: hyperparameter tuning for rl zoo (#330)

e78a29d

* Update doc: hyperparam tuning for rl zoo
* Add colab notebook link

Add cliprange for value fn (PPO2) (#343)

fefff48

Bug fix when not enough samples in the replay buffer (#354)

65ed396

* Bug fix when not enough samples in the replay buffer
* Correct typo

Fix find trainable vars (#364)

72dab6a

* Remove buggy `find_trainable_variables` and replace it with `tf_util.get_trainable_vars`
* Patch loading of old DDPG models
* Fix indentation

Release 2.6.0 (#369)

0b7726e

Add Adversarial Policies to list of projects using Stable Baselines (#375)

45dcfb9

Clarify recommended SB usage and no tech support (#377)

c68385a

* Clarify recommended SB usage + no tech support
* Doc fix
* Bump version
* Update issue template
* Update issue-template.md

Hotfix: GAE computation for TRPO/PPO1/GAIL (#388)

0e940c7

* Fix GAE bug for TRPO/PPO1/GAIL
* Update changelog.rst
* Retrieve episode infos when using Monitor
* Add comment about subsampling fisher vector product

Allow logging lists (#355)

dc31d83

* allow logging lists
* Update docs/misc/changelog.rst Co-Authored-By: Antonin RAFFIN
* Move imports to top level
* Add return type

XMaster96 merged commit b9e0fc0 into XMaster96:master Jul 20, 2019
