diff --git a/docs/guide/custom_policy.rst b/docs/guide/custom_policy.rst
index 16585f92e8..109afe0600 100644
--- a/docs/guide/custom_policy.rst
+++ b/docs/guide/custom_policy.rst
@@ -36,9 +36,10 @@ You can also easily define a custom architecture for the policy (or value) netwo
 
 .. note::
 
-    Defining a custom policy class is equivalent to passing ``policy_kwargs``. However,
-    it lets you name the policy and so makes usually the code clearer. ``policy_kwargs`` should be rather used
-    when doing hyperparameter search.
+    Defining a custom policy class is equivalent to passing ``policy_kwargs``.
+    However, it lets you name the policy and so makes usually the code clearer.
+    ``policy_kwargs`` should be rather used when doing hyperparameter search.
+
 
 .. code-block:: python
 
@@ -74,7 +75,8 @@ You can also easily define a custom architecture for the policy (or value) netwo
 
 .. warning::
 
-    When loading a model with a custom policy, you must pass the custom policy explicitly when loading the model. (cf previous example)
+    When loading a model with a custom policy, you must pass the custom policy explicitly when loading the model.
+    (cf previous example)
 
 
 You can also register your policy, to help with code simplicity: you can refer to your custom policy using a string.
diff --git a/docs/misc/changelog.rst b/docs/misc/changelog.rst
index 4169197863..9649332fe6 100644
--- a/docs/misc/changelog.rst
+++ b/docs/misc/changelog.rst
@@ -5,9 +5,17 @@ Changelog
 
 For download links, please look at `Github release page `_.
 
-Pre-Release 2.4.0a (WIP)
+Pre-Release 2.4.1 (WIP)
 --------------------------
 
+
+
+
+Release 2.4.0 (2019-01-17)
+--------------------------
+
+**Soft Actor-Critic (SAC) and policy kwargs**
+
 - added Soft Actor-Critic (SAC) model
 - fixed a bug in DQN where prioritized_replay_beta_iters param was not used
 - fixed DDPG that did not save target network parameters
diff --git a/setup.py b/setup.py
index 845df8c48c..34f358fded 100644
--- a/setup.py
+++ b/setup.py
@@ -112,7 +112,7 @@
       license="MIT",
       long_description=long_description,
       long_description_content_type='text/markdown',
-      version="2.4.0a",
+      version="2.4.0",
       )
 
 # python setup.py sdist
diff --git a/stable_baselines/__init__.py b/stable_baselines/__init__.py
index 787b243781..e628f63e17 100644
--- a/stable_baselines/__init__.py
+++ b/stable_baselines/__init__.py
@@ -12,7 +12,7 @@
 from stable_baselines.trpo_mpi import TRPO
 from stable_baselines.sac import SAC
 
-__version__ = "2.4.0a"
+__version__ = "2.4.0"
 
 
 # patch Gym spaces to add equality functions, if not implemented