A PyTorch implementation of MaxInfoRL, a simple, flexible, and scalable class of reinforcement learning algorithms that enhances exploration in RL by automatically combining intrinsic and extrinsic rewards. For a JAX implementation, see the companion JAX repository.
MaxInfoRL boosts exploration in RL by combining extrinsic rewards with intrinsic exploration bonuses derived from the information gain about the underlying MDP. It naturally trades off maximization of the value function against maximization of the entropy over states, rewards, and actions. MaxInfoRL is very general and can be combined with a variety of off-policy model-free RL methods for continuous state-action spaces. We provide implementations of MaxInfoSAC, MaxRNDSAC, MaxInfoOAC, and MaxInfo ε-greedy, alongside the SAC and OAC baselines.
Installation:
pip install -e .
Training script:
python examples/dmc/experiment.py \
--project_name maxinforl \
--alg maxinfosac \
--domain_name cartpole-swingup_sparse
You can run sac, oac, maxinfosac, maxinfooac, maxrndsac, or maxinfo_eps_greedy by setting the --alg flag accordingly.
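If you prefer to launch several of these configurations from Python rather than the shell, a small wrapper around the same command line works; the sketch below simply reuses the flags of examples/dmc/experiment.py shown above.

```python
import subprocess

# Sweep over the algorithms supported by the --alg flag, reusing the
# command-line interface of examples/dmc/experiment.py shown above.
algs = ["sac", "oac", "maxinfosac", "maxinfooac", "maxrndsac", "maxinfo_eps_greedy"]

for alg in algs:
    subprocess.run(
        [
            "python", "examples/dmc/experiment.py",
            "--project_name", "maxinforl",
            "--alg", alg,
            "--domain_name", "cartpole-swingup_sparse",
        ],
        check=True,
    )
```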
This repo relies on stable-baselines3 to load environments and therefore natively supports Gym environments. If your environment is registered in Gym, you can use it directly; just adjust the configs.yaml file accordingly.
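For example, a custom environment can be registered with Gymnasium before training; the environment id and entry point below are placeholders, and the matching entry you would add to configs.yaml depends on how that file is structured in this repo.

```python
import gymnasium as gym
from gymnasium.envs.registration import register

# Register a custom environment so it can be looked up by id.
# The id and entry point below are placeholders for illustration.
register(
    id="MyCustomEnv-v0",
    entry_point="my_package.envs:MyCustomEnv",
    max_episode_steps=1000,
)

# Once the entry point exists, the registered id can be constructed
# like any other Gym environment.
env = gym.make("MyCustomEnv-v0")
```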
If you find MaxInfoRL useful for your research, please cite this work:
@article{sukhija2024maxinforl,
title={MaxInfoRL: Boosting exploration in reinforcement learning through information gain maximization},
author={Sukhija, Bhavya and Coros, Stelian and Krause, Andreas and Abbeel, Pieter and Sferrazza, Carmelo},
journal={arXiv preprint arXiv:2412.12098},
year={2024}
}
This codebase contains some files adapted from other sources:
- Stable-Baselines3: https://github.com/DLR-RM/stable-baselines3