Please use the hyperparameters from this README. With other hyperparameters, things might not work (it's RL, after all)!
Original repository - Link
This is a PyTorch implementation of:
- Advantage Actor-Critic (A2C), a synchronous, deterministic version of A3C
- Proximal Policy Optimization (PPO)
- Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR)
- Generative Adversarial Imitation Learning (GAIL)
Requirements:
- Python 3 (it might work with Python 2, but I didn't test it)
- PyTorch
- OpenAI baselines
In order to install the requirements, run:
# PyTorch
conda install pytorch torchvision -c soumith
# Baselines for Atari preprocessing
git clone https://github.com/openai/baselines.git
cd baselines
pip install -e .
# Other requirements
pip install -r requirements.txt
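Optionally, you can sanity-check the installation with a quick import test:
# Optional: verify that PyTorch and baselines import cleanly
python -c "import torch, baselines; print(torch.__version__)"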
In order to visualize the results, use visualize.ipynb.
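If you prefer a standalone script over the notebook, here is a minimal sketch that plots a moving average of episode rewards, assuming training wrote OpenAI baselines Monitor logs (*.monitor.csv) into a log directory; the logs/ path below is illustrative:
# Minimal sketch: plot a 100-episode moving average of episode rewards
# from baselines Monitor logs. The "logs/" directory is a hypothetical example.
import glob
import pandas as pd
import matplotlib.pyplot as plt

frames = []
for path in glob.glob("logs/*.monitor.csv"):
    # Each Monitor file starts with one '#'-prefixed JSON header line,
    # followed by CSV rows with columns r (reward), l (length), t (time).
    frames.append(pd.read_csv(path, skiprows=1))
df = pd.concat(frames).sort_values("t").reset_index(drop=True)
df["r"].rolling(window=100, min_periods=1).mean().plot()
plt.xlabel("episode")
plt.ylabel("episode reward (100-episode moving average)")
plt.show()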
To train a PNN model from scratch:
python main.py --env-name "PongNoFrameskip-v4" --use-pnn --use-gae --num-processes 8 --num-steps 128 --num-mini-batch 4 --use-linear-lr-decay
To add a new column on top of previously trained ones, set --n-columns and point --pnn-paths at the model saved by an earlier run:
python main.py --env-name "PongNoFrameskip-v4" --use-pnn --n-columns 2 --pnn-paths "path_to_trained_model_from_previous_runs" --use-gae --num-processes 8 --num-steps 128 --num-mini-batch 4 --use-linear-lr-decay
This also works with MiniGrid environments: pass the environment's name (e.g. 'MiniGrid-xyz') as the --env-name argument.
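For example, assuming gym-minigrid is installed and its environments are registered (the environment ID below is just an illustration; the other flags mirror the Pong commands above):
python main.py --env-name "MiniGrid-Empty-8x8-v0" --use-pnn --use-gae --num-processes 8 --num-steps 128 --num-mini-batch 4 --use-linear-lr-decay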