This GAIL implementation is closely tied to the PPO algorithm:
- Expert trajectories are generated by a PPO pre-trained model;
- GAIL learns its policy using the PPO algorithm (a minimal sketch of how the discriminator reward plugs into PPO follows this list).
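As a rough illustration of that coupling, the discriminator scores (state, action) pairs and its output is turned into the reward that PPO maximizes in place of the environment reward. The sketch below assumes a PyTorch-style setup; the class and function names are illustrative, not this repository's actual API:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class Discriminator(nn.Module):
    """Scores (state, action) pairs: high logits mean 'looks like expert data'."""
    def __init__(self, obs_dim, act_dim, hidden=64):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(obs_dim + act_dim, hidden), nn.Tanh(),
            nn.Linear(hidden, hidden), nn.Tanh(),
            nn.Linear(hidden, 1),
        )

    def forward(self, obs, act):
        # Raw logit D(s, a); sigmoid of it is the probability of "expert".
        return self.net(torch.cat([obs, act], dim=-1))

def gail_reward(disc, obs, act):
    """Surrogate reward handed to PPO instead of the environment reward.

    Implements -log(1 - sigmoid(logit)), computed via softplus for stability.
    """
    with torch.no_grad():
        return F.softplus(disc(obs, act)).squeeze(-1)
```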
To run the code:
- Generate expert trajectories with expert_trajectory_collector.py (you need a model pre-trained with a specific RL algorithm first);
- Fill in a custom config file for GAIL; a template is provided in config/config.yml (an illustrative loading sketch follows this list);
- Train GAIL from main.py.
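For orientation, the snippet below shows what filling in and loading such a config might look like. The keys are assumptions made for illustration; the actual template in config/config.yml is authoritative:

```python
import yaml

# Illustrative contents only; the real template lives in config/config.yml
# and its actual keys may differ.
example_cfg = """
env: BipedalWalker-v3
expert_trajectories: data/expert_bipedalwalker.npz   # output of expert_trajectory_collector.py
ppo:
  learning_rate: 3.0e-4
  clip_ratio: 0.2
discriminator:
  learning_rate: 3.0e-4
  updates_per_iteration: 5
"""

cfg = yaml.safe_load(example_cfg)
print(cfg["env"], cfg["ppo"]["clip_ratio"])  # BipedalWalker-v3 0.2
```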
Run the algorithm on BipedalWalker-v3 for continuous control.
Expert trajectories are collected by running PPO and saved in .npz format; GAIL then uses PPO for policy optimization.
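A hedged sketch of that collection step is shown below, using the classic gym API. The expert_policy callable and the output file name are placeholders; expert_trajectory_collector.py is the actual script:

```python
import gym
import numpy as np

def collect_expert_trajectories(expert_policy, n_episodes=50,
                                out_path="expert_bipedalwalker.npz"):
    """Roll out a pre-trained policy and store (state, action) pairs as .npz."""
    env = gym.make("BipedalWalker-v3")
    states, actions = [], []
    for _ in range(n_episodes):
        obs, done = env.reset(), False
        while not done:
            act = expert_policy(obs)              # pre-trained PPO actor (placeholder)
            states.append(obs)
            actions.append(act)
            obs, _, done, _ = env.step(act)
    np.savez(out_path, states=np.array(states), actions=np.array(actions))

# GAIL's discriminator later loads the expert data:
# data = np.load("expert_bipedalwalker.npz")
# expert_states, expert_actions = data["states"], data["actions"]
```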
The performance (average reward) curve looks like this:
You may notice that GAIL does not reach PPO's performance; as an imitation learning method, however, GAIL performs well.