Deep Reinforcement Learning Lab

Author

Abubakar Aliyu BADAWI

University

University of Toulon

Overview

This project explores advanced techniques in deep reinforcement learning, including Imitation Learning, Deep Q-Networks (DQN), and Proximal Policy Optimization (PPO). The objective is to implement these techniques, test different architectures, tweak hyperparameters, and evaluate their performance on various tasks and environments.

Contents

  1. Imitation Learning
  2. Deep Q-Network
  3. Policy Gradients - PPO

Imitation Learning

Exploring machine learning models that mimic expert behavior to perform tasks in a simulated driving environment using CNN and MLP networks.
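The imitation-learning policies are trained by behavior cloning, i.e. supervised learning on expert state-action pairs. The sketch below shows what such a training loop can look like; it assumes PyTorch, a discrete action space, and a pre-collected expert dataset, and the names (`MLPPolicy`, `train_bc`) are illustrative rather than the exact code in `code/`.

```python
import torch
import torch.nn as nn
from torch.utils.data import DataLoader, TensorDataset

class MLPPolicy(nn.Module):
    """Small MLP policy mapping flat observations to action logits."""
    def __init__(self, obs_dim, n_actions):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(obs_dim, 128), nn.ReLU(),
            nn.Linear(128, 128), nn.ReLU(),
            nn.Linear(128, n_actions),
        )

    def forward(self, obs):
        return self.net(obs)

def train_bc(policy, expert_obs, expert_actions, epochs=15, batch_size=64, lr=1e-3):
    """Supervised (behavior-cloning) training on expert (obs, action) pairs."""
    loader = DataLoader(TensorDataset(expert_obs, expert_actions),
                        batch_size=batch_size, shuffle=True)
    optimizer = torch.optim.Adam(policy.parameters(), lr=lr)
    loss_fn = nn.CrossEntropyLoss()  # expert_actions are integer class labels
    for _ in range(epochs):
        for obs, act in loader:
            loss = loss_fn(policy(obs), act)
            optimizer.zero_grad()
            loss.backward()
            optimizer.step()
    return policy
```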

Key Aspects

  • Architectures: CNN and MLP policy networks.
  • Hyperparameter Tuning: Effects of batch sizes and training epochs.
  • DAgger: Enhancing model training using Dataset Aggregation (see the aggregation-loop sketch after this list).
  • Performance Evaluation: Comparing models with and without DAgger.
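DAgger extends behavior cloning by repeatedly rolling out the current learner, asking the expert to label the visited states, and retraining on the aggregated dataset. Below is a minimal sketch of that loop, assuming a Gymnasium-style `env`, a hypothetical `expert_action(obs)` labeller, and the `train_bc` helper sketched above; it is not the exact code in `code/`.

```python
import torch

def dagger(policy, env, expert_action, iterations=5, rollout_steps=1000):
    """Dataset Aggregation: roll out the learner, label visited states with the
    expert, and retrain on everything collected so far."""
    obs_buf, act_buf = [], []
    for _ in range(iterations):
        obs, _ = env.reset()
        for _ in range(rollout_steps):
            # Act with the current learner policy...
            with torch.no_grad():
                logits = policy(torch.as_tensor(obs, dtype=torch.float32).unsqueeze(0))
                action = int(logits.argmax(dim=-1))
            # ...but store the expert's label for the state the learner visited.
            obs_buf.append(torch.as_tensor(obs, dtype=torch.float32))
            act_buf.append(torch.tensor(expert_action(obs)))
            obs, _, terminated, truncated, _ = env.step(action)
            if terminated or truncated:
                obs, _ = env.reset()
        # Retrain on the aggregated expert-labelled dataset.
        policy = train_bc(policy, torch.stack(obs_buf), torch.stack(act_buf))
    return policy
```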

Deep Q-Network

Utilizing DQN for solving control tasks in different settings: MiniGrid and Pong environments.
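At the core of DQN is a temporal-difference update of a Q-network towards a bootstrapped target computed with a frozen target network. The following is a minimal sketch of that update step, assuming PyTorch and a sampled replay batch of tensors; the names are illustrative, not the exact code in `code/`.

```python
import torch
import torch.nn.functional as F

def dqn_update(q_net, target_net, optimizer, batch, gamma=0.99):
    """One TD update on a replay batch of (obs, actions, rewards, next_obs, dones)."""
    obs, actions, rewards, next_obs, dones = batch  # actions: long, dones: float 0/1
    # Q(s, a) for the actions that were actually taken.
    q_values = q_net(obs).gather(1, actions.unsqueeze(1)).squeeze(1)
    # Bootstrapped target from the frozen target network.
    with torch.no_grad():
        next_q = target_net(next_obs).max(dim=1).values
        target = rewards + gamma * (1.0 - dones) * next_q
    loss = F.smooth_l1_loss(q_values, target)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```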

Key Aspects

  • Architectures: Testing MLP and CNN in MiniGrid.
  • Hyperparameter Tuning: Modifying epochs and learning rates to observe changes in performance.
  • Pong Environment: Challenges due to GPU limitations and adjustments in training episodes.

Policy Gradients - PPO

Focusing on the PPO algorithm to refine the policy gradient approach, aiming to improve training stability and efficiency in a simulated BipedalWalker-v3 environment.
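PPO stabilizes policy-gradient training by clipping the probability ratio between the new and old policies, usually combined with an entropy bonus (entropy is one of the metrics tracked in these experiments). Below is a minimal sketch of the clipped surrogate loss, assuming PyTorch and a policy that returns a `torch.distributions` object yielding one log-probability per sample (e.g. an Independent Normal for BipedalWalker's continuous actions); the names are illustrative.

```python
import torch

def ppo_loss(policy_dist, actions, old_log_probs, advantages,
             clip_eps=0.2, entropy_coef=0.01):
    """Clipped surrogate policy loss with an entropy bonus."""
    log_probs = policy_dist.log_prob(actions)
    ratio = torch.exp(log_probs - old_log_probs)
    unclipped = ratio * advantages
    clipped = torch.clamp(ratio, 1.0 - clip_eps, 1.0 + clip_eps) * advantages
    policy_loss = -torch.min(unclipped, clipped).mean()
    entropy_bonus = policy_dist.entropy().mean()
    return policy_loss - entropy_coef * entropy_bonus
```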

Key Aspects

  • Hyperparameter Tuning: Varying the number of episodes and weight decay parameters.
  • Performance Metrics: Observing the impact of changes on rewards, episode lengths, and entropy.

Results

This repository contains the implementation and results of various deep reinforcement learning algorithms, including Imitation Learning, Deep Q-Networks (DQN), and Proximal Policy Optimization (PPO). The main focus of these experiments was to explore the effects of different architectures and hyperparameter settings on the performance of models in simulated environments.

Project Structure

  • report.pdf: A comprehensive report detailing the methodology, experiments, and findings.
  • code/: Directory containing the source code used for all experiments.
  • Images/: Contains all the plots generated during the experiments.

Viewing the Plots

The plots are stored in the Images/ folder and are referenced in the report. They can also be viewed directly on GitHub:

Imitation Learning

  • CNN Policy Network Training
  • MLP Policy Network Training
  • CNN New 15 Epoch
  • MLP New 15 Epoch
  • Dagger Training Curve

Deep Q-Networks (DQN)

  • MLP Network Minigrid
  • CNN Network Minigrid
  • MLP Network Minigrid 500 Episodes
  • CNN Network Minigrid 500 Episodes
  • DQN Pong 200 Episodes
  • DQN Pong 400 Episodes

Proximal Policy Optimization (PPO)

  • PPO BipedalWalker 1000 Episodes
  • PPO BipedalWalker 500 Episodes
  • PPO BipedalWalker 50 Episodes

Setup and Running Instructions

  • Ensure you have Python 3.x installed.
  • Install the necessary dependencies as listed in requirements.txt.
  • Run the scripts in the code/ directory to reproduce the experiments.

Contributing

Feel free to fork this repository and submit pull requests to contribute to this project. You can also open an issue if you find any bugs or have suggestions for additional experiments.

License

This project is open-sourced under the MIT license. See the LICENSE file for more details.

Conclusion

The project highlights the significance of architecture selection and hyperparameter tuning in deep reinforcement learning. Each technique and modification provided valuable lessons on the models' behavior and performance in complex environments.

Acknowledgements

Special thanks to Prof. J. Arjona-Medina and the University of Toulon for guidance and resources throughout this research.
