This project was created as an assignment for a lecture at the UZH. A group of 4 students were involved. The project uses two different reinforcement learning algorithms, both of which are applied to the game Flappy Bird. The reinforcement learning algorithms were taken from other existing Repos and merged into one project and the model was re-trained.
- Asynchronous Actor-Critic Agents (A3C)
- Deep Q-Network
- Python 3.5
- pygame
- Keras 2.0
- scikit-image
- TensorFlow
- OpenCV-Python
- h5py
git clone https://github.com/jgresc/FlappyBird-GameAgent.git
cd RLGameAgents
python run.py
Different hyperparameters were tried out, such as various activation functions for the A3C algorithm.
The DQN algorithm was less efficient, and took more time to produce decent results.
However, it also turned out that the trained DQN can play FlappyBird better than the A3C on average.
This work is highly based on the following repos: