Q-learning is a reinforcement learning technique used in machine learning. The purpose of Q-learning is to learn a policy that tells an agent which action to take in a given situation. It does not require a model of the environment and can handle problems with stochastic transitions and rewards without requiring adaptations.
For any finite Markov decision process (FMDP), Q-learning finds a policy that is optimal in the sense that it maximizes the expected value of the total reward over all successive steps, starting from the current state. Q-learning can identify an optimal action-selection policy for any FMDP, given infinite exploration time and a partly random policy. "Q" names the function that returns the reward used to provide the reinforcement and can be regarded as the "quality" of an action taken in a given state. In this work, the function Q is approximated by a Residual Neural Network.
The Bellman optimality principle, which is used to update the neural network, defines the optimal Q function recursively: Q(S(t), A(t)) equals the immediate reward received after performing an action at time t plus the discounted expected future reward after the transition to the next state.
Q(S(t), A(t)) ← Q(S(t), A(t)) + α [ R(t+1) + γ max_a Q(S(t+1), a) − Q(S(t), A(t)) ]
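As a minimal illustration of this update rule (a tabular sketch with hypothetical values of α, γ, and the transition, not the network-based code in DQN.py), the snippet below computes the temporal-difference target and moves Q(S(t), A(t)) toward it:

```python
import numpy as np

# Tabular illustration of the Q-learning update above (example values only).
alpha, gamma = 0.1, 0.99          # learning rate and discount factor (assumed)
Q = np.zeros((5, 2))              # toy table: 5 states, 2 actions

s, a, r, s_next = 0, 1, 1.0, 3    # one observed transition (hypothetical)
td_target = r + gamma * np.max(Q[s_next])    # R(t+1) + γ max_a Q(S(t+1), a)
Q[s, a] += alpha * (td_target - Q[s, a])     # move Q(S(t), A(t)) toward the target
```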
Algorithm flow
Initialize the Neural Network to approximate Q
for each iteration:
- Choose the action to take, according to the epsilon-greedy strategy
- Take the action
- Observe the environment and measure the reward
- Update the Neural Network (see the sketch below)
end
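The following is a minimal sketch of this loop, assuming a PyTorch network and a classic gym-style environment exposing reset()/step(); the names used here (train, q_net, env, n_iterations, epsilon) are illustrative placeholders, not the actual interface of DQN.py.

```python
import random
import torch
import torch.nn as nn

def train(env, q_net, n_iterations=1000, epsilon=0.1, gamma=0.99, lr=1e-3):
    """Sketch of the epsilon-greedy Q-learning loop with a neural network."""
    optimizer = torch.optim.Adam(q_net.parameters(), lr=lr)
    loss_fn = nn.MSELoss()
    state = torch.as_tensor(env.reset(), dtype=torch.float32)

    for _ in range(n_iterations):
        # Choose the action with the epsilon-greedy strategy
        if random.random() < epsilon:
            action = env.action_space.sample()
        else:
            with torch.no_grad():
                action = int(q_net(state).argmax())

        # Take the action, observe the environment and measure the reward
        next_obs, reward, done, _ = env.step(action)
        next_state = torch.as_tensor(next_obs, dtype=torch.float32)

        # Update the network toward the Bellman target
        with torch.no_grad():
            target = reward + (0.0 if done else gamma * q_net(next_state).max())
        prediction = q_net(state)[action]
        loss = loss_fn(prediction, torch.as_tensor(target, dtype=torch.float32))
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()

        state = torch.as_tensor(env.reset(), dtype=torch.float32) if done else next_state
```

This sketch updates the network from a single transition per step; the actual implementation in DQN.py may differ (for example by using a replay buffer or a target network).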
The full implementation is in DQN.py. To start training, run:
python DQN.py train