-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathrl.html
89 lines (87 loc) · 4.51 KB
/
rl.html
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
<!DOCTYPE html>
<html lang="en">
<head>
<!-- required meta tags -->
<meta charset="utf-8">
<meta http-equiv="X-UA-Compatible" content="IE=edge">
<meta name="viewport" content="width=device-width, initial-scale=1">
<meta name="author" content="Andrew Liang">
<!-- Bootstrap css -->
<link rel="stylesheet" href="css/bootstrap.min.css">
<!-- Fontawesome css -->
<link rel="stylesheet" href="css/all.css">
<link rel="stylesheet" href="styling.css">
<title>Andrew Liang's Website: Reinforcement Learning</title>
</head>
<body>
<nav class="transparent navbar navbar-inverse navbar-static-top">
<div class="container-fluid">
<div class="navbar-header">
<button type="button" class="navbar-toggle collapsed"
data-toggle="collapse" data-target="#nav-content" aria-expanded="false">
<span class="sr-only">Toggle navigation</span>
<span class="icon-bar"></span>
<span class="icon-bar"></span>
<span class="icon-bar"></span>
</button>
<a id="brand" class="navbar-brand" href="index.html">
<span class="fa fa-home" aria-hidden="true"></span>
</a>
<a id="brand" class="navbar-brand" href="rl.html">Reinforcement Learning</a>
</div>
<div class="transparent collapse navbar-collapse" id="nav-content">
<ul class="nav navbar-nav navbar-right">
<li><a href="andrewLiangResume.pdf" target="new">
<span class="fas fa-file-alt" aria-hidden="true"></span> Résumé
</a></li>
<li><a href="https://www.linkedin.com/in/liang-y-andrew/" target="new">
<span class="fab fa-linkedin" aria-hidden="true"></span> LinkedIn
</a></li>
<li><a href="https://github.com/itsabigaundy" target="new">
<span class="fab fa-github" aria-hidden="true"></span> Github
</a></li>
<!--
<li class="dropdown">
<a href="#" role="button" class="dropdown-toggle" data-toggle="dropdown" aria-haspopup="true" aria-expanded="false">
Dropdown
</a>
<ul class="dropdown-menu">
<li>Element 1</li>
<li>Element 2</li>
</ul>
</li>
-->
</ul>
</div>
</div>
</nav>
<div class="jumbotron semi-transparent">
<div class="container">
<h1>My Reinforcement Learning Experience</h1>
</div>
</div>
<div class="container">
<h1 id="who-header">CartPole</h1>
<img class="center-picture" src="images/cartpole-v1-uniformreplay.gif">
<div id="about-me" class="row">
<p class="text-center">
My solution to the <a href="https://gym.openai.com/envs/CartPole-v1/" target="new">CartPole-v1</a>
environment. I used a DDQN agent with uniform replay sampling.
</p>
</div>
<img class="center-picture" src="images/cartpole-v1-uniformreplay.png">
<div id="about-me" class="row">
<p class="text-center">
Cartpole-v1 is considered solved when the agent scores at least 475 points on average
over 100 consecutive episodes.
<br>
The maximum score possible in a single run is 500 points.
</p>
</div>
</div>
<!-- Bootstrap JS -->
<script src="https://code.jquery.com/jquery-3.3.1.slim.min.js" integrity="sha384-q8i/X+965DzO0rT7abK41JStQIAqVgRVzpbzo5smXKp4YfRvH+8abtTE1Pi6jizo" crossorigin="anonymous"></script>
<script src="https://cdnjs.cloudflare.com/ajax/libs/popper.js/1.14.3/umd/popper.min.js" integrity="sha384-ZMP7rVo3mIykV+2+9J3UJ46jBk0WLaUAdn689aCwoqbBJiSnjAK/l8WvCWPIPm49" crossorigin="anonymous"></script>
<script src="js/bootstrap.min.js"></script>
</body>
</html>