
Commit e77589a

soycode committed Jul 9, 2018
Add parts 2/3 to assignment
1 parent db0832f commit e77589a

1 file changed: +28 -0 lines changed


assignment/README.md

@@ -19,3 +19,31 @@ resources you have available.

Please turn in an updated notebook with your strategy, as well as text in the
notebook describing it and its performance.


## Part 2 - Play Pong

Implement an agent to play Pong - the OpenAI Gym
[integrates an Atari environment](https://github.com/openai/gym#atari), and you
can start by trying
[example agents](https://github.com/openai/gym/tree/master/examples/agents). The
random agent is a simple initial example, and the keyboard agent can allow you
to actually play the game yourself (and potentially use that data to train
some other agent). The example notebook from the first coding challenge has
useful snippets of code for visualizing what is going on.
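
As a rough illustration of what a random agent does, here is a minimal sketch of the agent/environment loop. It is not the code from the linked examples. To stay runnable without Gym installed it uses `ToyEnv`, a hypothetical stand-in written for this sketch that mimics the classic Gym interface (`reset()` returns an observation, `step(action)` returns `(observation, reward, done, info)`). The class names, the action count, and the toy reward are all assumptions; with Gym you would instead create a Pong environment via `gym.make`, and the exact `step` return values depend on the Gym version.

```python
import random


class ToyEnv:
    """Hypothetical stand-in for a Gym environment (not part of Gym).

    Mimics the classic interface: reset() -> observation,
    step(action) -> (observation, reward, done, info).
    """

    n_actions = 3  # assumed action count, for illustration only

    def __init__(self, episode_length=10):
        self.episode_length = episode_length
        self.t = 0

    def reset(self):
        self.t = 0
        return self.t

    def step(self, action):
        self.t += 1
        reward = 1.0 if action == 0 else 0.0  # arbitrary toy reward
        done = self.t >= self.episode_length
        return self.t, reward, done, {}


class RandomAgent:
    """Picks a uniformly random action and ignores the observation."""

    def __init__(self, n_actions, seed=None):
        self.n_actions = n_actions
        self.rng = random.Random(seed)

    def act(self, observation):
        return self.rng.randrange(self.n_actions)


def run_episode(env, agent):
    """Run one episode and return (total_reward, number_of_steps)."""
    observation = env.reset()
    total_reward, steps, done = 0.0, 0, False
    while not done:
        action = agent.act(observation)
        observation, reward, done, _info = env.step(action)
        total_reward += reward
        steps += 1
    return total_reward, steps


if __name__ == "__main__":
    total, steps = run_episode(ToyEnv(), RandomAgent(ToyEnv.n_actions, seed=0))
    print(f"episode finished after {steps} steps, return {total}")
```

Keeping the agent behind a single `act(observation)` method means a learned policy can later replace `RandomAgent` without touching the episode loop.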

After getting set up and playing with a few agents, check out
[this article](http://karpathy.github.io/2016/05/31/rl/) on using policy
gradients to train a model to play Pong. Code your own agent based on this
technique - [this video](https://youtu.be/YOW8m2YGtRg) illustrates the end
result of such an agent.
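
One building block of a policy-gradient agent is crediting each action with the discounted sum of the rewards that follow it. The sketch below shows only that step, not a full agent. The function name, the default `gamma=0.99`, and the choice to reset the running sum whenever a reward is nonzero (in Pong a nonzero reward marks the end of a point) are assumptions made for this sketch, in the spirit of the linked article.

```python
def discount_rewards(rewards, gamma=0.99):
    """Return the discounted return for every time step.

    Assumption for Pong: a nonzero reward ends a point, so the running
    sum is reset there and credit does not leak across points.
    """
    discounted = [0.0] * len(rewards)
    running = 0.0
    # Walk backwards so each step sees the (discounted) future rewards.
    for t in reversed(range(len(rewards))):
        if rewards[t] != 0:
            running = 0.0  # point boundary: start a fresh sum
        running = running * gamma + rewards[t]
        discounted[t] = running
    return discounted


if __name__ == "__main__":
    # Two points: one won at step 2, one lost at step 4.
    print(discount_rewards([0, 0, 1, 0, -1], gamma=0.5))
    # prints [0.25, 0.5, 1.0, -0.5, -1.0]
```

In a training loop these values would typically be normalized and used to weight the gradient of the log-probability of each action taken.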


## Part 3 - An environment of your choice

OpenAI supports
[many interesting environments](https://github.com/openai/gym#environments) -
pick one, and start by trying to run existing agents and find prior work (as
with Pong). Then take a shot at writing your own - this is open-ended, so have
fun and explore something that interests you, but also chat and share what
you're doing so people can collaborate and learn from one another.
