Skip to content

Commit a967f42

Browse files
committed
small update
1 parent 6952b9d commit a967f42

File tree

1 file changed

+27
-0
lines changed

1 file changed

+27
-0
lines changed

rl/extra_reading.txt

+27
Original file line numberDiff line numberDiff line change
@@ -4,8 +4,35 @@ http://web.mst.edu/~gosavia/joc.pdf
44
Algorithms for Reinforcement Learning - Csaba Szepesv´ari
55
http://old.sztaki.hu/~szcsaba/papers/RLAlgsInMDPs-lecture.pdf
66

7+
Markov Decision Processes in Artificial Intelligence
8+
https://zodml.org/sites/default/files/Markov_Decision_Processes_and_Artificial_Intelligence.pdf
9+
10+
MDP Preliminaries
11+
http://nanjiang.cs.illinois.edu/files/cs598/note1.pdf
12+
13+
Concentration Inequalities and Multi-Armed Bandits
14+
http://nanjiang.cs.illinois.edu/files/cs598/note_bandit.pdf
15+
16+
Notes on Tabular Methods
17+
http://nanjiang.cs.illinois.edu/files/cs598/note3.pdf
18+
19+
Notes on State Abstractions
20+
http://nanjiang.cs.illinois.edu/files/cs598/note4.pdf
21+
22+
Notes on Fitted Q-iteration
23+
http://nanjiang.cs.illinois.edu/files/cs598/note5.pdf
24+
25+
Convergence of Stochastic Iterative Dynamic Programming Algorithms
26+
https://papers.nips.cc/paper/764-convergence-of-stochastic-iterative-dynamic-programming-algorithms.pdf
27+
728
Sutton & Barto
829
http://incompleteideas.net/sutton/book/the-book-2nd.html
930

31+
Finite-Sample Analysis of Proximal Gradient TD Algorithms
32+
https://marek.petrik.us/pub/Liu2015.pdf
33+
34+
Finite Sample Analyses for TD(0) with Function Approximation
35+
https://arxiv.org/pdf/1704.01161.pdf
36+
1037
Mastering the game of Go with deep neural networks and tree search - Silver, D. et al.
1138
https://storage.googleapis.com/deepmind-media/alphago/AlphaGoNaturePaper.pdf

0 commit comments

Comments
 (0)