File tree 1 file changed +27
-0
lines changed
1 file changed +27
-0
lines changed Original file line number Diff line number Diff line change @@ -4,8 +4,35 @@ http://web.mst.edu/~gosavia/joc.pdf
4
4
Algorithms for Reinforcement Learning - Csaba Szepesv´ari
5
5
http://old.sztaki.hu/~szcsaba/papers/RLAlgsInMDPs-lecture.pdf
6
6
7
+ Markov Decision Processes in Artificial Intelligence
8
+ https://zodml.org/sites/default/files/Markov_Decision_Processes_and_Artificial_Intelligence.pdf
9
+
10
+ MDP Preliminaries
11
+ http://nanjiang.cs.illinois.edu/files/cs598/note1.pdf
12
+
13
+ Concentration Inequalities and Multi-Armed Bandits
14
+ http://nanjiang.cs.illinois.edu/files/cs598/note_bandit.pdf
15
+
16
+ Notes on Tabular Methods
17
+ http://nanjiang.cs.illinois.edu/files/cs598/note3.pdf
18
+
19
+ Notes on State Abstractions
20
+ http://nanjiang.cs.illinois.edu/files/cs598/note4.pdf
21
+
22
+ Notes on Fitted Q-iteration
23
+ http://nanjiang.cs.illinois.edu/files/cs598/note5.pdf
24
+
25
+ Convergence of Stochastic Iterative Dynamic Programming Algorithms
26
+ https://papers.nips.cc/paper/764-convergence-of-stochastic-iterative-dynamic-programming-algorithms.pdf
27
+
7
28
Sutton & Barto
8
29
http://incompleteideas.net/sutton/book/the-book-2nd.html
9
30
31
+ Finite-Sample Analysis of Proximal Gradient TD Algorithms
32
+ https://marek.petrik.us/pub/Liu2015.pdf
33
+
34
+ Finite Sample Analyses for TD(0) with Function Approximation
35
+ https://arxiv.org/pdf/1704.01161.pdf
36
+
10
37
Mastering the game of Go with deep neural networks and tree search - Silver, D. et al.
11
38
https://storage.googleapis.com/deepmind-media/alphago/AlphaGoNaturePaper.pdf
You can’t perform that action at this time.
0 commit comments