File tree
72 files changed
+824
-276
lines changed- d3rlpy
- algos
- qlearning
- torch
- transformer
- torch
- models/torch
- ope
- torch
- optimizers
- reproductions
- finetuning
- offline
- online
- tests
- algos/qlearning
- models
Some content is hidden
Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.
72 files changed
+824
-276
lines changedDiff for: .readthedocs.yaml
+1-1
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
2 | 2 |
| |
3 | 3 |
| |
4 | 4 |
| |
5 |
| - | |
| 5 | + | |
6 | 6 |
| |
7 | 7 |
| |
8 | 8 |
| |
|
Diff for: README.md
+1-1
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
54 | 54 |
| |
55 | 55 |
| |
56 | 56 |
| |
57 |
| - | |
| 57 | + | |
58 | 58 |
| |
59 | 59 |
| |
60 | 60 |
| |
|
Diff for: d3rlpy/__init__.py
+5
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
| 1 | + | |
1 | 2 |
| |
2 | 3 |
| |
3 | 4 |
| |
| |||
68 | 69 |
| |
69 | 70 |
| |
70 | 71 |
| |
| 72 | + | |
| 73 | + | |
| 74 | + | |
| 75 | + | |
71 | 76 |
| |
72 | 77 |
| |
73 | 78 |
| |
|
Diff for: d3rlpy/algos/qlearning/awac.py
+8-2
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
70 | 70 |
| |
71 | 71 |
| |
72 | 72 |
| |
| 73 | + | |
73 | 74 |
| |
74 | 75 |
| |
75 | 76 |
| |
| |||
130 | 131 |
| |
131 | 132 |
| |
132 | 133 |
| |
133 |
| - | |
| 134 | + | |
| 135 | + | |
| 136 | + | |
134 | 137 |
| |
135 | 138 |
| |
136 |
| - | |
| 139 | + | |
| 140 | + | |
| 141 | + | |
137 | 142 |
| |
138 | 143 |
| |
139 | 144 |
| |
| |||
158 | 163 |
| |
159 | 164 |
| |
160 | 165 |
| |
| 166 | + | |
161 | 167 |
| |
162 | 168 |
| |
163 | 169 |
| |
|
Diff for: d3rlpy/algos/qlearning/bc.py
+10-2
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
49 | 49 |
| |
50 | 50 |
| |
51 | 51 |
| |
| 52 | + | |
52 | 53 |
| |
53 | 54 |
| |
54 | 55 |
| |
| |||
93 | 94 |
| |
94 | 95 |
| |
95 | 96 |
| |
96 |
| - | |
| 97 | + | |
| 98 | + | |
| 99 | + | |
97 | 100 |
| |
98 | 101 |
| |
99 | 102 |
| |
| |||
103 | 106 |
| |
104 | 107 |
| |
105 | 108 |
| |
| 109 | + | |
106 | 110 |
| |
107 | 111 |
| |
108 | 112 |
| |
| |||
137 | 141 |
| |
138 | 142 |
| |
139 | 143 |
| |
| 144 | + | |
140 | 145 |
| |
141 | 146 |
| |
142 | 147 |
| |
| |||
168 | 173 |
| |
169 | 174 |
| |
170 | 175 |
| |
171 |
| - | |
| 176 | + | |
| 177 | + | |
| 178 | + | |
172 | 179 |
| |
173 | 180 |
| |
174 | 181 |
| |
| |||
178 | 185 |
| |
179 | 186 |
| |
180 | 187 |
| |
| 188 | + | |
181 | 189 |
| |
182 | 190 |
| |
183 | 191 |
| |
|
Diff for: d3rlpy/algos/qlearning/bcq.py
+14-3
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
137 | 137 |
| |
138 | 138 |
| |
139 | 139 |
| |
| 140 | + | |
140 | 141 |
| |
141 | 142 |
| |
142 | 143 |
| |
| |||
228 | 229 |
| |
229 | 230 |
| |
230 | 231 |
| |
231 |
| - | |
| 232 | + | |
| 233 | + | |
| 234 | + | |
232 | 235 |
| |
233 | 236 |
| |
234 |
| - | |
| 237 | + | |
| 238 | + | |
| 239 | + | |
235 | 240 |
| |
236 | 241 |
| |
237 | 242 |
| |
238 | 243 |
| |
239 | 244 |
| |
| 245 | + | |
240 | 246 |
| |
241 | 247 |
| |
242 | 248 |
| |
| |||
264 | 270 |
| |
265 | 271 |
| |
266 | 272 |
| |
| 273 | + | |
267 | 274 |
| |
268 | 275 |
| |
269 | 276 |
| |
| |||
331 | 338 |
| |
332 | 339 |
| |
333 | 340 |
| |
| 341 | + | |
334 | 342 |
| |
335 | 343 |
| |
336 | 344 |
| |
| |||
402 | 410 |
| |
403 | 411 |
| |
404 | 412 |
| |
405 |
| - | |
| 413 | + | |
| 414 | + | |
| 415 | + | |
406 | 416 |
| |
407 | 417 |
| |
408 | 418 |
| |
| |||
422 | 432 |
| |
423 | 433 |
| |
424 | 434 |
| |
| 435 | + | |
425 | 436 |
| |
426 | 437 |
| |
427 | 438 |
| |
|
Diff for: d3rlpy/algos/qlearning/bear.py
+15-4
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
114 | 114 |
| |
115 | 115 |
| |
116 | 116 |
| |
| 117 | + | |
117 | 118 |
| |
118 | 119 |
| |
119 | 120 |
| |
| |||
217 | 218 |
| |
218 | 219 |
| |
219 | 220 |
| |
220 |
| - | |
| 221 | + | |
| 222 | + | |
| 223 | + | |
221 | 224 |
| |
222 | 225 |
| |
223 |
| - | |
| 226 | + | |
| 227 | + | |
| 228 | + | |
224 | 229 |
| |
225 | 230 |
| |
226 | 231 |
| |
227 | 232 |
| |
228 | 233 |
| |
| 234 | + | |
229 | 235 |
| |
230 | 236 |
| |
231 |
| - | |
| 237 | + | |
| 238 | + | |
| 239 | + | |
232 | 240 |
| |
233 | 241 |
| |
234 |
| - | |
| 242 | + | |
| 243 | + | |
| 244 | + | |
235 | 245 |
| |
236 | 246 |
| |
237 | 247 |
| |
| |||
266 | 276 |
| |
267 | 277 |
| |
268 | 278 |
| |
| 279 | + | |
269 | 280 |
| |
270 | 281 |
| |
271 | 282 |
| |
|
Diff for: d3rlpy/algos/qlearning/cal_ql.py
+14-5
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
69 | 69 |
| |
70 | 70 |
| |
71 | 71 |
| |
| 72 | + | |
72 | 73 |
| |
73 | 74 |
| |
74 | 75 |
| |
| |||
88 | 89 |
| |
89 | 90 |
| |
90 | 91 |
| |
91 |
| - | |
92 | 92 |
| |
93 | 93 |
| |
94 | 94 |
| |
| |||
128 | 128 |
| |
129 | 129 |
| |
130 | 130 |
| |
131 |
| - | |
| 131 | + | |
| 132 | + | |
| 133 | + | |
132 | 134 |
| |
133 | 135 |
| |
134 |
| - | |
| 136 | + | |
| 137 | + | |
| 138 | + | |
135 | 139 |
| |
136 | 140 |
| |
137 | 141 |
| |
138 |
| - | |
| 142 | + | |
| 143 | + | |
| 144 | + | |
139 | 145 |
| |
140 | 146 |
| |
141 | 147 |
| |
142 | 148 |
| |
143 | 149 |
| |
144 |
| - | |
| 150 | + | |
| 151 | + | |
| 152 | + | |
145 | 153 |
| |
146 | 154 |
| |
147 | 155 |
| |
| |||
171 | 179 |
| |
172 | 180 |
| |
173 | 181 |
| |
| 182 | + | |
174 | 183 |
| |
175 | 184 |
| |
176 | 185 |
| |
|
Diff for: d3rlpy/algos/qlearning/cql.py
+19-6
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
100 | 100 |
| |
101 | 101 |
| |
102 | 102 |
| |
| 103 | + | |
103 | 104 |
| |
104 | 105 |
| |
105 | 106 |
| |
| |||
142 | 143 |
| |
143 | 144 |
| |
144 | 145 |
| |
145 |
| - | |
146 | 146 |
| |
147 | 147 |
| |
148 | 148 |
| |
| |||
182 | 182 |
| |
183 | 183 |
| |
184 | 184 |
| |
185 |
| - | |
| 185 | + | |
| 186 | + | |
| 187 | + | |
186 | 188 |
| |
187 | 189 |
| |
188 |
| - | |
| 190 | + | |
| 191 | + | |
| 192 | + | |
189 | 193 |
| |
190 | 194 |
| |
191 | 195 |
| |
192 |
| - | |
| 196 | + | |
| 197 | + | |
| 198 | + | |
193 | 199 |
| |
194 | 200 |
| |
195 | 201 |
| |
196 | 202 |
| |
197 | 203 |
| |
198 |
| - | |
| 204 | + | |
| 205 | + | |
| 206 | + | |
199 | 207 |
| |
200 | 208 |
| |
201 | 209 |
| |
| |||
225 | 233 |
| |
226 | 234 |
| |
227 | 235 |
| |
| 236 | + | |
228 | 237 |
| |
229 | 238 |
| |
230 | 239 |
| |
| |||
272 | 281 |
| |
273 | 282 |
| |
274 | 283 |
| |
| 284 | + | |
275 | 285 |
| |
276 | 286 |
| |
277 | 287 |
| |
| |||
318 | 328 |
| |
319 | 329 |
| |
320 | 330 |
| |
321 |
| - | |
| 331 | + | |
| 332 | + | |
| 333 | + | |
322 | 334 |
| |
323 | 335 |
| |
324 | 336 |
| |
| |||
336 | 348 |
| |
337 | 349 |
| |
338 | 350 |
| |
| 351 | + | |
339 | 352 |
| |
340 | 353 |
| |
341 | 354 |
| |
|
0 commit comments