RL Theory #1

@eleurent

RL Theory is not properly represented. A new section should be added, with at least:

  • Tabular setting
    • With a generative model
      • QVI
    • Without a generative model
      • UCRL2
      • UCBVI
    • Episodic
      • Q-learning+UCB
  • Extensions to compact state-action spaces
  • Extensions to kernels
  • Performance measures: PAC, simple regret, cumulative regret, etc.
  • RL with compatible function approximation

Is there a difference between generative models (which can sample a transition from any state-action pair) and simulators (which can only simulate trajectories from the current state)?
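
To make the question concrete, here is a minimal sketch of one possible reading of the two access models. All names (`TabularMDP`, `GenerativeModel`, `Simulator`) and the toy two-state MDP are hypothetical and only for illustration; this is an assumption about how the distinction could be encoded, not a settled definition.

```python
import numpy as np


class TabularMDP:
    """Hypothetical tabular MDP: P[s, a] is a next-state distribution, R[s, a] a reward."""

    def __init__(self, P, R):
        self.P = np.asarray(P, dtype=float)  # shape (S, A, S)
        self.R = np.asarray(R, dtype=float)  # shape (S, A)


class GenerativeModel:
    """Can be queried at ANY (state, action) pair, in any order."""

    def __init__(self, mdp, seed=0):
        self.mdp = mdp
        self.rng = np.random.default_rng(seed)

    def sample(self, state, action):
        n_states = self.mdp.P.shape[0]
        next_state = int(self.rng.choice(n_states, p=self.mdp.P[state, action]))
        return next_state, float(self.mdp.R[state, action])


class Simulator:
    """Only samples from its own current state; the caller cannot pick the state."""

    def __init__(self, mdp, initial_state=0, seed=0):
        self.model = GenerativeModel(mdp, seed)
        self.initial_state = initial_state
        self.state = initial_state

    def reset(self):
        # The only way to move without acting is to go back to the initial state.
        self.state = self.initial_state
        return self.state

    def step(self, action):
        self.state, reward = self.model.sample(self.state, action)
        return self.state, reward


# Toy deterministic MDP (assumed for illustration): action 0 stays, action 1 switches state.
P = [[[1, 0], [0, 1]],
     [[0, 1], [1, 0]]]
R = [[0, 0],
     [1, 1]]
mdp = TabularMDP(P, R)

gen = GenerativeModel(mdp)
sim = Simulator(mdp)
```

Under this reading, `gen.sample(1, 1)` can be called directly, whereas `sim` has to be driven to state 1 through `step` before anything can be sampled there. A simulator that also allowed saving and restoring arbitrary states would presumably be as powerful as a generative model.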

          RL Theory · Issue #1 · eleurent/phd-bibliography