PPR

MDP R implementation of the PPR problem

Run PPR_nonStationary_example.r

generates a PPR problem at random Specify # of sites (<7) and # of time steps

solves finite horizon non stationary MDP

simulate a trajectory and plots results.

Worth remembering: how to convert a site states [site1 site2] into an MDP state stateID

When using getState it returns state id from index 0, to use the state to check the action in policy, requires +1 as MDP states start at 1 not 0.

> x= c(2,1,0) # x codes the state of 3 sites: site 1 Converted, site 2 is purchased, site 3 is available
> getState(x) # call get state function to return the stateID
[1] 5       # state id provided from index 0 - must add +1 for querying policy/P/R/V
> policy[6,2] # query policy for state id 6 at timestep 2
[1] 3        # optimal action is purchase site 3
> #Similarly:
> getSite(5,3) # getSite assumes stateID index starts at 0 - Checking we get x back
[1] 2 1 0
> getSite(6,3) # check we don't get x when calling getSite(stateID+1, total number of sites)
[1] 0 2 0

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
LICENSE		LICENSE
PPR_nonStationay_example.r		PPR_nonStationay_example.r
PPR_toyproblem.R		PPR_toyproblem.R
README.md		README.md
binvec2dec.r		binvec2dec.r
dec2binvec.r		dec2binvec.r
explore_solution_PPR.r		explore_solution_PPR.r
explore_solution_reserve.r		explore_solution_reserve.r
getSite.r		getSite.r
getState.r		getState.r
mdp_LP.r		mdp_LP.r
mdp_Q_learning.r		mdp_Q_learning.r
mdp_bellman_operator.r		mdp_bellman_operator.r
mdp_check.r		mdp_check.r
mdp_check_square_stochastic.r		mdp_check_square_stochastic.r
mdp_computePR.r		mdp_computePR.r
mdp_computePpolicyPRpolicy.r		mdp_computePpolicyPRpolicy.r
mdp_eval_policy_TD_0.r		mdp_eval_policy_TD_0.r
mdp_eval_policy_iterative.r		mdp_eval_policy_iterative.r
mdp_eval_policy_matrix.r		mdp_eval_policy_matrix.r
mdp_eval_policy_optimality.r		mdp_eval_policy_optimality.r
mdp_example_PPR_non_stationary.r		mdp_example_PPR_non_stationary.r
mdp_example_forest.r		mdp_example_forest.r
mdp_example_rand.r		mdp_example_rand.r
mdp_example_reserve.r		mdp_example_reserve.r
mdp_finite_horizon.r		mdp_finite_horizon.r
mdp_finite_horizon_nonStationary.r		mdp_finite_horizon_nonStationary.r
mdp_policy_iteration.r		mdp_policy_iteration.r
mdp_policy_iteration_modified.r		mdp_policy_iteration_modified.r
mdp_relative_value_iteration.r		mdp_relative_value_iteration.r
mdp_span.r		mdp_span.r
mdp_value_iteration.r		mdp_value_iteration.r
mdp_value_iterationGS.r		mdp_value_iterationGS.r
mdp_value_iteration_bound_iter.r		mdp_value_iteration_bound_iter.r
reserve_example.r		reserve_example.r

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PPR

Worth remembering: how to convert a site states [site1 site2] into an MDP state stateID

About

Releases

Packages

Languages

License

ashander/PPR

Folders and files

Latest commit

History

Repository files navigation

PPR

Worth remembering: how to convert a site states [site1 site2] into an MDP state stateID

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages