GitHub - anujnm/EducationMDP: A Markhov Decision Process to demonstrate reinforcement learning

A Markhov Decision Process to demonstrate reinforcement learning.

This MDP assumes that the agent has recently graduated from their undergrad program, and the agent must now choose one of three choices:

The system is stochastic, and each choice has its own reward.

The MDP leverages the BURLAP library (burlap.cs.brown.edu), and performs policy iteration, value iteration, and Q learning.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
resources/img		resources/img
src		src
README.md		README.md

Provide feedback