[ICML 2024] Author's Implementation of RVI-SAC
-
Updated
Jan 1, 2026 - Python
[ICML 2024] Author's Implementation of RVI-SAC
MDP Battery decision-making framework, 2024-2025.
Code for the numerical experiments in Zhang, Sheng, Zhe Zhang, and Siva Theja Maguluri. "Finite Sample Analysis of Average-Reward TD Learning and Q-Learning."
Efficient solving of large Single Input Superstate Decomposable MDP with application to PV Energy storage, 2025-2026.
Add a description, image, and links to the average-reward topic page so that developers can more easily learn about it.
To associate your repository with the average-reward topic, visit your repo's landing page and select "manage topics."