Hexapawn Game Engine
- Proper 3×3 board with pawn movement

Strategic RL Agents
- Minimax with Alpha-Beta Pruning (depth configurable 1-7)
- Q-Learning with temporal difference updates
- Experience replay for efficient learning
- Epsilon-greedy exploration with decay
- Multi-level decision hierarchy (immediate threats → strategic planning)
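
To make the engine and search features above concrete, here is a minimal sketch of a 3×3 Hexapawn move generator with a depth-limited alpha-beta search. It is not taken from the repository: the board encoding and the names `START`, `legal_moves`, `apply_move`, `winner`, and `negamax` are hypothetical, the search uses the negamax formulation of minimax, and the neutral score of 0 at the depth limit is an assumed placeholder for whatever heuristic the project actually uses.

```python
import math
from typing import List, Optional, Tuple

Board = Tuple[str, ...]   # 9 cells, row-major, row 0 at the top (assumed encoding)
Move = Tuple[int, int]    # (from_index, to_index)

START: Board = ("B", "B", "B",
                ".", ".", ".",
                "W", "W", "W")


def opponent(player: str) -> str:
    return "B" if player == "W" else "W"


def legal_moves(board: Board, player: str) -> List[Move]:
    """Pawns step one square forward onto an empty cell or capture diagonally forward."""
    step = -3 if player == "W" else 3   # White moves up the board, Black moves down
    enemy = opponent(player)
    moves: List[Move] = []
    for src, piece in enumerate(board):
        if piece != player:
            continue
        dst = src + step
        if not 0 <= dst < 9:
            continue
        if board[dst] == ".":
            moves.append((src, dst))
        col = src % 3
        for dc in (-1, 1):
            if 0 <= col + dc < 3 and board[dst + dc] == enemy:
                moves.append((src, dst + dc))
    return moves


def apply_move(board: Board, move: Move) -> Board:
    src, dst = move
    cells = list(board)
    cells[dst], cells[src] = cells[src], "."
    return tuple(cells)


def winner(board: Board, to_move: str) -> Optional[str]:
    """A side wins by reaching the far rank or by leaving the side to move without a move."""
    if "W" in board[0:3]:
        return "W"
    if "B" in board[6:9]:
        return "B"
    if not legal_moves(board, to_move):   # also covers "all pawns captured"
        return opponent(to_move)
    return None


def negamax(board: Board, player: str, depth: int,
            alpha: float = -math.inf, beta: float = math.inf
            ) -> Tuple[float, Optional[Move]]:
    """Return (score, best_move) from the viewpoint of `player`, who is to move."""
    won = winner(board, player)
    if won is not None:
        return (1.0 if won == player else -1.0), None
    if depth == 0:
        return 0.0, None                  # assumed neutral value at the depth limit
    best_score, best_move = -math.inf, None
    for move in legal_moves(board, player):
        child_score, _ = negamax(apply_move(board, move), opponent(player),
                                 depth - 1, -beta, -alpha)
        score = -child_score              # the opponent's gain is our loss
        if score > best_score:
            best_score, best_move = score, move
        alpha = max(alpha, score)
        if alpha >= beta:                 # remaining siblings cannot change the result
            break
    return best_score, best_move


score, move = negamax(START, "W", depth=7)
print(score, move)
```

A game of 3×3 Hexapawn lasts at most seven plies, so a depth of 7 searches the full tree from the starting position; smaller depths fall back on the placeholder value whenever the limit is reached before the game ends.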
Updated Dec 14, 2025 - Python