Q-Learning: Learning Through Rewards and Penalties

Mastering the Bellman equation, temporal-difference learning, and the exploration-exploitation trade-off.