Skip to main content

5 docs tagged with "reinforcement-learning"

View all tags

Actor-Critic Methods

Combining value-based and policy-based methods for stable and efficient reinforcement learning.

Deep Q-Networks (DQN)

Scaling Reinforcement Learning with Deep Learning using Experience Replay and Target Networks.

Policy Gradients

Optimizing the policy directly: understanding the REINFORCE algorithm, stochastic policies, and the Policy Gradient Theorem.