Definition
A hybrid approach that combines policy gradient methods (actor) with value function approximation (critic).
Detailed Explanation
Actor-Critic methods maintain both a policy (actor) and a value function (critic). The critic estimates the value function and uses it to update the actor's policy parameters, while the actor determines which actions to take. This combination often leads to reduced variance and improved learning stability.
Use Cases
Robot control, game AI, autonomous systems, process optimization