TAAFT
Free mode
100% free
Freemium
Free Trial
Create tool

Asynchronous Advantage Actor-Critic

[eɪˈsɪŋkrənəs ədˈvɑːntɪdʒ ˈæktə ˈkrɪtɪk]
Machine Learning
Last updated: December 9, 2024

Definition

A parallel implementation of the actor-critic architecture that runs multiple learning agents asynchronously.

Detailed Explanation

A3C runs multiple agents in parallel, each interacting with its own copy of the environment. This parallelization provides diverse experience and stable learning. It uses an advantage function to reduce variance in policy gradient estimates and combines this with asynchronous updates to a global network.

Use Cases

Distributed training systems, game AI, robot control, parallel simulations

Related Terms