Asynchronous Advantage Actor-Critic

[eɪˈsɪŋkrənəs ədˈvɑːntɪdʒ ˈæktə ˈkrɪtɪk]

Machine Learning

Last updated: December 9, 2024

Definition

A parallel implementation of the actor-critic architecture that runs multiple learning agents asynchronously.

Detailed Explanation

A3C runs multiple agents in parallel, each interacting with its own copy of the environment. This parallelization provides diverse experience and stable learning. It uses an advantage function to reduce variance in policy gradient estimates and combines this with asynchronous updates to a global network.

Use Cases

Distributed training systems, game AI, robot control, parallel simulations

Definition

Detailed Explanation

Use Cases

Related Terms

Q-Learning

Hyperparameter Optimization

K-Nearest Neighbors

Help

People also viewed

Create AI Tools

Mini Tool

Vibe code an AI Tool