Definition
Neural network components that allow models to focus on relevant parts of input data
Detailed Explanation
Attention mechanisms compute weighted sums of input features, allowing models to dynamically focus on relevant information. They are crucial for handling variable-length input sequences and improving model interpretability.
Use Cases
Machine translation, image captioning, text summarization