Definition
A simplified version of LSTM networks that combines the forget and input gates into a single update gate
Detailed Explanation
GRUs are designed to solve the vanishing gradient problem like LSTMs but with a simpler architecture. They use update and reset gates to control information flow making them computationally more efficient while maintaining good performance.
Use Cases
Text generation sentiment analysis machine translation sequence prediction
