Dueling Network Architectures for Deep Reinforcement Learning

Factoring Value network
DOI: 10.48550/arxiv.1511.06581 Publication Date: 2015-01-01
ABSTRACT
In recent years there have been many successes of using deep representations in reinforcement learning. Still, these applications use conventional architectures, such as convolutional networks, LSTMs, or auto-encoders. this paper, we present a new neural network architecture for model-free Our dueling represents two separate estimators: one the state value function and state-dependent action advantage function. The main benefit factoring is to generalize learning across actions without imposing any change underlying algorithm. results show that leads better policy evaluation presence similar-valued actions. Moreover, enables our RL agent outperform state-of-the-art on Atari 2600 domain.
SUPPLEMENTAL MATERIAL
Coming soon ....
REFERENCES ()
CITATIONS ()
EXTERNAL LINKS
PlumX Metrics
RECOMMENDATIONS
FAIR ASSESSMENT
Coming soon ....
JUPYTER LAB
Coming soon ....