Dueling Network Architectures for Deep Reinforcement Learning
Factoring
Value network
DOI:
10.48550/arxiv.1511.06581
Publication Date:
2015-01-01
AUTHORS (6)
ABSTRACT
In recent years there have been many successes of using deep representations in reinforcement learning. Still, these applications use conventional architectures, such as convolutional networks, LSTMs, or auto-encoders. this paper, we present a new neural network architecture for model-free Our dueling represents two separate estimators: one the state value function and state-dependent action advantage function. The main benefit factoring is to generalize learning across actions without imposing any change underlying algorithm. results show that leads better policy evaluation presence similar-valued actions. Moreover, enables our RL agent outperform state-of-the-art on Atari 2600 domain.
SUPPLEMENTAL MATERIAL
Coming soon ....
REFERENCES ()
CITATIONS ()
EXTERNAL LINKS
PlumX Metrics
RECOMMENDATIONS
FAIR ASSESSMENT
Coming soon ....
JUPYTER LAB
Coming soon ....