Advancements in Deep Reinforcement Learning: A Comprehensive Survey on Policy Optimization Techniques

Authors

  • Deepak Harish Associate Professor, Department of Computer Science & Engineering, PSG College of Technology, Coimbatore, India. Author

DOI:

https://doi.org/10.63282/3050-9246.IJETCSIT-V1I2P101

Keywords:

Deep Reinforcement Learning, Policy Optimization, Policy Gradient Methods, Actor-Critic, Trust Region Methods, Proximal Policy Optimization, Model-Based Reinforcement Learning, Sample Efficiency, Exploration Strategies, Generalization in DRL

Abstract

Deep Reinforcement Learning (DRL) has emerged as a powerful paradigm for solving complex decision-making problems in various domains, including robotics, gaming, and autonomous systems. At the core of DRL lies the optimization of policies that map states to actions, enabling agents to learn optimal behaviors through interaction with their environment. This paper provides a comprehensive survey of recent advancements in policy optimization techniques in DRL. We categorize and discuss the key methods, including policy gradient methods, actor-critic algorithms, and model-based approaches. We also explore the challenges and future directions in the field, highlighting the integration of DRL with other machine learning techniques and the application of DRL in real-world scenarios. The paper aims to serve as a valuable resource for researchers and practitioners interested in the latest developments in DRL

Downloads

Download data is not yet available.

References

[1] Sutton, R. S., & Barto, A. G. (2018). Reinforcement Learning: An Introduction. MIT Press.

[2] Lillicrap, T. P., Hunt, J. J., Pritzel, A., Heess, N., Erez, T., Tassa, Y., & Silver, D. (2015). Continuous control with deep

reinforcement learning. arXiv preprint arXiv:1509.02971.

[3] Schulman, J., Levine, S., Abbeel, P., Jordan, M., & Moritz, P. (2015). Trust region policy optimization. In International

Conference on Machine Learning (pp

[4] https://escholarship.org/content/qt9z908523/qt9z908523.pdf?t=otc2ko

[5] https://par.nsf.gov/servlets/purl/10321727

[6] https://arxiv.org/html/2502.06869v1

[7] https://jmlr.org/papers/volume20/18-476/18-476.pdf

[8] https://www.mdpi.com/1424-8220/23/7/3762

[9] https://www.researchgate.net/publication/389064637_The_advancements_and_applications_of_deep_reinforcement_learning

_in_Go

[10] https://ieeexplore.ieee.org/document/8103164

[11] https://www.researchgate.net/publication/361719832_A_survey_on_deep_reinforcement_learning_architectures_applications

_and_emerging_trends

Published

2020-05-04

Issue

Section

Articles

How to Cite

1.
Harish D. Advancements in Deep Reinforcement Learning: A Comprehensive Survey on Policy Optimization Techniques. IJETCSIT [Internet]. 2020 May 4 [cited 2025 Sep. 13];1(2):1-7. Available from: https://ijetcsit.org/index.php/ijetcsit/article/view/41

Similar Articles

11-20 of 239

You may also start an advanced similarity search for this article.