Automated Root Cause Analysis in SAP Landscapes Using Large Language Models and Operational Telemetry

Authors

  • Gururaj Veershetty Independent Researcher, USA. Author

DOI:

https://doi.org/10.63282/3050-9246.IJETCSIT-V7I1P127

Keywords:

Root Cause Analysis (RCA), Sap Landscape Monitoring, Large Language Models (LLMs), AIOps, Operational Telemetry, Log Intelligence, Incident Management, Observability, Mean Time To Resolution (MTTR), Retrieval-Augmented Generation (Rag), Enterprise AI, Hybrid Reasoning Systems

Abstract

Enterprise SAP landscapes are increasingly complex, distributed, and mission-critical, making rapid and accurate root cause analysis (RCA) essential for operational resilience. Traditional monitoring approaches rely on rule-based correlation, threshold alerts, and deterministic workflows, which struggle to handle cross-layer dependencies, unstructured log data, and cascading failures. This paper proposes an automated RCA framework that integrates Large Language Models (LLMs) with operational telemetry collected across SAP application, database, and infrastructure layers. The proposed architecture combines telemetry ingestion, semantic enrichment, vector-based retrieval, and LLM-driven reasoning to identify probable root causes and generate human-readable explanations. A hybrid reasoning model integrates deterministic graph-based correlation with probabilistic scoring and contextual LLM analysis to improve diagnostic accuracy while mitigating hallucination risks. Experimental evaluation across representative SAP failure scenarios including database lock contention, RFC timeouts, and background job failures demonstrates significant improvements in root cause identification accuracy and reduction in Mean Time to Resolution (MTTR) compared to traditional rule-based systems. The findings suggest that LLM-augmented AIOps systems can enable cross-domain reasoning, enhance explainability, and reduce alert fatigue in SAP operations. This work contributes a scalable architectural blueprint and evaluation methodology for deploying AI-driven RCA in enterprise SAP environments.

Downloads

Download data is not yet available.

References

[1] A. Karnam, “The architecture of reliability: SAP landscape strategy, system refreshes, and cross-platform integrations,” International Journal of Research and Applied Innovations (IJRAI), 2021. [Online]. Available: https://doi.org/10.15662/IJRAI.2021.0405005

[2] G. Kabra and H. S. Mukerjee, “Design thinking adoption in software development organizations: A SAP LAP analysis,” The Design Journal, pp. 798–816, Jul. 2023. [Online]. Available: https://www.tandfonline.com/doi/abs/10.1080/14606925.2023.2237297

[3] A. Pasumarthi, “Dynamic repurpose architecture for SAP HANA transforming DR systems into active quality environments without compromising resilience,” International Journal of Engineering & Extended Technologies Research (IJEETR), vol. 5, no. 2, 2023. [Online]. Available: https://ijeetr.com/index.php/ijeetr/article/view/104

[4] P. Jin et al., “Assess and summarize: Improve outage understanding with large language models,” in Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC/FSE), 2023, pp. 1657–1668. [Online]. Available: https://dl.acm.org/doi/abs/10.1145/3611643.3613891

[5] O. M. Hansen, “Privacy-preserving AI and machine learning for enterprise risk detection in SAP-based cloud business processes,” International Journal of Research and Applied Innovations (IJRAI), vol. 5, no. 6, Dec. 2022.[Online]. Available: https://doi.org/10.15662/IJRAI.2022.0506023

[6] A. Karnam, “SAP beyond uptime: Engineering intelligent AMS with high availability and DR through pacemaker automation,” International Journal of Research in Practical Engineering, Technology and Management (IJRPETM), Sep. 2023. [Online]. Available: https://ijrpetm.com/index.php/IJRPETM/article/view/263

[7] V. Sharma, “Cognitive data governance pipelines for autonomous multi-cloud enterprise platforms,” International Journal of Scientific Research & Engineering Trends, vol. 9, no. 6, Nov.–Dec. 2023. [Online]. Available:https://www.researchgate.net/profile/Vasudev-Sharma-3/publication/397705567

[8] E. Zeydan and J. Mangues-Bafalluy, “Recent advances in data engineering for networking,” IEEE Access, vol. 10, pp. 34449–34496, Mar. 2022. [Online]. Available: https://ieeexplore.ieee.org/abstract/document/9743922

[9] S. Annam, “Enhancing IT support for enterprise-scale applications,” Mar. 2023. [Online]. Available: https://www.researchgate.net/publication/389088869_Enhancing_IT_Support_for_Enterprise-Sca le_Applications

[10] G. Fortino, A. Guerrieri, P. Pace, C. Savaglio, and G. Spezzano, “IoT platforms and security: An analysis of the leading industrial/commercial solutions,” Sensors, vol. 22, no. 6, Art. no. 2196, Mar. 2022. [Online]. Available: https://www.mdpi.com/1424-8220/22/6/2196

[11] E. Cadet et al., “The role of reinforcement learning in adaptive cyber defense mechanisms,” International Journal of Multidisciplinary Research and Growth Evaluation, vol. 2, no. 2, pp. 544–559, Mar.–Apr. 2021. [Online]. Available:https://www.allmultidisciplinaryjournal.com/uploads/archives/20250911115802_MGE-2025-5-02 7.1.pdf .

[12] Tirumalasetty, P. (2025). Deep Graph Learning for Autonomous Data Reconciliation across Heterogeneous Enterprise Systems.

Published

2026-02-20

Issue

Section

Articles

How to Cite

1.
Veershetty G. Automated Root Cause Analysis in SAP Landscapes Using Large Language Models and Operational Telemetry. IJETCSIT [Internet]. 2026 Feb. 20 [cited 2026 Mar. 7];7(1):186-91. Available from: https://ijetcsit.org/index.php/ijetcsit/article/view/606

Similar Articles

31-40 of 476

You may also start an advanced similarity search for this article.