Correlated Independence: Why Redundant Storage Systems Share the Same Fate

Authors

  • Mallikarjun Vppalapati Sr Technical Consultant at Hitachi Vantara, USA. Author
  • Phani Kumar Talasila Storage Engineer III at Romedica Health Systems, USA. Author

DOI:

https://doi.org/10.63282/3050-9246.IJETCSIT-V3I1P119

Keywords:

Redundant Storage Systems, Correlated Failures, Fault Tolerance, Data Reliability, RAID, Distributed Storage, Cloud Infrastructure, Storage System Design, Data Center Reliability, Hardware Failures, Software Failures, Failure Correlation Metrics, Redundancy Strategies, Risk Assessment, Predictive Reliability, Storage Resilience, Environmental Factors

Abstract

To maintain data availability and fault tolerance, redundant storage systems such as RAID arrays and distributed storage architectures are generally implemented. The usual approach assumes that redundancy by itself ensures independence, i.e., the failure of a single component does not coincide with the failure of other components. However, studies based on experience reveal that this assumption fails most of the time: components which are considered independent regularly show correlated failures, i.e., several components fail simultaneously due to a common cause or a combination of factors. Such factors may be common hardware designs, software bugs, firmware interactions, human operation mistakes, or environmental conditions such as temperature or power variations. These situations of correlation go against the functioning of redundancy and therefore risk the continuity of data centers, cloud infrastructures, and other crucial data systems where the service continuity and data integrity are mandatory. For this purpose, analysis of failure events was performed statistically, various storage installations were evaluated for correlation, and a detailed study of the case was done to identify the examples of synchronized failure from the present. The study states that the correlation between “independent” storage components is higher than the conventional assumption, thus fault-tolerance models are challenged. Here, the emphasis is laid on the necessity of revising the concept of redundancy, incorporating correlated failure models into system designs and risk assessments, and upgrading the predictive tools for the reliability of storage. Therefore, taking into account correlated failures is a prerequisite for the development of storage systems that are really self-supporting, so that redundancy not only guarantees protection but also avoids inducing a false feeling of ​‍​‍​‍‌security.

Downloads

Download data is not yet available.

References

[1] Ganesan, Aishwarya, et al. "Redundancy does not imply fault tolerance: Analysis of distributed storage reactions to file-system faults." ACM Transactions on Storage (TOS) 13.3 (2017): 1-33.

[2] Chechik, Gal, et al. "Reduction of information redundancy in the ascending auditory pathway." Neuron 51.3 (2006): 359-368.

[3] Parakala, Adityamallikarjunkumar, and Aaron Bell. "How Citizen Developers Changed the Game." American International Journal of Computer Science and Technology 3.5 (2021): 14-24.

[4] Pinheiro, Eduardo, Ricardo Bianchini, and Cezary Dubnicki. "Exploiting redundancy to conserve energy in storage systems." Proceedings of the joint international conference on Measurement and modeling of computer systems. 2006.

[5] Nath, Suman, et al. "Subtleties in Tolerating Correlated Failures in Wide-area Storage Systems." NSDI. Vol. 6. 2006.

[6] Kumar Doodala, Appala Nooka. “Intelligent EOB ERA Generation and Validation Framework on Legacy Systems Like Mainframes”. International Journal of Emerging Research in Engineering and Technology, vol. 2, no. 1, Mar. 2021, pp. 111-2.

[7] Gaddam, Rohit Reddy. “Hermetic ML Environments Using Conda-Lock and Docker”. American International Journal of Computer Science and Technology, vol. 3, no. 4, July 2021, pp. 22-34

[8] Sobe, Peter, and Kathrin Peter. "Comparison of redundancy schemes for distributed storage systems." Fifth IEEE International Symposium on Network Computing and Applications (NCA'06). IEEE, 2006.

[9] Bendor, Jonathan B. Parallel systems: Redundancy in government. Univ of California Press, 1985.

[10] Klein, Stanley B. The two selves: Their metaphysical commitments and functional independence. OUP USA, 2014.

[11] Suryadevara, Siva Sai Krishna. “Generative AI–Powered Authoring Assistant for Enterprise Content Management”. International Journal of Artificial Intelligence, Data Science, and Machine Learning, vol. 2, no. 2, June 2021, pp. 103-1

[12] Muralidhar, Subramanian, et al. "f4: Facebook's warm {BLOB} storage system." 11th USENIX Symposium on Operating Systems Design and Implementation (OSDI 14). 2014.

[13] Kahana, Michael J., Daniel S. Rizzuto, and Abraham R. Schneider. "Theoretical correlations and measured correlations: relating recognition and recall in four distributed memory models." Journal of Experimental Psychology: Learning, Memory, and Cognition 31.5 (2005): 933.

[14] Katangoori, Sivadeep, and Anudeep Katangoori. “AI-Augmented Data Governance: Enabling Intelligent Access, Lineage, and Compliance Across Hybrid Clouds”. American Journal of Autonomous Systems and Robotics Engineering, vol. 1, Nov. 2021, pp. 716-38

[15] Espinola, Sergio Martin, et al. "Cis-regulatory chromatin loops arise before TADs and gene activation, and are independent of cell fate during early Drosophila development." Nature genetics 53.4 (2021): 477-486.

[16] Gaddam, Rohit Reddy. “Vertex AI As a Unified Control Plane for MLOps”. International Journal of Artificial Intelligence, Data Science, and Machine Learning, vol. 2, no. 2, June 2021, pp. 92-102

[17] Massicotte, Philippe, and Jean-Jacques Frenette. "Spatial connectivity in a large river system: resolving the sources and fate of dissolved organic matter." Ecological Applications 21.7 (2011): 2600-2617.

[18] Muppaneni, Kavya. “Cross-Browser Debugging Strategies”. American International Journal of Computer Science and Technology, vol. 3, no. 5, Sept. 2021, pp. 25-3

[19] Cadrin, Steven X., Lisa A. Kerr, and Stefano Mariani, eds. "Stock identification methods: applications in fishery science." (2013).

[20] Kok, Fatma O., et al. "Reverse genetic screening reveals poor correlation between morpholino-induced and mutant phenotypes in zebrafish." Developmental cell 32.1 (2015): 97-108.

[21] Muppaneni, Rajarshi Krishna. “How Enterprises Are Achieving 360° Customer Views With Dynamics 365”. International Journal of AI, BigData, Computational and Management Studies, vol. 2, no. 2, June 2021, pp. 129-38

[22] Li, Peng, et al. "Occurrence and fate of antibiotic residues and antibiotic resistance genes in a reservoir with ecological purification facilities for drinking water sources." Science of the Total Environment 707 (2020): 135276.

[23] Parakala, Adityamallikarjunkumar. "Building Analytics-Driven Bots: RPA Meets Business Intelligence." International Journal of Emerging Research in Engineering and Technology 2.1 (2021): 77-87.

[24] Láruson, Áki J., Sam Yeaman, and Katie E. Lotterhos. "The importance of genetic redundancy in evolution." Trends in ecology & evolution 35.9 (2020): 809-822.

Published

2022-03-30

Issue

Section

Articles

How to Cite

1.
Vppalapati M, Talasila PK. Correlated Independence: Why Redundant Storage Systems Share the Same Fate. IJETCSIT [Internet]. 2022 Mar. 30 [cited 2026 May 31];3(1):169-7. Available from: https://ijetcsit.org/index.php/ijetcsit/article/view/729

Similar Articles

41-50 of 593

You may also start an advanced similarity search for this article.