Data Lake Governance – Establishing a Single Source of Truth in Healthcare Enterprises
DOI:
https://doi.org/10.63282/3050-9246.IJETCSIT-V6I4P120Keywords:
Data Lake, Healthcare Analytics, Data Governance, Single Source of Truth, Interoperability, Data Quality, Metadata Management, HIPAA, Value-Based CareAbstract
Healthcare organizations generate enormous volumes of multi-modal data electronic health records (EHR), pharmacy claims, medical claims, imaging, genomics, IoT sensor streams, and administrative data. However, fragmented systems prevent efficient data sharing, analytics, and decision-making. A well-governed healthcare data lake provides a scalable architecture to integrate structured and unstructured data while maintaining quality, security, and compliance. This paper proposes a comprehensive governance framework enabling a unified Single Source of Truth (SSOT) for healthcare enterprises. The framework integrates metadata management, data lineage, interoperability standards, AI-driven quality checks, and federated access controls. The proposed model ensures trustworthy, timely, and regulated data access for clinical, operational, and financial use cases including population health, pharmacy benefit optimization, risk scoring, and value-based care. The framework further incorporates ethical safeguards to mitigate AI bias, enforce algorithmic fairness, and ensure transparency and accountability in all automated governance decisions
Downloads
References
[1] HL7 International, “FHIR Release 4,” 2021.
[2] CMS, “Risk Adjustment Data Validation,” 2020.
[3] IBM Healthcare, “AI in Data Governance,” 2023.
[4] Google Cloud Healthcare API Documentation, 2024.
[5] Khosla et al., “Interoperability in Healthcare Data Systems,” IEEE Access, 2022.
[6] HITRUST Alliance Framework, 2022.
