Platformable logo
Understand
watch21 min read
email

Health Data Governance: An Overview

Written by Mark Boyd
Updated at Mon Aug 25 2025
featured image

Who should read this:

Data governance leads, data policy managers, and data managers across the health sector (pharmaceutical companies, healthcare providers, non-government organisations, community health, hospital networks, biotech, health tech startups)

What it’s about:

Our health data governance model looks beyond technical aspects to responsible, ethical and equitable health data management and use policies and processes can centre on data ethics and data justice principles, enable value from data, build trust, reduce resource use and generate benefits from patients to communities, commercial enterprises, society, and the environment.

Why it’s important:

Health data governance requires specialised skills around managing data securely, in compliance with regulations, and with a view to generating benefits for patients, organisations and society. Well-managed data can aid in improving population health and where people trust systems and structures to share data responsibly, data can be combined in new ways to amplify benefits for patients, organisations, society, and the environment.

At Platformable, we support the development of open digital ecosystems. A big part of that is ensuring that data is well managed: responsibly, ethically, and equitably. That means that open digital ecosystems need to have rules of engagement around how data is used. These rules of engagement are called "data governance". In the health sector, this is especially important as health data is personal and sensitive, but also because good use of data could save lives, reduce inequalities, improve life quality, prevent health risks, and optimise the resources available to spend on health care. Health data governance is needed at multiple levels: for the country or region, as well as by each stakeholder in the open digital health ecosystem.

Previous experience in Health Data Governance

My work and the work of Platformable has involved data governance across a range of sectors, in particular often focusing on health data governance. This has included:

Created population health and wellbeing data governance models for local governments in Australia and to track data on health inequalities for state government
Researched data governance maturity for WHO
Conducted workshops on health data governance of secondary use of health data and on the value of data across Europe
Developed a health data governance model
Analysed health data governance regulatory frameworks for three global projects, including Europe, Western Balkans, and Canada
Mentored and helped build data teams for health-based community organisations in the US and New York City, focused on HIV/AIDS and health inequalities
Built data governance software for community health organisations
Created a health data governance training course
Health Data Governance (Platformable's definition)
Data governance is the ability of organisations to use health data in a secure, safe, ethical, and equitable manner with the goal of generating personal, commercial, societal, and environmental benefits. Health data governance includes infrastructure, policies and procedures, and secure data sharing. Health data governance fosters open health data ecosystems that allow organisations to effectively collect, manage, use, and share data with other stakeholders to achieve shared outcomes.

Health Data Governance in an ecosystem context

When we look at health data governance, we consider how it impacts across the digital health ecosystem. Many health data governance models focus on the technical aspects (such as defining data models) and on legal and regulatory compliance requirements (such as labeling data so that it is easier to ensure it is managed in a way that meets data protection requirements). We take a step back and consider the wider digital health ecosystem context in which data is collected, processed, used, analysed, and shared. Our health data governance framework reflects the central role that responsible and ethical data management plays in fostering a trustworthy and effective digital health ecosystem.

In a digital health ecosystem, data can flow between multiple stakeholders. 

Data governance is often set at a geographic, or regional level: governments introduce data protection and data sharing regulations and establish regulators to ensure that data is managed by all stakeholders appropriately. 

Policies and approaches to aid the use of data, such as digital health or interoperability policies, help data flow in an ecosystem. 

Each stakeholder collecting, consuming and using data may then also have health data governance frameworks and systems in place. 

And a range of capacity-building stakeholders including consultants, trainers, and tools providers/software vendors help support data governance best practice implementation. 

There are also components and enablers that increase the impact of data governance activities to generate benefits. For example, a good process of continuous engagement and continuous consent where health subjects can give consent to sharing their health data, understand the risks and see how sharing data creates benefits will help augment trustworthiness in a digital health ecosystem. 

Documented and shared health data governance artefacts, training, tools and infrastructure (including health data research platforms where data can be analysed but not removed) can all embed health data governance processes, simplifying the adoption of health data governance best practices by ecosystem stakeholders.

Digital Health Ecosystem 2025_without AI.png

In a digital health ecosystem, data governance is a context-setting component that can contribute to value generation. We have identified 6 key benefits that can be generated from functioning digital health ecosystems:

Better health outcomes
Increased coordination to support high-quality care throughout the patient journey
Data is used to seamlessly provide care across a range of healthcare providers where needed, reduces risks of adverse reactions and contraindications from prescribed therapies, and enables personalised medicine advancements.
Reduced Inequality
Using data to better target services and disaggregate data to understand the impacts on vulnerable populations
Data allows insight into differential health outcomes by gender and socioeconomic status.
Optimised health system
Achieving efficiencies in care delivery and resource allocation, decreasing the cost of care while maintaining high quality and satisfaction
Data is used to reduce inefficiencies and ensure care is provided through early diagnosis when health care delivery costs are cheaper. Data can also be used to set prevention policies to reduce future strain on healthcare systems.
Greater patient-public participation
Understanding of connections and collaborations, and a sense of engagement and involvement in health decision-making by communities and all stakeholders
Access to data enables deeper conversations between patients and health providers, and allows the public to participate in discussions on health research, building trust and supporting local industry growth.
Expanded innovation
New industry opportunities and deeper insights into existing systems
Reuse of population health data and research data accelerates innovation, identifying high-potential opportunities for healthcare product development.
Professional recognition & Private profits
Support for researchers and other stakeholders that collect, use, and share data
Academics can build careers through data reuse and recognition. For-profit businesses can leverage health data to innovate, remain profitable, and drive economic growth.

The virtuous cycle/theory of change for why Health Data Governance generates impacts

Health data governance policies and practices occur at multiple levels and is managed by multiple stakeholders.

+
At the government/regulator (context-setting) level: Sets regulatory and legal frameworks for data governance, funds data governance best practice research and adoption
+
At the capacity-building level: Builds guidelines, tools, digital public infrastructure and training services to support data governance adoption
+
At the individual stakeholder level: Implements their own health data governance policies and processes aligned with the context-setting level and leveraging the capacity-building components

 

The adoption of health data governance policies and processes increases trust and encourages stakeholders to share and make us of health data responsibly and ethically. Which, in turn, amplifies the value generated across the ecosystem.

Platformable's Health Data Governance model

Overview of the health data governance framework

We position health data governance within an overall data justice framework.

Data Justice (Platformable's definition)
Data justice is a set of ideas that addresses the way in which people are represented and/or harmed through their data being made visible to others, or conversely, excluded from the public eye, sites of power, and the process of decision-making. It confronts and challenges structural biases in the ways that we think about, collect, steward, and use data. (From: Royal Roads University, https://libguides.royalroads.ca/datajustice)

There are a variety of models that offer approaches that address data justice:

Data Ethics
Frameworks
FAIR
and CARE
OCAP
Patient-Centred Care
and Participatory Models

Each digital health ecosystem could define their data justice and data ethics principles using these or other models to ensure that as they create the various data governance framework components described below, that each aligns with these defined principles. Otherwise, there is a high risk that the collection and use of data will not build trustworthy infrastructure and institutions, which decreases the strength, capabilities and value able to be generated in the digital health ecosystem.

Within this data justice context, there are two phases of health data governance:

Phase 01

Policies and processes for robust health data governance

Phase 02

Implementation technologies and tools that automate the policies and processes

A fourth level then looks at how data quality can be maintained over time and that the data and processes continue to generate value while continuallly aligned with the defined data justice context.

Detailed overview of Plkatformable's health data governance framework

Core Health Data Governance components

In Phase 1, the following policies, processes and artefacts can be created:
 

Data policy
A documented set of guidelines that ensures your organisation's data and information assets are managed consistently and used properly. The policy typically includes separate sections for data quality, access, security, privacy, and usage, as well as roles and responsibilities for implementing those policies and monitoring compliance with them.
How do we manage data ethically and responsibly?
Data policies
Logic model / Theory of change / Operational model
Use of a model to describe how data is collected and the intended impacts of collecting and using this data.

A Program Logic Model is a schema or visual outline that demonstrates how your program is designed. The Theory of Change approach to program design is a well-defined methodology that explains how an intervention is expected to result in a specific change as it relates to the pathway that occurs between causes and effects. The Operational Model establishes a causal pathway that shows the connection between causes and effects and includes any intermediary variables that may impact the pathway.
What do we need to collect and why?
Visualisation of model
Ecosystem and persona maps
Using a digital health ecosystem to map all local, relevant stakeholders in your ecosystem can help identify where opportunities to build partnerships exist and where existing resources and approaches can be drawn on.

Persona maps can then be used to identify how a particular local stakeholder interacts with your organisation when using data or digital products and services.
Who will use the data and how?
Ecosystem model
Persona templates
Reference Library
A reference library is a catalog of all Data Standards, External Datasets and Data Models used by your organisation.
What external datasets can we use to demonstrate need and impact?
Standards library
Data licenses
Dataset library
Open data models
Data Dictionary
A data dictionary describes an individual dataset in your data inventory and each individual data field within that dataset. At the dataset level, metadata fields could include a description of the dataset, the methodology used to collect the data, how often it is updated, how it should be used, any limitations and equity considerations that should be kept in mind, when it was last updated and so on. At the data field level, a data dictionary would list each data field, describe what that data is about, define any validation rules and formatting rules, describe any alignments with other datasets, score the level of risk management required to manage the data, describe any specific business rules for collecting the data, provide a sample of what the data field would look like, and so on.
How do we standardise and consistently describe each data item we collect?
Data inventory for all datasets you create and manage
Metadata file for each dataset in your inventory
Data dictionary for each dataset in your inventory
Workflow
Documentation for each process identified as part of your operational framework describing how data will be collected, processed, managed, used and shared, including business process guides and workflow diagrams.
What is the flow from collection to processing to analysis to use?
Workflow diagrams for each process
Business process management tools

Implementation Health Data Governance components

In Phase 2, the policies and processes can be implemented using the following tools:
 

Data tech and architecture
Documents that describe your data architecture and choice of tools
How do we build a tech stack that meets our current and future needs?
Architecture diagrams
Tools library
Internal developer portal
Wiki
Surveys
Data collection instruments that align with your data dictionaries
What are the standard data instruments/surveys we use to collect data?
Data spreadsheets/tables
Data surveys
Data collection instruments
Database
Databases that collect, store, clean/transform and catalogue all data within the datasets that are managed by your organisation
How do we store all of the data?
Databases
Integrations
Workflow automation tools (including AI) that help to automate processes and reduce manual handling of data including data entry
How do we reduce duplication and move data from survey to database to reporting?
Workflow integration tools
Reports
Reporting and analysis form factors that allow data to be more easily accessed and understood by analysts and other users
How do we report on the data findings to identified personas?
Report templates
Dashboards
Publications

Once health data is being collected, managed, analysed, used and shared, there is the need for ongoing quality review and data assurance mechanisms to be in place.

Our series on health data governance, being released in September 2025, will go into further detail on each component of this model.

member image

Mark Boyd

DIRECTORmark@platformable.com

Related article