Diverse Data
The initiative aims to reduce health inequalities and improve patient outcomes in genomic medicine for minoritised communities.
The Diverse Data vision
Our vision is that all patients, regardless of their background, receive the same quality of genomics-enabled personalised medicine, supported by the latest research on people like them.
Read and comment on a live version of our strategy to see our plans and input into our design.

Visualisation of data bias by Stef Posavec
What makes human genomes diverse?
Human genomes are 99.9% identical; however, it's differences in the 0.1% which make each of us unique – whether in terms of our hair colour, our like/dislike of certain foods, and even our sleeping patterns.
Variations in genomes can results from factors such as human migration patterns, our ancestry and choice of mate, and oftentimes environmental factors (which along with genetics, can contribute to a trait such as height and weight).
These variants tend to appear at different frequencies across different populations. Sometimes, they're rare and impact only specific families. At other times, the variants are common and can be found across populations.
Europeans represent 78%
Polygenic risk scores are 4.5x
7% of significant associations
While only 2%
Addressing the gap in diverse genomic data
The Diverse Data Initiative will deliver on its aims through four streams of activities:
Research & discovery: Understand the data gap
Improve our understanding of genomic diversity by reviewing, stimulating and conducting research into diversity and its impacts on scientific, clinical and health system outcomes.
Community & engagement: Close the gaps, together
Convene and work with patient, genomic and data communities to design, develop and implement equity-enhancing strategies.
Sequencing & data: Fill the data gap
Increase the volume and depth of genomic data available on individuals from under-represented groups by sequencing genomes, and generating, linking and facilitating better access to data from diverse populations.
Products, tools, & behaviours: Bridge the data gap
Work with clinicians, analysts, researchers, patients, and community groups to develop tools, and processes to improve research, service-delivery practices, recruitment and care.
The Diverse Data team
You can get to know each member of the team in this blog post. If you have questions about the Diverse Data programme, please use the contact button below.
Upcoming events involving the Diverse Data team
Events hosted by the Diverse Data team or events where a member of the team is speaking
Latest blogs from the Diverse Data team
Often work is shared at the end and has a shiny, positive slant to it. Issues of data diversity can raise complex scientific, technical as well as social and emotional debates. The Diverse Data Initiative aims to share more during the process to show the journey, and hopefully encourage others to course direct. To learn more about how the team works, interim outputs and what we’re thinking, have a read of our blogs.
Our CEO Chris Wigley shares some of his hopes and dreams for the future of Genomics England, the patients and participants we serve, and the wider genomics landscape.
The next genomics decade
Sam Tallman, a genomic data scientist, reviews an approach commonly used to assign participants into genetically defined ancestry groups using genomic data from the 100,000 Genomes Project.
Inclusive ancestry group assignment in genetics research
The Diverse Data team at Genomics England discuss recommendations from a recent review into the potential unintended consequences of efforts to diversify genomic data.
Ethical, legal and social issues in diversifying data: A Genomics England & Centre for Personalised Medicine collaboration
More resources on data diversity
Explore other initiatives
Cancer 2.0
Our initiative to integrate long-read sequencing technology and multimodal…
Newborn Genomes Programme
This initiative explores the benefits, challenges, and practicalities of…
100,000 Genomes Project
Information on the 100,000 Genomes Project, including an overview of the…
COVID-19 Study
Read about our partnership with the GenOMICC consortium, led by the…