Skip to main content

How your data is used

None of our work would be possible without the consent and support of patients and participants – they are at the heart of everything that we do. By enabling scientists and clinicians in their research, a participant’s data goes from being an isolated dot to a key part of the picture.

Flat illustration of a diverse crowd of people walking different directions

Your data in the National Genomic Research Library

The data we collect is stored in the National Genomic Research Library, a platform built by us and NHS England that allows approved researchers to access samples, genomic data, and other associated health data via a secure Research Environment.

Researchers who use the Research Environment can learn more about everyone’s health by looking for patterns in the de-identified data of thousands of participants.

Being able to compare all genomic and health data in one place provides researchers with an opportunity to better understand diseases, develop new treatments and can lead to new diagnoses.

All data in the National Genomic Research Library has been de-identified and cannot be extracted from the Research Environment by researchers. If researchers wish to publish data, for example an academic article, this request has to be approved by a committee that checks to ensure the data is not identifiable.

Access to data is governed by an independent Access Review Committee with members, including participants, making access decisions.

What kind of data is stored in the National Genomic Research Library?

Types of de-identified data that are collected:

  • Your medical test results

  • Electronic copies of your health records from the NHS, your GP and other organisations

  • Information about any illnesses or stays in hospital – including your primary diagnosis and any historic diagnoses going back as far as medical records allow

  • Imaging data from your NHS records, such as MRI scans, X-rays or pathology images

A lifetime resource

The data we hold in the National Genomic Research Library about you will continue to be updated throughout your lifetime, for as long as you give us permission to. This lifelong health data may support the development of better treatments or diagnostics for others.

Why do we gather clinical data as well as your genome sequence?

For a deeper approach to genomic research, we need as much detail as possible about a participant’s medical condition and symptoms.

Even small differences in symptoms between individuals might be crucial in understanding what changes in a genome mean, and helping decide the best treatments.

Only by understanding your detailed clinical data are we able to understand how patterns in your genome affect your diagnosis.

What will the data be used for?

Finding treatments

Helping to find new treatments and possibly cures for a wide range of health conditions.

Improving analysis

Researchers might use the data to try and find new, faster ways to analyse large amounts of data.

Sharing knowledge

Researchers may share their findings with other scientists and doctors through publications or meetings to help research advance as quickly as possible.

Developing drug and diagnostic tests

New drugs and diagnostic tests may be developed by the NHS, universities and companies across the world.

Keeping your data safe

De-identifying data in the National Genomic Research Library

Maintaining your data security and protecting your privacy is our top priority

To maintain your privacy, information that could identify you – like name, date of birth, and all other personal details – is removed from your health records and genomic data.

The process of de-identifying data allows researchers to access detailed information without knowing who individual participants are.

However, genomic data is very rich and complex, and many of our participants may have rare conditions that make them more unique. Whilst we put in place strong protections for the data we cannot completely eliminate the risk that someone may be identified.

Keeping your data safe

How is data stored and secured?

Genome data is enormous, and we use a cloud service and other secure data centres to provide a secure computing and storage infrastructure. All participant data is held in secure facilities based in the UK.

All of the data from the National Genomic Research Library stays within the secure, monitored Research Environment where it can be analysed by researchers.

How we protect your data:

  • We have a multi-layered approach to cyber security using best practice across people, process, and technology and regularly undertake security tests
  • We meet laws and standards for data protection
  • We remain vigilant to ensure we are protected against current and evolving threats and work closely with colleagues across the public and private sector including the National Cyber Security Centre (NCSC)

Who has access to the data?

Anyone with access to the de-identified data in the National Genomic Research Library must be approved. Access to the data is governed by an independent Access Review Committee with members, including participants, making access decisions.

Researchers

Approved academic researchers may work for not-for-profit organisations, such as universities, hospitals, or research charities and must be part of our Research Network. Approved industry researchers such as those from pharma, biotech or diagnostic companies may also work with de-identified data to better understand diseases and develop diagnostics and treatments.

Clinicians and healthcare staff

Doctors, nurses and other healthcare professionals in the NHS have access to information about the patients they are caring for.

This is to enable clinical staff to see any results or findings from whole genome sequencing for patients in their care.

Service providers

Approved organisations who provide the data itself, IT support, computing infrastructure, data storage, genome analysis, or other technical services.

Each company only has access to the part of the data centre they are working on or supporting, and all individuals have undergone Information Governance training and signed confidentiality agreements.

The data will never be used for insurance or marketing purposes, nor for speculative searches, such as checks of DNA profiles or other information derived from your sample.

The Department of Health and Social Care has had confirmation from the Home Office and the Association of Chief Police Officers that they will not seek access to data held in the National Genomic Research Library without presentation of a court order.

For further details of unacceptable uses, please see the list as detailed in the Research Ethics Committee approved National Genomic Research Library Governance Framework.

Read the National Genomic Research Library Governance Framework (DOCX, 3MB)

FAQs

Questions about the National Genomic Research Library?

If you would like to make a data request, please see our data requests page. To contact us about the National Genomic Research Library, please get in touch on 0808 2819 535 or by email using the link below (it will open in your email application).