The KP Research Bank includes over 440K participants recruited through multiple outreach efforts since 2008 and incorporated under a single research program in 2015. The KP Research Bank has enrolled over 230K participants since 2015 under two protocols: general (2015-current) and cancer (rapid case ascertainment: 2016-2019; cancer survivors: 2022). Both protocols had two components, a blood draw and an intake (baseline) survey; neither was mandatory for inclusion in the program. The cancer protocol incorporated a cancer case ascertainment algorithm, which identified potential cancer cases and sent recruitment materials near the time of cancer diagnosis; cancer-specific questions were also added to the survey created under this protocol.
In addition, the KP Research Bank acquired participants from two large pre-existing research cohorts: the California Men’s Health Study (CMHS, ~22K, originally recruited 2002-2003), and the Research Program on Genes, Environment and Health (RPGEH, also California-based, ~195K, originally recruited 2008-2014). RPGEH included a sub-cohort of women of reproductive age who were recruited during a pregnancy episode (RPGEH Pregnancy Cohort, ~25K, originally recruited 2010-2018). These pre-existing cohorts are referred to as Legacy Data, and all participants incorporated into the KP Research Bank provided either a sample, or a sample and a survey. Samples for the Legacy Data cohorts were either saliva or blood.
In 2021, the KP Research Bank began genotyping 400K participants on the ThermoFisher Axiom Precision Medicine and Diversity Research Array (PMDA). In 2024, version 1 of these data (395K+), based on samples that were successfully genotyped and passed sample and data quality control and verification, was released for use in research. In addition to the PMDA data, whole genome data, whole exome data, and Axiom World Array data are available for smaller subsets of KP Research Bank participants (see below for counts). The largest of these subsets is the Genetic Epidemiology Research on Aging cohort (102K+), created in 2012, which includes matched RPGEH/CMHS survey data, telomere data, and Y chromosome and mitochondrial data.
Summary statistics provided through this site represent a high-level overview of the data in our resource. Researchers applying to use the resource will complete a Preparatory-to-Research request that will provide summary statistics tailored to their specific study design. View the KP Research Bank Data Model.
Demographics
Resources
405,232 KPRB Biomolecular Data
90.23% Percentage of KPRB participants with at least one biomolecular data type
415K+ Total number of assays
View Biomolecular Data Types
433,444 KPRB Biosamples
96.51% Percentage of KPRB participants with at least one biospecimen
737K+ Total number of biosamples
View Biosample Types
382,305 KPRB Surveys
85% Percentage of KPRB participants with at least one intake survey
432K+ Total number of intake surveys
View Survey Types
EMR
449,056 At least one address matched to a U.S. Census Geocode
99.99% Percentage of KPRB Cohort with at least one data point
2M+ Total number of data points
448,957 Clinical Encounters
99.96% Percentage of KPRB Cohort with at least one data point
171M+ Total number of data points
389,688 Clinically-Administered Medications
86.77% Percentage of KPRB Cohort with at least one data point
60M+ Total number of data points
448,925 Diagnoses
99.96% Percentage of KPRB Cohort with at least one data point
333M+ Total number of data points
View Key Diagnoses
447,747 Lab Results
99.69% Percentage of KPRB Cohort with at least one data point
266M+ Total number of data points
View Key Laboratory Results
447,545 Outpatient Medications
99.65% Percentage of KPRB Cohort with at least one data point
132M+ Total number of data points
446,950 Vital Signs (Height, Weight and/or Blood Pressure)
99.52% Percentage of KPRB Cohort with at least one data point
94M+ Total number of data points
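Each percentage above is a participant count divided by the size of the full cohort, which the summary does not state directly. The following minimal Python sketch back-computes the range of cohort sizes consistent with each count and percentage pair. It assumes the percentages are rounded to the nearest two decimal places; that assumption, along with the helper name `implied_total_range`, is illustrative and not part of the resource. The survey figure (85%) is omitted because it is reported to a different precision.

```python
# Counts and reported percentages copied from the summary above.
reported = {
    "Biomolecular data": (405_232, 90.23),
    "Biosamples": (433_444, 96.51),
    "Census geocode": (449_056, 99.99),
    "Clinical encounters": (448_957, 99.96),
    "Clinically-administered medications": (389_688, 86.77),
    "Diagnoses": (448_925, 99.96),
    "Lab results": (447_747, 99.69),
    "Outpatient medications": (447_545, 99.65),
    "Vital signs": (446_950, 99.52),
}


def implied_total_range(count, pct, decimals=2):
    """Return (low, high) cohort sizes consistent with `pct`.

    Assumes `pct` was rounded to the nearest `decimals` places
    (hypothetical; the page does not document its rounding rule).
    """
    half_step = 0.5 * 10 ** -decimals
    low = count / ((pct + half_step) / 100)
    high = count / ((pct - half_step) / 100)
    return low, high


ranges = {name: implied_total_range(c, p) for name, (c, p) in reported.items()}

# Intersection of all ranges: cohort sizes compatible with every row at once.
common_low = max(low for low, _ in ranges.values())
common_high = min(high for _, high in ranges.values())

for name, (low, high) in ranges.items():
    print(f"{name}: {low:,.0f} to {high:,.0f}")
print(f"Overlap: {common_low:,.0f} to {common_high:,.0f}")
```

If the rounding assumption holds, a non-empty overlap suggests that all rows share a single denominator; an empty overlap would instead point to different denominators or a different rounding rule.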
Last updated May 2024