Analysis of protein-coding genetic variation in 60,706 humans
- PMID: 27535533
- PMCID: PMC5018207
- DOI: 10.1038/nature19057
Abstract
Large-scale reference data sets of human genetic variation are critical for the medical and functional interpretation of DNA sequence changes. Here we describe the aggregation and analysis of high-quality exome (protein-coding region) DNA sequence data for 60,706 individuals of diverse ancestries generated as part of the Exome Aggregation Consortium (ExAC). This catalogue of human genetic diversity contains an average of one variant every eight bases of the exome, and provides direct evidence for the presence of widespread mutational recurrence. We have used this catalogue to calculate objective metrics of pathogenicity for sequence variants, and to identify genes subject to strong selection against various classes of mutation, identifying 3,230 genes with near-complete depletion of predicted protein-truncating variants, with 72% of these genes having no currently established human disease phenotype. Finally, we demonstrate that these data can be used for the efficient filtering of candidate disease-causing variants, and for the discovery of human 'knockout' variants in protein-coding genes.
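As an illustration of the filtering use case mentioned in the abstract, the following is a minimal sketch of frequency-based filtering against a reference panel. It is not the authors' pipeline: the `Variant` record, the variant identifiers, the allele counts, and the `max_af` threshold are all hypothetical and chosen only for illustration. The only figure taken from the abstract is the cohort size of 60,706 individuals, which implies at most 2 × 60,706 alleles at a fully genotyped autosomal site.

```python
from dataclasses import dataclass

# From the abstract: 60,706 diploid individuals, so at most this many
# alleles can be observed at a fully covered autosomal site.
MAX_ALLELE_NUMBER = 2 * 60706


@dataclass(frozen=True)
class Variant:
    variant_id: str      # hypothetical identifier
    allele_count: int    # alternate-allele copies seen in the reference panel
    allele_number: int   # total alleles genotyped at this site in the panel


def allele_frequency(v: Variant) -> float:
    """Alternate-allele frequency in the panel; 0.0 if the site was not genotyped."""
    return v.allele_count / v.allele_number if v.allele_number else 0.0


def filter_rare(candidates, max_af):
    """Keep candidates whose panel allele frequency does not exceed max_af."""
    return [v for v in candidates if allele_frequency(v) <= max_af]


# Made-up candidate variants, for illustration only.
candidates = [
    Variant("var_a", 0, MAX_ALLELE_NUMBER),     # absent from the panel
    Variant("var_b", 3, MAX_ALLELE_NUMBER),     # very rare
    Variant("var_c", 2400, MAX_ALLELE_NUMBER),  # roughly 2%: too common for a rare disease
]

# The threshold is an assumption; a real analysis would derive it from the
# disease's prevalence, inheritance mode, and penetrance.
rare = filter_rare(candidates, max_af=1e-4)
print([v.variant_id for v in rare])
```

The design choice here is to filter on an upper frequency bound only: a variant that is common in a large reference population is unlikely to cause a rare, highly penetrant disorder, whereas absence from the panel is not by itself evidence of pathogenicity.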
Figures
![Extended Data Figure 1](https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d518/5018207/810a2c67b274/nihms798561f6.gif)
![Extended Data Figure 2](https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d518/5018207/48f3cecce2b8/nihms798561f7.gif)
![Extended Data Figure 3](https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d518/5018207/b8d67dc5b8e1/nihms798561f8.gif)
![Extended Data Figure 4](https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d518/5018207/99d4b930e239/nihms798561f9.gif)
![Extended Data Figure 5](https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d518/5018207/40b100fabb04/nihms798561f10.gif)
![Figure 1](https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d518/5018207/a6b8500c5dc8/nihms798561f1.gif)
![Figure 2](https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d518/5018207/1217127a4db7/nihms798561f2.gif)
![Figure 3](https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d518/5018207/5173ac7c244d/nihms798561f3.gif)
![Figure 4](https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d518/5018207/7e9d4205131e/nihms798561f4.gif)
![Figure 5](https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d518/5018207/487d788b4e4b/nihms798561f5.gif)
Comment in
- Human genomics: A deep dive into genetic variation. Nature. 2016 Aug 18;536(7616):277-8. doi: 10.1038/536277a. PMID: 27535530.
- Rethink the links between genes and disease. Nature. 2016 Oct 13;538(7624):140. doi: 10.1038/538140a. PMID: 27734882.
- How scientists use Slack. Nature. 2016 Dec 29;541(7635):123-124. doi: 10.1038/541123a. PMID: 28054618.
Grants and funding
- 090367/WT_/Wellcome Trust/United Kingdom
- R01DK062370/DK/NIDDK NIH HHS/United States
- K02 NS085048/NS/NINDS NIH HHS/United States
- 5U54HG003067-11/HG/NHGRI NIH HHS/United States
- P30 DK020572/DK/NIDDK NIH HHS/United States
- MOP82810/CAPMC/ CIHR/Canada
- RC2F DK088389/DK/NIDDK NIH HHS/United States
- U01-DK085545/DK/NIDDK NIH HHS/United States
- MH077139/MH/NIMH NIH HHS/United States
- HHSN268201300049C/HL/NHLBI NIH HHS/United States
- 098381/WT_/Wellcome Trust/United Kingdom
- U01 DK085545/DK/NIDDK NIH HHS/United States
- HHSN268201300046C/HL/NHLBI NIH HHS/United States
- NIMHRC2MH089905/PHS HHS/United States
- 1RC2DK088389/DK/NIDDK NIH HHS/United States
- G0801418/MRC_/Medical Research Council/United Kingdom
- MR/L003120/1/MRC_/Medical Research Council/United Kingdom
- U01 DK085501/DK/NIDDK NIH HHS/United States
- 2P50MH066392-05A1/MH/NIMH NIH HHS/United States
- R01 MH077139/MH/NIMH NIH HHS/United States
- RG/13/13/30194/BHF_/British Heart Foundation/United Kingdom
- P30 DK043351/DK/NIDDK NIH HHS/United States
- MH095034/MH/NIMH NIH HHS/United States
- MOP136936/CAPMC/ CIHR/Canada
- R01HL107816/HL/NHLBI NIH HHS/United States
- R01 DK098032/DK/NIDDK NIH HHS/United States
- U01DK085526/DK/NIDDK NIH HHS/United States
- U01 NS040024/NS/NINDS NIH HHS/United States
- HHSN268201300047C/HL/NHLBI NIH HHS/United States
- U54HG003067/HG/NHGRI NIH HHS/United States
- MC_UP_1102/20/MRC_/Medical Research Council/United Kingdom
- U41 HG000330/HG/NHGRI NIH HHS/United States
- K01 HL125751/HL/NHLBI NIH HHS/United States
- T32 HL007208/HL/NHLBI NIH HHS/United States
- G0800509/MRC_/Medical Research Council/United Kingdom
- U01 DK085584/DK/NIDDK NIH HHS/United States
- MOP77682/CAPMC/ CIHR/Canada
- HHSN268201300048C/HL/NHLBI NIH HHS/United States
- U01 DK085524/DK/NIDDK NIH HHS/United States
- R01DK098032/DK/NIDDK NIH HHS/United States
- RC2DK088389/DK/NIDDK NIH HHS/United States
- DK085545/DK/NIDDK NIH HHS/United States
- U01 DK085526/DK/NIDDK NIH HHS/United States
- R01MH085521/MH/NIMH NIH HHS/United States
- MH094421/MH/NIMH NIH HHS/United States
- NS40024-09S1/NS/NINDS NIH HHS/United States
- DK088389/DK/NIDDK NIH HHS/United States
- DK098032/DK/NIDDK NIH HHS/United States
- U01 DK062370/DK/NIDDK NIH HHS/United States
- P30 AG038072/AG/NIA NIH HHS/United States
- 090532/WT_/Wellcome Trust/United Kingdom
- U01 NS40024-09S1/NS/NINDS NIH HHS/United States
- RC2-DK088389/DK/NIDDK NIH HHS/United States
- R01HL24799/HL/NHLBI NIH HHS/United States
- U54 DK105566/DK/NIDDK NIH HHS/United States
- 5 U54 HG003067-13/HG/NHGRI NIH HHS/United States
- U01 MH094432/MH/NIMH NIH HHS/United States
- R01 GM104371/GM/NIGMS NIH HHS/United States
- HHSN268201300050C/HL/NHLBI NIH HHS/United States
- K01HL125751/HL/NHLBI NIH HHS/United States
- F32GM115208/GM/NIGMS NIH HHS/United States
- MH089905/MH/NIMH NIH HHS/United States
- R01MH085560/MH/NIMH NIH HHS/United States
- NS085048/NS/NINDS NIH HHS/United States
- G0601261/MRC_/Medical Research Council/United Kingdom
- FS/14/55/30806/BHF_/British Heart Foundation/United Kingdom