Raw Data Library
About
Aims and ScopeAdvisory Board Members
More
Who We Are?
User Guide
Green Science
​
​
EN
Kurumsal BaşvuruSign inGet started
​
​

About
Aims and ScopeAdvisory Board Members
More
Who We Are?
User GuideGreen Science

Language

Kurumsal Başvuru

Sign inGet started
RDL logo

Verified research datasets. Instant access. Built for collaboration.

Navigation

About

Aims and Scope

Advisory Board Members

More

Who We Are?

Contact

Add Raw Data

User Guide

Legal

Privacy Policy

Terms of Service

Support

Got an issue? Email us directly.

Email: info@rawdatalibrary.netOpen Mail App
​
​

© 2026 Raw Data Library. All rights reserved.
PrivacyTermsContact
  1. Raw Data Library
  2. /
  3. Publications
  4. /
  5. Transcript expression-aware annotation improves rare variant interpretation

Verified authors • Institutional access • DOI aware
50,000+ researchers120,000+ datasets90% satisfaction
Article
en
2020

Transcript expression-aware annotation improves rare variant interpretation

0 Datasets

0 Files

en
2020
Vol 581 (7809)
Vol. 581
DOI: 10.1038/s41586-020-2329-2

Get instant academic access to this publication’s datasets.

Create free accountHow it works

Frequently asked questions

Is access really free for academics and students?

Yes. After verification, you can browse and download datasets at no cost. Some premium assets may require author approval.

How is my data protected?

Files are stored on encrypted storage. Access is restricted to verified users and all downloads are logged.

Can I request additional materials?

Yes, message the author after sign-up to request supplementary files or replication code.

Advance your research today

Join 50,000+ researchers worldwide. Get instant access to peer-reviewed datasets, advanced analytics, and global collaboration tools.

Get free academic accessLearn more
✓ Immediate verification • ✓ Free institutional access • ✓ Global collaboration
Access Research Data

Join our academic network to download verified datasets and collaborate with researchers worldwide.

Get Free Access
Institutional SSO
Secure
This PDF is not available in different languages.
No localized PDFs are currently available.
Emelia Benjamin
Emelia Benjamin

Institution not specified

Verified
Beryl B. Cummings
Konrad J. Karczewski
Jack A. Kosmicki
+92 more

Abstract

The acceleration of DNA sequencing in samples from patients and population studies has resulted in extensive catalogues of human genetic variation, but the interpretation of rare genetic variants remains problematic. A notable example of this challenge is the existence of disruptive variants in dosage-sensitive disease genes, even in apparently healthy individuals. Here, by manual curation of putative loss-of-function (pLoF) variants in haploinsufficient disease genes in the Genome Aggregation Database (gnomAD)1, we show that one explanation for this paradox involves alternative splicing of mRNA, which allows exons of a gene to be expressed at varying levels across different cell types. Currently, no existing annotation tool systematically incorporates information about exon expression into the interpretation of variants. We develop a transcript-level annotation metric known as the 'proportion expressed across transcripts', which quantifies isoform expression for variants. We calculate this metric using 11,706 tissue samples from the Genotype Tissue Expression (GTEx) project2 and show that it can differentiate between weakly and highly evolutionarily conserved exons, a proxy for functional importance. We demonstrate that expression-based annotation selectively filters 22.8% of falsely annotated pLoF variants found in haploinsufficient disease genes in gnomAD, while removing less than 4% of high-confidence pathogenic variants in the same genes. Finally, we apply our expression filter to the analysis of de novo variants in patients with autism spectrum disorder and intellectual disability or developmental disorders to show that pLoF variants in weakly expressed regions have similar effect sizes to those of synonymous variants, whereas pLoF variants in highly expressed exons are most strongly enriched among cases. Our annotation is fast, flexible and generalizable, making it possible for any variant file to be annotated with any isoform expression dataset, and will be valuable for the genetic diagnosis of rare diseases, the analysis of rare variant burden in complex disorders, and the curation and prioritization of variants in recall-by-genotype studies.

How to cite this publication

Beryl B. Cummings, Konrad J. Karczewski, Jack A. Kosmicki, Eleanor G. Seaby, Nicholas A. Watts, Moriel Singer‐Berk, Jonathan M. Mudge, Juha Karjalainen, F. Kyle Satterstrom, Anne O’Donnell‐Luria, Timothy Poterba, Cotton Seed, Matthew Solomonson, Irina M. Armean, Eric Banks, Louis Bergelson, Kristian Cibulskis, Ryan L. Collins, Kristen M. Connolly, Miguel Covarrubias, Beryl B. Cummings, Mark J. Daly, Stacey Donnelly, Yossi Farjoun, Steven Ferriera, Laurent C. Francioli, Stacey Gabriel, Laura D. Gauthier, Jeff Gentry, Namrata Gupta, Thibault Jeandet, Diane Kaplan, Kristen M. Laricchia, Christopher Llanwarne, Eric Vallabh Minikel, Ruchi Munshi, Benjamin M. Neale, Sam Novod, Anne O’Donnell‐Luria, Nikelle Petrillo, Timothy Poterba, David Roazen, Valentín Ruano-Rubio, Andrea Saltzman, Kaitlin E. Samocha, Molly Schleicher, Cotton Seed, Matthew Solomonson, José Soto, Grace Tiao, Kathleen Tibbetts, Charlotte Tolonen, Christopher Vittal, Gordon Wade, Arcturus Wang, Qingbo S. Wang, James S. Ware, Nicholas A. Watts, Ben Weisburd, Nicola Whiffin, Carlos A. Aguilar‐Salinas, Tariq Ahmad, Christine M. Albert, Diego Ardissino, Gil Atzmon, John Barnard, Laurent Beaugerie, Emelia Benjamin, Michael Boehnke, Lori L. Bonnycastle, Erwin P. Böttinger, Donald W. Bowden, Matthew J. Bown, John C. Chambers, Juliana C.N. Chan, Daniel I. Chasman, Judy H. Cho, Mina K. Chung, Bruce M. Cohen, Adolfo Correa, Dana Dabelea, Dawood Darbar, Ravindranath Duggirala, Josée Dupuis, Patrick T. Ellinor, Roberto Elosúa, Jeanette Erdmann, Tõnu Esko, Martti Färkkilâ, José C. Florez, André Franke, Gad Getz, Benjamin Gläser, Stephen J. Glatt, David Goldstein (2020). Transcript expression-aware annotation improves rare variant interpretation. , 581(7809), DOI: https://doi.org/10.1038/s41586-020-2329-2.

Related publications

Why join Raw Data Library?

Quality

Datasets shared by verified academics with rich metadata and previews.

Control

Authors choose access levels; downloads are logged for transparency.

Free for Academia

Students and faculty get instant access after verification.

Publication Details

Type

Article

Year

2020

Authors

95

Datasets

0

Total Files

0

Language

en

DOI

https://doi.org/10.1038/s41586-020-2329-2

Join Research Community

Access datasets from 50,000+ researchers worldwide with institutional verification.

Get Free Access