4 years ago

Aggregation of population-based genetic variation over protein domain homologues and its potential use in genetic diagnostics

Aggregation of population-based genetic variation over protein domain homologues and its potential use in genetic diagnostics
Hanka Venselaar, Gert Vriend, Christian Gilissen, Joris A. Veltman, Laurens Wiel
Whole exomes of patients with a genetic disorder are nowadays routinely sequenced but interpretation of the identified genetic variants remains a major challenge. The increased availability of population-based human genetic variation has given rise to measures of genetic tolerance that have been used, for example, to predict disease-causing genes in neurodevelopmental disorders. Here, we investigated whether combining variant information from homologous protein domains can improve variant interpretation. For this purpose, we developed a framework that maps population variation and known pathogenic mutations onto 2,750 “meta-domains.” These meta-domains consist of 30,853 homologous Pfam protein domain instances that cover 36% of all human protein coding sequences. We find that genetic tolerance is consistent across protein domain homologues, and that patterns of genetic tolerance faithfully mimic patterns of evolutionary conservation. Furthermore, for a significant fraction (68%) of the meta-domains high-frequency population variation re-occurs at the same positions across domain homologues more often than expected. In addition, we observe that the presence of pathogenic missense variants at an aligned homologous domain position is often paired with the absence of population variation and vice versa. The use of these meta-domains can improve the interpretation of genetic variation. We developed a framework to map population variation and known pathogenic mutations onto 2,750 “meta-domains.” These meta-domains consist of 30,853 within-human protein domain homologues. We find that population variation re-occurs at the same positions across domain homologues more often than expected. Additionally, we observe that the presence of pathogenic variants at an aligned homologous domain position is often paired with the absence of population variation and vice versa. These meta-domains aid in interpreting genetic variants in protein domains.

Publisher URL: http://onlinelibrary.wiley.com/resolve/doi

DOI: 10.1002/humu.23313

You might also like
Discover & Discuss Important Research

Keeping up-to-date with research can feel impossible, with papers being published faster than you'll ever be able to read them. That's where Researcher comes in: we're simplifying discovery and making important discussions happen. With over 19,000 sources, including peer-reviewed journals, preprints, blogs, universities, podcasts and Live events across 10 research areas, you'll never miss what's important to you. It's like social media, but better. Oh, and we should mention - it's free.

  • Download from Google Play
  • Download from App Store
  • Download from AppInChina

Researcher displays publicly available abstracts and doesn’t host any full article content. If the content is open access, we will direct clicks from the abstracts to the publisher website and display the PDF copy on our platform. Clicks to view the full text will be directed to the publisher website, where only users with subscriptions or access through their institution are able to view the full article.