During the last decade, novel technological developments in high-throughput DNA sequencing have brought in new information about benign and disease-associated genetic mutations. One type of those are missense variants, which cause an amino acid substitution upon a single nucleotide change in a protein-coding region of the genome [1]. Over the past years, different studies have looked at the functional impact of missense variants based on conservation through evolution, sequence or structural information, however, there is still very limited understanding about their outcome and, in particular, how disease-associated missense variants impair their corresponding protein and trigger their associated phenotype [2,3].
Studying the structural and functional effects of missense variants is still a challenge, but thanks to the development of structural biology methods —with more than 100,000 new structures in the last 10 years and growing as a direct product of the evolution of new techniques— it represents a rising research area.
We assess the disease-causing missense variants by analysing the changes that they cause on the 3D protein structure, thanks to their atomic level insight. In this regard, we have developed a novel tool that allows us to characterize and annotate missense variants considering all public structural information. Our tool searches for available structural data and builds a molecular model of the mutated structure and, then, calculates multiple features such as increase or decrease of interactions, changes in solvent accessible surface area or proximity to a protein-protein interface, among others. We have validated the pipeline with data from the literature and applied it on larger datasets from HGMD and ClinVar, where we have identified some examples of, to date, unknown variant effect that might be explained with structural changes.
Studying the impact of disease-causing variants and identifying new examples may give us not only a better understanding of the underlying molecular mechanisms of different disorders, but also higher chances of finding a better treatment for current and future patients as well as helping to overcome drug resistance, selectivity or toxicity in a plethora of diseases, such as cancer, diabetes or other metabolic disorders [4,5].