
Predrag Radivojac
· Core FacultyVerifiedNortheastern University · Artificial Intelligence
Active 2001–2024
Research topics
- Computer Science
- Biology
- Genetics
- Computational biology
- Artificial Intelligence
- Bioinformatics
- Medicine
- Pathology
- Data science
- Psychology
- Combinatorics
- Theoretical computer science
- Mathematics
Selected publications
The American Journal of Human Genetics · 2022 · 504 citations
- Computer Science
- Computer Science
- Medicine
Recommendations from the American College of Medical Genetics and Genomics and the Association for Molecular Pathology (ACMG/AMP) for interpreting sequence variants specify the use of computational predictors as "supporting" level of evidence for pathogenicity or benignity using criteria PP3 and BP4, respectively. However, score intervals defined by tool developers, and ACMG/AMP recommendations that require the consensus of multiple predictors, lack quantitative support. Previously, we described a probabilistic framework that quantified the strengths of evidence (supporting, moderate, strong, very strong) within ACMG/AMP recommendations. We have extended this framework to computational predictors and introduce a new standard that converts a tool's scores to PP3 and BP4 evidence strengths. Our approach is based on estimating the local positive predictive value and can calibrate any computational tool or other continuous-scale evidence on any variant type. We estimate thresholds (score intervals) corresponding to each strength of evidence for pathogenicity and benignity for thirteen missense variant interpretation tools, using carefully assembled independent data sets. Most tools achieved supporting evidence level for both pathogenic and benign classification using newly established thresholds. Multiple tools reached score thresholds justifying moderate and several reached strong evidence levels. One tool reached very strong evidence level for benign classification on some variants. Based on these findings, we provide recommendations for evidence-based revisions of the PP3 and BP4 ACMG/AMP criteria using individual tools and future assessment of computational methods for clinical interpretation.
A roadmap for the functional annotation of protein families: a community perspective
Database · 2022 · 61 citations
- Computer Science
- Data science
- Artificial Intelligence
Over the last 25 years, biology has entered the genomic era and is becoming a science of 'big data'. Most interpretations of genomic analyses rely on accurate functional annotations of the proteins encoded by more than 500 000 genomes sequenced to date. By different estimates, only half the predicted sequenced proteins carry an accurate functional annotation, and this percentage varies drastically between different organismal lineages. Such a large gap in knowledge hampers all aspects of biological enterprise and, thereby, is standing in the way of genomic biology reaching its full potential. A brainstorming meeting to address this issue funded by the National Science Foundation was held during 3-4 February 2022. Bringing together data scientists, biocurators, computational biologists and experimentalists within the same venue allowed for a comprehensive assessment of the current state of functional annotations of protein families. Further, major issues that were obstructing the field were identified and discussed, which ultimately allowed for the proposal of solutions on how to move forward.
The ortholog conjecture revisited: the value of orthologs and paralogs in function prediction
Bioinformatics · 2020 · 71 citations
Senior authorCorresponding- Computer Science
- Biology
- Computational biology
MOTIVATION: The computational prediction of gene function is a key step in making full use of newly sequenced genomes. Function is generally predicted by transferring annotations from homologous genes or proteins for which experimental evidence exists. The 'ortholog conjecture' proposes that orthologous genes should be preferred when making such predictions, as they evolve functions more slowly than paralogous genes. Previous research has provided little support for the ortholog conjecture, though the incomplete nature of the data cast doubt on the conclusions. RESULTS: We use experimental annotations from over 40 000 proteins, drawn from over 80 000 publications, to revisit the ortholog conjecture in two pairs of species: (i) Homo sapiens and Mus musculus and (ii) Saccharomyces cerevisiae and Schizosaccharomyces pombe. By making a distinction between questions about the evolution of function versus questions about the prediction of function, we find strong evidence against the ortholog conjecture in the context of function prediction, though questions about the evolution of function remain difficult to address. In both pairs of species, we quantify the amount of information that would be ignored if paralogs are discarded, as well as the resulting loss in prediction accuracy. Taken as a whole, our results support the view that the types of homologs used for function transfer are largely irrelevant to the task of function prediction. Maximizing the amount of data used for this task, regardless of whether it comes from orthologs or paralogs, is most likely to lead to higher prediction accuracy. AVAILABILITY AND IMPLEMENTATION: https://github.com/predragradivojac/oc. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Inferring the molecular and phenotypic impact of amino acid variants with MutPred2
Nature Communications · 2020 · 840 citations
Senior authorCorresponding- Computational biology
- Biology
- Genetics
Identifying pathogenic variants and underlying functional alterations is challenging. To this end, we introduce MutPred2, a tool that improves the prioritization of pathogenic amino acid substitutions over existing methods, generates molecular mechanisms potentially causative of disease, and returns interpretable pathogenicity score distributions on individual genomes. Whilst its prioritization performance is state-of-the-art, a distinguishing feature of MutPred2 is the probabilistic modeling of variant impact on specific aspects of protein structure and function that can serve to guide experimental studies of phenotype-altering variants. We demonstrate the utility of MutPred2 in the identification of the structural and functional mutational signatures relevant to Mendelian disorders and the prioritization of de novo mutations associated with complex neurodevelopmental disorders. We then experimentally validate the functional impact of several variants identified in patients with such disorders. We argue that mechanism-driven studies of human inherited disease have the potential to significantly accelerate the discovery of clinically actionable variants.
Recent grants
NSF · $282k · 2015–2019
Computational approaches to protein identification and quantification using MS/MS
NIH · $449k · 2007–2016
NIH · $20k · 2012
CAREER: Bioinformatics of Protein Post-Translational Modifications
NSF · $608k · 2007–2013
A computational framework for predicting the impact of mutations in autism
NIH · $1.4M · 2014–2018
Frequent coauthors
- 139 shared
Sean D. Mooney
National Institutes of Health
- 51 shared
Iddo Friedberg
Iowa State University
- 50 shared
A. Keith Dunker
Indiana University School of Medicine
- 49 shared
Vikas Pejaver
Icahn School of Medicine at Mount Sinai
- 49 shared
Vladimir N. Uversky
- 45 shared
Maricel G. Kann
University of Maryland, Baltimore County
- 41 shared
D.N. Cooper
Cardiff University
- 40 shared
Matthew Mort
Cardiff University
- Resume-aware match score
- Save to shortlist
- AI-drafted outreach
See your match with Predrag Radivojac
PhdFit ranks faculty by your research interests, methods, and publications — grounded in their actual work, not templates.
- Free to start
- No credit card
- 30-second signup