Discovery and Basic Research
Luchen Zhang, MS (he/him/his)
Graduate Student
University of Michigan
Ann Arbor, Michigan, United States
Albert Cao
Student
University of Michigan
Ann Arbor, Michigan, United States
Yingzi Bu, Ph.D.
Post-doc Fellow
University of Michigan
Ann Arbor, Michigan, United States
Duxin Sun, Ph.D.
Professor
University of Michigan
Ann Arbor, Michigan, United States
Analysis of the dataset of 759 FDA-approved drugs. (a) The Tanimoto similarity index indicates that the chemical compounds exhibit significant diversity, covering a broad range of chemical space. This demonstrates the structural variability within the dataset. (b) The representative t-Distributed Stochastic Neighbor Embedding (t-SNE) analysis result reveals that conventional dimensionality reduction methods fail to distinguish drugs with observed AEs from those without AEs based solely on chemical structure, underscoring the necessity for advanced modeling techniques like machine learning for AE prediction. Labels in (b) denote frequency of the specified AE: 1 – frequent, 0 – infrequent, -100 – missing values.
Analysis of predicted on-/off-targets of the six FDA-approved SMKI drugs. Heatmaps of predicted pIC50 values of the SMKIs’ (a) primary targets (IC50 < 30 nM) and (b) primary/unintended targets (IC50 < 250 nM). (c) Summary of the number of predicted on-/off-targets for the six SMKIs.
Performance for the AEs prediction