Publications and Research

DeepREAL: a deep learning powered multi-scalemodeling framework for predicting out-of-distributionligand-induced GPCR activity

Tian Cai, CUNY Graduate CenterFollow
Kyra Alyssa Abbu, CUNY Hunter College
Yang Liu, CUNY Hunter College
Lei Xie, CUNY Hunter CollegeFollow

Document Type

Article

Publication Date

3-11-2022

Abstract

Motivation Drug discovery has witnessed intensive exploration of predictive modeling of drug–target physical interactions over two decades. However, a critical knowledge gap needs to be filled for correlating drug–target interactions with clinical outcomes: predicting genome-wide receptor activities or function selectivity, especially agonist versus antagonist, induced by novel chemicals. Two major obstacles compound the difficulty on this task: known data of receptor activity is far too scarce to train a robust model in light of genome-scale applications, and real-world applications need to deploy a model on data from various shifted distributions.

Results To address these challenges, we have developed an end-to-end deep learning framework, DeepREAL, for multi-scale modeling of genome-wide ligand-induced receptor activities. DeepREAL utilizes self-supervised learning on tens of millions of protein sequences and pre-trained binary interaction classification to solve the data distribution shift and data scarcity problems. Extensive benchmark studies on G-protein coupled receptors (GPCRs), which simulate real-world scenarios, demonstrate that DeepREAL achieves state-of-the-art performances in out-of-distribution settings. DeepREAL can be extended to other gene families beyond GPCRs.

Availability and implementation All data used are downloaded from Pfam (Mistry et al., 2020), GLASS (Chan et al., 2015) and IUPHAR/BPS and the data from reference (Sakamuru et al., 2021). Readers are directed to their official website for original data. Code is available on GitHub https://github.com/XieResearchGroup/DeepREAL.

Supplementary information Supplementary data are available at Bioinformatics online.

Comments

Article was originally published in Bioinformatics, available at https://doi.org/10.1093/bioinformatics/btac154.

This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited.

Download

Included in

Bioinformatics Commons

COinS

CUNY Academic Works

Publications and Research

DeepREAL: a deep learning powered multi-scalemodeling framework for predicting out-of-distributionligand-induced GPCR activity

Document Type

Publication Date

Abstract

Comments

Included in

Browse

Search

Author Corner

Links

CUNY Academic Works

Publications and Research

DeepREAL: a deep learning powered multi-scalemodeling framework for predicting out-of-distributionligand-induced GPCR activity

Authors

Document Type

Publication Date

Abstract

Comments

Included in

Share

Browse

Search

Author Corner

Links