Read Me

TargetGDrug: Predicting GPCR-Drug Interactions by Random Forest with Drug Association Matrix Based Post-processing Procedure

G-protein-coupled receptors (GPCRs) are important targets of the modern medicinal drugs. The accurate identification of the interactions between GPCRs and drugs is of significant importance for both protein function annotations and drug discovery. In this paper, a new sequence-based predictor called TargetGDrug is designed and implemented for predicting GPCR-drug interactions. In TargetGDrug, the evolutionary feature of GPCR sequence and the wavelet-based molecular fingerprints of drug are integrated to form the combined feature of a GPCR-drug pair; then, the combined feature is fed to a trained random forest (RF) classifier to perform initial prediction; finally, a novel drug association matrix based post-processing procedure is applied to reduce the potential false positive or false negative of the initial prediction. Experimental results on benchmark datasets demonstrate the efficacy of the proposed method and an improvement of 15% in terms of the Matthews correlation coefficient (MCC) was observed over independent validation tests if compared with the most recently released sequence-based GPCR-drug interactions predictor. The implemented webserver, together with the datasets used in this study, is freely available for academic use at http://csbio.njust.edu.cn/bioinf/TargetGDrug or http://www.csbio.sjtu.edu.cn/bioinf/TargetGDrug.