Abstract
G-protein coupled receptor (GPCR) is a protein family that is found only in the Eukaryotes. They are used for the interfacing of cell to the outside world and are involved in many physiological processes. Their role in drug development is evident. Hence, the prediction of GPCRs is very much demanding. Because of the unavailability of 3D structures of most of the GPCRs, the statistical and machine learning based prediction of GPCRs is much demanding. GPCRs are classified into family, sub family and sub-sub family levels in the proposed approach. We have extracted features using the hybrid combination of Pseudo amino acid, Fast Fourier Transform and Split amino acid techniques. The overall feature vector is then reduced using Principle component analysis. Mostly, GPCRs are composed of two or more sub units. The arrangement and number of sub units forming a GPCR are referred to as quaternary structure. The functions of GPCRs are closely related to their quaternary structure. The classification in the present research is performed using grey incidence degree (GID) measure, which can efficiently analyze the numerical relation between various components of GPCRs. The GID measure based classification has shown remarkable improvement in predicting GPCRs.
Keywords: Bayes network method, Hidden Markov models, Chemokine, signal peptides, BLAST, &, FASTA, Frizzled, Smoothened receptors family, PseAAC, Power spectral density, Mathew's Selective Top Down method, Hybrid combination, quaternary structural classes, GPCRs prediction, split amino acid, fast fourier transforms, pseudo amino acid composition, nearest neighbor classifier, principle component analysis, grey incidence degree measure, G-protein coupled receptor, amino acids, RhodopsinBayes network method, Hidden Markov models, Chemokine, signal peptides, BLAST, &, FASTA, Frizzled, Smoothened receptors family, PseAAC, Power spectral density, Mathew's Selective Top Down method, Hybrid combination, quaternary structural classes, GPCRs prediction, split amino acid, fast fourier transforms, pseudo amino acid composition, nearest neighbor classifier, principle component analysis, grey incidence degree measure, G-protein coupled receptor, amino acids, Rhodopsin