| FASTA Format |
|
The FASTA format of each query GPCR-drug pair consists of three parts, i.e., GPCR name, drug ID, and GPCR sequence. The first line starts with '>' and is followed by a GPCR name and a drug ID that are separated by one or more blanks; GPCR sequence is listed below the first line.
The following is an example: >hsa:10800 D00411 MDETGNLTVSSATCHDTIDDFRNQVYSTLYSMISVVGFFGNGFVLYVLIKTYHKKSAFQV YMINLAVADLLCVCTLPLRVVYYVHKGIWLFGDFLCRLSTYALYVNLYCSIFFMTAMSFF RCIAIVFPVQNINLVTQKKARFVCVGIWIFVILTSSPFLMAKPQKDEKNNTKCFEPPQDN QTKNHVLVLHYVSLFVGFIIPFVIIIVCYTMIILTLLKKSMKKNLSSHKKAIGMIMVVTA AFLVSFMPYHIQRTIHLHFLHNETKPCDSVLRMQKSVVITLSLAASNCCFDPLLYFFSGG NFRKRLSTFRKHSLSSVTYVPRKKASLPEKGEEICKV
where has:10800 is a GPCR name, D00411 is a drug ID, and the remaining is the GPCR sequence.
Note that the drug ID can be found from the KEGG database available at http://www.kegg.jp/kegg/. |