FASTA Format

Each query GPCR-drug pair is given in a FASTA-style format with three parts: the GPCR name, the drug ID, and the GPCR sequence. The first line starts with '>' followed by the GPCR name and the drug ID, separated by one or more blanks; the GPCR sequence is listed on the line(s) below the first line.

The following is an example:
>hsa:10800 D00411
MDETGNLTVSSATCHDTIDDFRNQVYSTLYSMISVVGFFGNGFVLYVLIKTYHKKSAFQV
YMINLAVADLLCVCTLPLRVVYYVHKGIWLFGDFLCRLSTYALYVNLYCSIFFMTAMSFF
RCIAIVFPVQNINLVTQKKARFVCVGIWIFVILTSSPFLMAKPQKDEKNNTKCFEPPQDN
QTKNHVLVLHYVSLFVGFIIPFVIIIVCYTMIILTLLKKSMKKNLSSHKKAIGMIMVVTA
AFLVSFMPYHIQRTIHLHFLHNETKPCDSVLRMQKSVVITLSLAASNCCFDPLLYFFSGG
NFRKRLSTFRKHSLSSVTYVPRKKASLPEKGEEICKV

where hsa:10800 is the GPCR name, D00411 is the drug ID, and the remainder is the GPCR sequence.
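
As an illustration only, input in this format could be read with a short Python routine such as the sketch below. The names GpcrDrugPair and parse_gpcr_drug_fasta are hypothetical and do not belong to any existing tool. The sketch assumes that each '>' line holds exactly two blank-separated fields and that blanks or line breaks inside the sequence can be ignored.

```python
from typing import List, NamedTuple


class GpcrDrugPair(NamedTuple):
    """One query entry (hypothetical container, for illustration)."""
    gpcr_name: str
    drug_id: str
    sequence: str


def parse_gpcr_drug_fasta(text: str) -> List[GpcrDrugPair]:
    """Parse one or more GPCR-drug entries from FASTA-style text."""
    records = []  # each item: [gpcr_name, drug_id, list of sequence chunks]
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if not line:
            continue  # skip empty lines
        if line.startswith(">"):
            # Header: GPCR name and drug ID separated by one or more blanks.
            fields = line[1:].split()
            if len(fields) != 2:
                raise ValueError(f"expected 'name drug_id' after '>': {raw_line!r}")
            records.append([fields[0], fields[1], []])
        elif not records:
            raise ValueError("sequence found before any '>' header line")
        else:
            # Assumption: blanks inside a sequence line carry no meaning.
            records[-1][2].append("".join(line.split()))
    return [GpcrDrugPair(name, drug, "".join(chunks))
            for name, drug, chunks in records]


if __name__ == "__main__":
    example = ">hsa:10800 D00411\nMDETGNLTVSSATCHDTIDD\nFRNQVYSTLY\n"
    for pair in parse_gpcr_drug_fasta(example):
        print(pair.gpcr_name, pair.drug_id, len(pair.sequence))
```

The header is split on any run of whitespace, so one blank or several between the GPCR name and the drug ID are treated the same way.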

Note that the drug ID can be found in the KEGG database, available at http://www.kegg.jp/kegg/.