Benchmark Data
Online Supplementary Materials A. The learning dataset contains 220 Gram-positive bacterial proteins classified into 5 subcellular locations according to the experimental annotations. Both the accession numbers and sequences are given. None of the proteins has more than 25% sequence identity to any other in the same subset (subcellular location). See the reference given on the top page of the web-server for further explanation. Click Supp-A to download the dataset.

Online Supplementary Materials B. The testing dataset contains 232 Gram-positive bacterial proteins classified into 5 subcellular locations according to the experimental annotations. Both the accession numbers and sequences are given. None of the proteins has more than 25% sequence identity to any others in the same subcellular location of either the dataset here or the learning dataset in the Online Supplementary Materials A. See the text of the paper for further explanation. Please download this dataset by click Supp-B.