Benchmark Data
Online Supplementary Materials A. The learning dataset contains 406 plant proteins classified into 11 subcellular locations according to the experimental annotations. Both the accession numbers and sequences are given. None of the proteins has >25% sequence identity to any other in the same subset (subcellular location).
Click learning dataset.pdf to download the dataset.
Online Supplementary Materials B. The testing dataset contains 265 plant proteins classified into 11 subcellular locations according to the experimental annotations. Both the accession numbers and sequences are given. None of the proteins has >25% sequence identity to any others in the same subcellular location of either the dataset here or the training dataset in the Online Supplementary Materials A.
Click testing dataset.pdf to download the dataset.