There are two benchmark dataset are contained in the ATP168 and the NUC5 directories, respectively.

ATP168 directory only includes the training data, ATPseq.fasta is the sequence information, and ATPlabel.fasta is the label information.

NUC5 directory has two subfolders the training and the validation directories which contain the training dataset and validation dataset for each type of nucleotides.
	Taking ATP as an example, its training dataset are two fasta format files named as follows:
							Training\ATPseq.fasta
							Training\ATPlabel.fasta
	and its validation dataset are also two fasta format files named as follows:
							Validation\ATPseq.fasta
							Validation\ATPlabel.fasta	