Dipeptide pseudo amino acid composition
The option of dipeptide composition will generate 420 components for each protein sequence, the first 20 components are the conventional amino acid composition(AAC); the following 400 components are the fractions of 400 dipeptides, i.e. AA, AC, AD, , YV, YW, YY; the 400 components are calculated using the following equation,
 
where dep(i) is the ith dipeptide of the 400 dipeptides, i=1,2,,400.
 
The format of the output for the 420 components are:
1st line: 20 components of amino acid composition (AAC);
2nd line: 20 components of dipeptide composition beginning with amino acid A, i.e. AA, AC, AD, , AY;
3rd line: 20 components of dipeptide composition beginning with amino acid C, i.e. CA, CC, CD, , CY;
21st line: 20 components of dipeptide composition beginning with amino acid Y, i.e. YA, YC, YD, , YY.
 
NOTE: in dipeptide pseudo amino acid composition, the other three parameters are NOT needed any more, i.e. amino acid attributes, normalization weight factor and lamda .