Type 1 PseAA composition is also called the parallelcorrelation type and generates 20 + discrete numbers to represent a protein, which was introduced by Prof. KuoChen Chou in 2001 and the original publication is:
Chou K.C.(2001). Prediction of protein cellular attributes using pseudoaminoacidcomposition. PROTEINS: Structure, Function, and Genetics 43, 246255.
The essence of pseudoamino acid composition is, on one hand, to include the main feature of amino acid composition, but on the other, to include information beyond amino acid composition. The conventional amino acid composition contains 20 components, or discrete numbers, each reflecting the occurrence frequency of one of the 20 native amino acids in a protein. For the pseudoamino acid composition, however, there are some other elements in addition to the 20 components. It is through these additional discrete numbers that the sequence order effect of a protein is approximately reflected and improvements are made, as will be shown below. The basic ideas of Type 1 pseudo amino acid composition is as following:
Consider a protein chain of L amino acid residues:
 (1) 
Sequence order effect can be approximately reflected with a set of sequence ordercorrelated factors as defined below:
 (2) 
where _{1} is called the firsttier correlation factor that reflects the sequence order correlation between all the most contiguous residues along a protein chain (Fig. 1a), _{2} the secondtier correlation factor that reflects the sequence order correlation between all the second most contiguous residues (Fig.1b), _{3} the thirdtier correlation factor that reflects the sequence order correlation between all the 3rd most contiguous residues (Fig.1c), and so forth. In Eq. 2 the correlation function is given by
 (3) 
where H_{1}(R_{i}), H_{2}(R_{i}), and M(R_{i}) are, respectively, the hydrophobicity value, hydrophilicity value, and sidechain mass of the amino acid R_{i}; and H_{1}(R_{j}), H_{2}(R_{j}), and M(R_{j}) the corresponding values for the amino acid R_{j}. Note that before substituting the values of hydophobicity, hydrphilicity, and sidechain mass into Eq. 3, they were all subjected to a standard conversion as described by the following equation:
 (4) 
where H(i) is the original hydrophobicity value of the ith amino acid, H(i) the corresponding original hydrophilicity value, and M^{0}(i) the mass of the ith amino acid side chain that can be obtained from any biochemistry text book. Without loss of generality, we use the numerical indices 1, 2, 3,, 20 to represent the 20 native amino acids according to the alphabetical order of their singleletter codes: A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W, and Y. The data obtained by such a standard conversion (Eq. 4) will have a zero mean value and will remain unchanged if going through the same conversion procedure again.

As we can see from Figure 1, the sequence order effect of a protein can be, to some extent, reflected through a set of sequencecorrelation factors _{1}, _{2}, _{3},, _{ }, as defined by Eq. 2. Now let us augment the formulation of amino acid composition to include such a set of discrete numbers. To realize this, instead of using a 20D (dimensional) vector defined by 20 components, we use a (20 + )D vector defined by 20 + discrete numbers to represent a protein X; i.e.,
 (5) 
where
 (6) 
where f_{i} is the normalized occurrence frequency of the 20 amino acids in the protein X, _{j} is the jtier sequence correlation factor computed according to Eqs. 24 for the protein X, and w is the weight factor for the sequence order effect. As we can see from Eqs. 5 and 6, the first 20 components reflect the effect of the amino acid composition, whereas the components from 20 + 1 to 20 + reflect the effect of sequence order. A set of such 20 + components as formulated by Eqs. 5 and 6 is called the pseudoamino acid composition for protein X.
