Skip to main content

Table 6 Patterns of four amino acids

From: 50 years of amino acid hydrophobicity scales: revisiting the capacity for peptide classification

Sequence pool Peptide FO in pool Average FO in remaining set Average FO in remaining set GBSS
dc-helix ALAA 0.0005088 0.0001336 0.0001327
AALA 0.0004970 0.0001545 0.0001559
ALLE 0.0003787 6.55e−05 6.82e−05
dc-random GSSG 0.0028090 0.0002055 0.0001760
SSGS 0.0015990 0.0001405 0.0001205
SGSS 0.0014350 0.0001022 7.90e−05
HHHH 0.0007792 4.17e−05 2.41e−05
EEEE 0.0003691 2.28e−05 1.88e−05
dc-sheet VLLV 0.0003481 0.0001082 0.0001198
dd-helix EELL* 0.0003622 5.37e−05 5.67e−05
LEEL* 0.0003372 6.47e−05 6.58e−05
dd-random GSSG 0.0006914 0.0003379 0.0003685
SSGS 0.0003872 0.0002163 0.0002307
SGSS 0.0003042 0.0001729 0.0001818
SSGL 0.0002489 4.35e−05 4.62e−05
dd-sheet GEVV* 0.0002647 4.57e−05 4.93e−05
PDGT* 0.0002427 1.19e−05 1.53e−05
DGSV* 0.0002427 2.88e−05 3.05e−05
no-helix SGSS 0.0001950 0.0001797 0.0001918
PDGS 0.0001509 2.59e−05 3.24e−05
no-random VVGI 0.0017010 4.14e−05 3.84e−05
QELD 0.0010210 1.13e05 1.45e05
no-sheet LEAL 0.0003080 8.56e−05 9.38e−05
random GSSG* 0.0012510 0.0003029 2.53e−05
GPSS 0.0007403 0.0001942 4.57e−05
SGPS 0.0006586 0.0001508 2.70e−05
SSGS 0.0005360 1.43e−05 6.00e07
SGSS 0.0005156 1.14e−05 3.60e06
s-sheet VKVI 0.0001704 2.9e−06 1.16e−05
krtm-helix LGLL 0.0012460 6.78e−05 5.58e−05
VLLV 0.0009341 7.16e−05 6.66e−05
GIAL 0.0009341 5.29e−05 4.11e−05
tm-helix LLLL 0.0004439 7.87e−05 4.07e−05
LILL 0.0004040 3.21e−05 1.25e−05
LLLV 0.0003990 4.38e−05 2.15e−05
ILLL 0.0003891 2.98e−05 1.73e−05
krtm-sheet SIGA 0.0012020 2.92e−05 1.69e05
tm-sheet SGPL 0.0004559 1.88e−05 1.20e−05
SLNL 0.0004079 1.96e−05 2.15e−05
LYGG 0.0004079 1.06e−05 9.80e−06
  1. Given are the sequence pool name (column 1) and peptide sequence with the highest frequency of occurrence (column 2); the frequency of occurrence (FO) of this peptide in the according pool (column 3), the frequency of occurrence of this peptide in the pool containing all sequences except the one of the analyzed pool (column 4), the frequency of occurrence of this peptide in the pool containing all sequences generated by the same strategy (GBSS) as the analyzed pool excluding the sequences of the analyzed pool (column 5). Italic shows peptides that have an at least 50-fold higher frequency, with respect to the remaining sets (column 4) or the remaining peptides generated by the same strategy (column 5). Peptide sequences with p values below 0.05 were marked by an asterisk