Skip to main content

Table 7 Amino acid patterns of five amino acids

From: 50 years of amino acid hydrophobicity scales: revisiting the capacity for peptide classification

Sequence pool

Peptide

FO in pool

FO in remaining set

FO in remaining set GBSS

dc-helix

ALLDA

0.0001311

5.50e−06

3.50e−06

AALAA

0.0001311

3.75e−05

4.50e−05

ALDAA

0.0001180

6.30e−06

5.50e−06

AAALA

0.0001180

1.31e−05

1.15e−05

dc-random

GSSGS

0.0016490

9.41e−05

7.02e−05

SGSSG

0.0015360

8.46e−05

6.15e−05

SSGSS

0.0015130

8.75e−05

6.31e−05

HHHHH

0.0005872

2.50e−05

1.33e−05

dc-sheet

VLVNA

0.0001966

4.80e−06

1.10e06

SDTVV

0.0001966

2.50e06

3.20e06

KGTVT

0.0001966

2.50e06

2.20e06

YLVNM#

0.0001311

<1.00e08

<1.00e08

dd-helix

LTEEE

8.01e−05

2.10e−06

2.10e−06

LTLEE

9.34e−05

5.20e−06

7.30e−06

ELLAD

8.01e−05

7.50e−06

9.20e−06

dd-random

GSSGS

0.0003547

0.0001750

0.0001879

SGSSG

0.0003252

0.0001603

0.0001716

SSGSS

0.0003252

0.0001618

0.0001711

GSSGL

0.0001774

1.28e−05

1.35e−05

TILPL#

0.0001182

1.20e06

2.00e07

dd-sheet

TLDGG#

0.0001182

4.50e−06

2.00e07

SVIDT

9.52e−05

1.40e06

2.10e−06

LTVTG#

9.52e−05

2.80e−06

2.00e07

no-helix

GDSGG

6.76e−05

2.20e−06

3.20e−06

no-random

VGIVT#

0.0011290

5.00e07

2.00e07

TGHSL#

0.0007524

1.20e06

1.70e06

no-sheet

SSGSS

0.0002080

0.0001691

0.0001817

SGSSG

0.0001976

0.0001683

0.0001832

all

VIGGG

4.35e−05

3.90e−06

2.70e−06

IIGGG

3.84e−05

2.50e−06

2.70e−06

LADAG

3.07e−05

2.00e−06

2.40e−06

IVGAG

3.07e−05

4.80e−06

6.00e−06

GVDVV

3.07e−05

1.20e−06

1.20e−06

random

SGPSS#

0.0005350

2.00e07

<1.00e08

GSSGS#

0.0007301

0.0001515

7.00e07

SGSSG#

0.0006744

0.0001385

7.00e07

SSGSS

0.0006744

0.0001399

8.00e06

HHHHH

0.0002508

4.60e−05

7.00e07

s-helix

EELKK#

5.49e−05

<1.00e08

<1.00e08

s-sheet

CGGSL#

0.0001124

6.60e−06

<1.00e08

GIVSW

8.03e−05

4.80e−06

1.30e06

YGGVT#

6.42e−05

1.01e−05

<1.00e08

krtm-helix

LLVGI

0.0004832

4.60e06

2.60e06

LAAVA

0.0004832

7.20e06

3.10e06

FLAVL

0.0004832

3.00e06

1.20e06

YVFFG#

0.0003221

7.00e07

<1.00e08

YPIVW

0.0003221

5.40e06

6.00e06

tm-helix

LILLL

9.97e−05

1.00e06

7.00e07

LLLLV

8.92e−05

3.10e−06

4.70e−06

krtm-sheet

TGTLE

1.05e−05

5.40e−06

6.30e−06

tm-sheet

PTLDL#

0.0001878

2.77e−05

<1.00e08

LYGKV#

0.0001610

<1.00e08

<1.00e08

SASAG#

0.0001342

1.40e06

<1.00e08

RQFNV

0.0001342

2.20e06

1.40e06

  1. Given are the sequence pool name (column 1) and sequence with the highest frequency of occurrence (column 2); the frequency of occurrence (FO) of this peptide in the according pool (column 3), the FO in the pool containing all sequences except the one of the analyzed pool (column 4), the FO in the pool containing all sequences generated by the same strategy (GBSS) excluding sequences of the analyzed pool (column 5). Italic shows peptides with at least 50-fold higher frequency with respect to all (column 4) or peptides of the same strategy (column 5). Hashtag after the pattern indicate 500-fold higher frequency in at least column 4 or column 5