Table 1 |
|||||
|
FSC of Selected Proteins |
|||||
|
length (aa) |
Number of Sequences |
Null State (Bits) |
FSC (Fits) |
FSC Density Fits/aa |
|
|
|
|||||
|
Ankyrin |
33 |
1,171 |
143 |
46 |
1.4 |
|
HTH 8 |
41 |
1,610 |
177 |
76 |
1.9 |
|
HTH 7 |
45 |
503 |
194 |
83 |
1.8 |
|
HTH 5 |
47 |
1,317 |
203 |
80 |
1.7 |
|
HTH 11 |
53 |
663 |
229 |
80 |
1.5 |
|
HTH 3 |
55 |
3,319 |
238 |
80 |
1.5 |
|
Insulin |
65 |
419 |
281 |
156 |
2.4 |
|
Ubiquitin |
65 |
2,442 |
281 |
174 |
2.7 |
|
Kringle domain |
75 |
601 |
324 |
173 |
2.3 |
|
Phage Integr N-dom |
80 |
785 |
346 |
123 |
1.5 |
|
VPR |
82 |
2,372 |
359 |
308 |
3.7 |
|
RVP |
95 |
51 |
411 |
172 |
1.8 |
|
Acyl-Coa dh N-dom |
103 |
1,684 |
445 |
174 |
1.7 |
|
MMR HSR1 |
119 |
792 |
514 |
179 |
1.5 |
|
Ribosomal S12 |
121 |
603 |
523 |
359 |
3.0 |
|
FtsH |
133 |
456 |
575 |
216 |
1.6 |
|
Ribosomal S7 |
149 |
535 |
644 |
359 |
2.4 |
|
P53 DNA domain |
157 |
156 |
679 |
525 |
3.3 |
|
Vif |
190 |
1,982 |
821 |
675 |
3.6 |
|
SRP54 |
196 |
835 |
847 |
445 |
2.3 |
|
Ribosomal S2 |
197 |
605 |
851 |
462 |
2.4 |
|
Viral helicase1 |
229 |
904 |
990 |
335 |
1.5 |
|
Beta-lactamase |
239 |
1,785 |
1,033 |
336 |
1.4 |
|
RecA |
240 |
1,553 |
1,037 |
832 |
3.5 |
|
Bac luciferase |
272 |
1,900 |
1,176 |
357 |
1.3 |
|
tRNA-synt 1b |
280 |
865 |
1,210 |
438 |
1.6 |
|
SecY |
342 |
469 |
1,478 |
688 |
2.0 |
|
EPSP Synthase |
372 |
1,001 |
1,608 |
688 |
1.9 |
|
FTHFS |
390 |
658 |
1,686 |
1,144 |
2.9 |
|
DctM |
407 |
682 |
1,759 |
724 |
1.8 |
|
Corona S2 |
445 |
836 |
1,923 |
1,285 |
2.9 |
|
Flu PB2 |
608 |
1,692 |
2,628 |
2,416 |
4.0 |
|
Usher |
724 |
316 |
3,129 |
1,296 |
1.8 |
|
Paramyx RNA Pol |
887 |
389 |
3,834 |
1,886 |
2.1 |
|
ACR Tran |
949 |
1,141 |
4,102 |
1,650 |
1.7 |
|
Random sequences |
1000 |
500 |
4,321 |
0 |
0 |
|
50-mer polyadenosine |
50 |
1 |
0 |
0 |
0 |
|
|
|||||
|
Results for 35 protein families Shown above are the 35 protein families analyzed, their sequence length (column 1), the number of sequences analyzed for each family (column 2), the Shannon uncertainty of the Null State Hø (Eqn. 4) for each protein (column 3), the FSC value ζ in Fits for each protein (column 4), and the average Fit value/site (FSC/length, column 5). For comparison, the results for a set of uniformly random amino acid sequences (RSC) are shown in the second from last row, and a highly ordered, 50-mer polyadenosine sequence (OSC) in the last row. All values, except for the OSC example, which was calculated from the constrained ground state required to produce OSC, were computed from the null state. The Fit values obtained can be discussed as the measure of the change in functional uncertainty required to specify any functional sequence that falls into the given family being analyzed. |
|||||
|
Durston et al. Theoretical Biology and Medical Modelling 2007 4:47 doi:10.1186/1742-4682-4-47 |
|||||