|
|
 |
The annotation of sequences is performed according to the "best" PSI-BLAST score, i.e. for each sequence from the test set
we select the annotated sequence from the training set that has maximum PSI-BLAST score (i.e., highest similarity). After this we assign all FunCat categories of the training set
sequence to the sequence from the test set. The confidence value is 1/SCORE, where SCORE is the "best" PSI-BLAST score.
Thus this annotation corresponds to the use of k-nearest neighbor classifier with K=1. |
|
Performance of the method
genome |
total genes |
coverage of manually annotated genes |
annotations of new genes |
sensitivity |
specificity |
|
Helicobacter_pylori |
1576 |
82 (713 out of 870) |
261 |
71.9 |
50.4 |
Bacillus_subtilis |
4112 |
79.1 (2233 out of 2823) |
311 |
75.7 |
78.5 |
Listeria_monocytogenes |
2846 |
89.4 (1741 out of 1948) |
234 |
86.3 |
83.1 |
Listeria_innocua |
2968 |
89.1 (1740 out of 1953) |
262 |
86.2 |
82.1 |
| |
 |