TOPDB Topology Data Bank of Transmembrane Proteins
Topology, Structure and Prediction.
Database revision: (4190 entries, 75211 topology data) Quick search:    

Database status

Database revision:
Revision date: 01/01/70
Entries: 4190
Topology data: 75211
Alpha helical proteins: 4067
Beta barrel proteins: 123
PubMed links: 4270
PDB links: 12816
UniProt links: 4190
Visitors: 601721

Documents related to TOPDB

Measuring the reliability of the prediction

The source code of the HMMTOP program has been modified in order to calculate the sum of the posterior probabilities along the Viterbi path. According to the unique hidden structure of the HMMTOP, the posterior probabilities were summed up for each main hidden state type (inside,membrane, loop and outside) in each position of the amino acid sequence, then these probabilities were summed up along the most probable state sequence provided by the Viterbi algorithm. We use this sum divided by the length of the protein to measure the reliability.

Assuming the notations of Rabiner's excellent tutorial on hidden Markov models, the posterior probabilities can be calculated from the forward and backward variables:

\(\gamma_{t} (i) = \cfrac{\alpha_{t} (i) \cdot \beta_{t} (i)}{P(O|\lambda)} \ \ 1\leq t\leq n, \ 1\leq i\leq N\)

where n is the sequence length, N is the number of states in the hidden Markov model, O is the array of observation symbols (the amino acid sequence) and λ is the hidden Markov model. The posterior probability of each main state can be calculated by summing up the posterior probabilities, which have the same label as the main state:

\(\Gamma_{t} (j) = \sum\limits_{k=1}^{N} \gamma_{t} (k|label (S_{k}) = j), \ \ 1\leq j\leq \hat{N}\)

Reliability is the average of the posterior probabilities along the most probably state path (q):

\(R=100 \cdot \cfrac{ \sum\limits_{t=1}^{N} \Gamma_{t}(label(q_{t})) }{n}\)

.: © 2007-2014 Institute of Enzymology :. .: Disclaimer :. .: Privacy :.