COLOMBO
COLOMBO is a software framework equipped with a GUI for the
statistical analysis of sequences of a genome. It can be equipped with
different plugins that actually perform the analysis. The current version
of COLOMBO is supplied with SIGI-HMM, a tool for the prediction of Genomic
Islands.
SIGI-HMM: Prediction of Genomic Islands in Procaryotic
Genomes Using HMMs
The process of acquiring genes from foreign species, with or without a
closer taxonomical relation to the host,
is called horizontal gene transfer. This phenomenon is frequent among
microbial species and is considered one major means of rapid adaptation to
changing environmental demands. Researchers are seeking for algorithms that
detect anomalies characterising, for example, horizontally transferred genes,
often found in larger contiguous regions called genomic islands.
We focus on the case of genes from a taxonomically distant source.
Our approach to the problem is the use of a Hidden Markov Model (HMM)
that incorporates data gained from comparative genomes considered as putative
sources into a model describing the host genome, thus detecting possible
genomic islands. It uses synonymous Codon Usage as the statistical feature,
characterising the origin of a gene, that is incorporated into the model's
observation probabilities, and serving as a similarity
measure. In a preliminary step, Codon Usage data from a large number of
species has been merged into clusters, each belonging to one taxonomic group
that serves as a putative source. In one run of the algorithm, each gene of
a species under consideration is classified according to the prepared
data.
Our program SIGI-HMM is a publicly available and ready-to-use tool implementing
this model, showing good prediction capabilities in the sense that it was able
to reproduce experimental results for a number of well-studied genomes. We
regard the usability of SIGI-HMM as its main advantage compared to other
approaches that require a large amount of preprocessing done by the user. In
contrast to that, SIGI-HMM works fully automated and can be adjusted with a
small number of parameters.