Coll, F. and Mallard, K. and Preston, M. and Bentley, S. and Parkhill, J. and McNerney, R. and Martin, Nigel and Clark, T. (2012) SpolPred: rapid and accurate prediction of Mycobacterium tuberculosis spoligotypes from short genomic sequences. Bioinformatics 28 (22), pp. 2991-2993. ISSN 1367-4803.
|
Text
SpolPred_1209.pdf - Published Version of Record Available under License Creative Commons Attribution. Download (146kB) | Preview |
Abstract
Summary: Spoligotyping is a well-established genotyping technique based on the presence of unique DNA sequences in Mycobacterium tuberculosis (Mtb), the causal agent of tuberculosis disease (TB). Although advances in sequencing technologies are leading to whole-genome bacterial characterization, tens of thousands of isolates have been spoligotyped, giving a global view of Mtb strain diversity. To bridge the gap, we have developed SpolPred, a software to predict the spoligotype from raw sequence reads. Our approach is compared with experimentally and de novo assembly determined strain types in a set of 44 Mtb isolates. In silico and experimental results are identical for almost all isolates (39/44). However, SpolPred detected five experimentally false spoligotypes and was more accurate and faster than the assembling strategy. Application of SpolPred to an additional seven isolates with no laboratory data led to types that clustered with identical experimental types in a phylogenetic analysis using single-nucleotide polymorphisms. Our results demonstrate the usefulness of the tool and its role in revealing experimental limitations.
Metadata
Item Type: | Article |
---|---|
School: | Birkbeck Faculties and Schools > Faculty of Science > School of Computing and Mathematical Sciences |
Research Centres and Institutes: | Structural Molecular Biology, Institute of (ISMB), Birkbeck Knowledge Lab |
Depositing User: | Nigel Martin |
Date Deposited: | 26 Feb 2014 12:28 |
Last Modified: | 09 Aug 2023 12:34 |
URI: | https://eprints.bbk.ac.uk/id/eprint/9247 |
Statistics
Additional statistics are available via IRStats2.