Hirsch, M. and Tucker, A. and Swift, S. and Martin, Nigel and Orengo, C. and Kellam, P. and Liu, X. (2006) Improved robustness in time series analysis of gene expression data by polynomial model based clustering. In: Berthold, M.R. and Glen, R.C. and Fischer, I. (eds.) Computational Life Sciences II. Lecture Notes in Computer Science 4216. Berlin, Germany: Springer, pp. 1-10. ISBN 9783540457671.
|
Text
CompLifeSci_0609.pdf - Author's Accepted Manuscript Download (384kB) | Preview |
Abstract
Microarray experiments produce large data sets that often contain noise and considerable missing data. Typical clustering methods such as hierarchical clustering or partitional algorithms can often be adversely affected by such data. This paper introduces a method to overcome such problems associated with noise and missing data by modelling the time series data with polynomials and using these models to cluster the data. Similarity measures for polynomials are given that comply with commonly used standard measures. The polynomial model based clustering is compared with standard clustering methods under different conditions and applied to a real gene expression data set. It shows significantly better results as noise and missing data are increased.
Metadata
Item Type: | Book Section |
---|---|
Additional Information: | Second International Symposium, CompLife 2006, Cambridge, UK, September 27-29, 2006. Proceedings. - The final publication is available at link.springer.com |
School: | Birkbeck Faculties and Schools > Faculty of Science > School of Computing and Mathematical Sciences |
Research Centres and Institutes: | Structural Molecular Biology, Institute of (ISMB), Birkbeck Knowledge Lab |
Depositing User: | Nigel Martin |
Date Deposited: | 26 Feb 2014 12:06 |
Last Modified: | 09 Aug 2023 12:34 |
URI: | https://eprints.bbk.ac.uk/id/eprint/9250 |
Statistics
Additional statistics are available via IRStats2.