Nearest neighbours in least-squares data imputation algorithms with different missing patterns

Wasito, I. and Mirkin, Boris (2006) Nearest neighbours in least-squares data imputation algorithms with different missing patterns. Computational Statistics & Data Analysis 50 (4), pp. 926-949. ISSN 0167-9473.

Preview

Text
Binder1.pdf
Download (386kB) | Preview

Official URL: http://dx.doi.org/10.1016/j.csda.2004.11.009

Abstract

Methods for imputation of missing data in the so-called least-squares approximation approach, a non-parametric computationally efficient multidimensional technique, are experimentally compared. Contributions are made to each of the three components of the experiment setting: (a) algorithms to be compared, (b) data generation, and (c) patterns of missing data. Specifically, "global" methods for least-squares data imputation are reviewed and extensions to them are proposed based on the nearest neighbours (NN) approach. A conventional generator of mixtures of Gaussian distributions is theoretically analysed and, then, modified to scale clusters differently. Patterns of missing data are defined in terms of rows and columns according to three different mechanisms that are referred to as Random missings, Restricted random missings, and Merged database. It appears that NN-based versions almost always outperform their global counterparts. With the Random missings pattern, the winner is always the authors' two-stage method M, which combines global and local imputation algorithms.

Metadata

Item Type:	Article
Keyword(s) / Subject(s):	least squares, nearest neighbours, singular value decomposition, missing data, random missing, restricted random missing, merged database missing, principal component analysis, likelihood
School:	Birkbeck Faculties and Schools > Faculty of Science > School of Computing and Mathematical Sciences
Research Centres and Institutes:	Structural Molecular Biology, Institute of (ISMB)
Depositing User:	Sandra Plummer
Date Deposited:	01 Feb 2006
Last Modified:	19 Sep 2025 18:16
URI:	https://eprints.bbk.ac.uk/id/eprint/313

Statistics

DownloadsShow export options

Activity Overview

6 month trend

791Downloads

6 month trend

710Hits

Additional statistics are available via IRStats2.

Archive Staff Only (login required)

Edit/View Item