BIROn - Birkbeck Institutional Research Online

    A-Wardpβ: effective hierarchical clustering using the Minkowski metric and a fast k-means initialisation

    Cordeiro de Amorim, Renato and Makarenkov, V. and Mirkin, Boris (2016) A-Wardpβ: effective hierarchical clustering using the Minkowski metric and a fast k-means initialisation. Information Sciences 370-1 , pp. 343-354. ISSN 0020-0255.

    [img]
    Preview
    Text
    15982.pdf - Author's Accepted Manuscript
    Available under License Creative Commons Attribution Non-commercial No Derivatives.

    Download (1MB) | Preview

    Abstract

    In this paper we make two novel contributions to hierarchical clustering. First, we introduce an anomalous pattern initialisation method for hierarchical clustering algorithms, called A-Ward, capable of substantially reducing the time they take to converge. This method generates an initial partition with a sufficiently large number of clusters. This allows the cluster merging process to start from this partition rather than from a trivial partition composed solely of singletons. Our second contribution is an extension of the Ward and Wardp algorithms to the situation where the feature weight exponent can differ from the exponent of the Minkowski distance. This new method, called A-Wardpβ, is able to generate a much wider variety of clustering solutions. We also demonstrate that its parameters can be estimated reasonably well by using a cluster validity index. We perform numerous experiments using data sets with two types of noise, insertion of noise features and blurring within-cluster values of some features. These experiments allow us to conclude: (i) our anomalous pattern initialisation method does indeed reduce the time a hierarchical clustering algorithm takes to complete, without negatively impacting its cluster recovery ability; (ii) A-Wardpβ provides better cluster recovery than both Ward and Wardp.

    Metadata

    Item Type: Article
    Keyword(s) / Subject(s): Initialisation algorithm, Minkowski metric, Hierarchical clustering, Feature weighting
    School: Birkbeck Schools and Departments > School of Business, Economics & Informatics > Computer Science and Information Systems
    Research Centre: Structural Molecular Biology, Institute of (ISMB)
    Depositing User: Administrator
    Date Deposited: 10 Nov 2016 09:40
    Last Modified: 27 Jul 2019 11:17
    URI: http://eprints.bbk.ac.uk/id/eprint/15982

    Statistics

    Downloads
    Activity Overview
    80Downloads
    164Hits

    Additional statistics are available via IRStats2.

    Archive Staff Only (login required)

    Edit/View Item Edit/View Item