Building fuzzy thematic clusters and mapping them to higher ranks in a taxonomy
Mirkin, Boris and Fenner, Trevor and Nascimento, S. and Moniz Pereira, L. (2010) Building fuzzy thematic clusters and mapping them to higher ranks in a taxonomy. International Journal of Software and Informatics 4 (3), pp. 257-275. ISSN 1673-7288.
Abstract
We present a novel methodology for the analysis of activities engaged in an organization such as the research conducted in a University department by mapping them to a related hierarchical taxonomy such as Classification of Computer Subjects by ACM (ACM-CCS). We start by collecting data of activities of the individual components of the organization and present them as the components fuzzy membership profiles over the subjects of the taxonomy. Our method generalizes the profiles in two steps. First step finds fuzzy clusters of taxonomy subjects according to the working of the organization. Second, each cluster is mapped to higher ranks of the taxonomy in a parsimonious way. Each of the steps is formalized and solved in a novel way. We build fuzzy clusters of the taxonomy leaves according to the similarity between individual profiles by using a novel, additive spectral, fuzzy clustering method that involves a number of model-based stopping conditions, in contrast to other methods. As the found clusters are not necessarily consistent with the taxonomy, each is considered as a query set. To lift a query set to higher ranks of the taxonomy, we develop an original recursive algorithm for minimizing a penalty function that involves 'head subjects' on the higher ranks of the taxonomy together with their 'gaps' and 'offshoots'. The method is illustrated by applying it to real-world data.
Metadata
Item Type: | Article |
---|---|
School: | Birkbeck Faculties and Schools > Faculty of Science > School of Computing and Mathematical Sciences |
Research Centres and Institutes: | Structural Molecular Biology, Institute of (ISMB) |
Depositing User: | Administrator |
Date Deposited: | 14 May 2013 09:45 |
Last Modified: | 09 Aug 2023 12:33 |
URI: | https://eprints.bbk.ac.uk/id/eprint/6747 |
Statistics
Additional statistics are available via IRStats2.