Hayrapetyan, Z. and Mirkin, Boris and Nascimento, S. and Fenner, Trevor and Frolov, D. (2023) Modeling a most specific generalization in domain taxonomies. In: 17th Conference of the International Federation of Classification Societies (IFCS 2022), 19-23 July 2022, Porto, Portugal. (In Press)
Text
IFCS2022_Paper (1).pdf - Author's Accepted Manuscript Restricted to Repository staff only Download (112kB) | Request a copy |
Abstract
We define a most specific generalization of a fuzzy set of topics assigned to leaves of the rooted tree of a domain taxonomy. This generalization lifts the set to its “head subject” node in the higher ranks of the taxonomy tree. The head subject is supposed to “tightly” cover the query set, possibly involving some errors referred to as “gaps” and “offshoots”. We develop a method to globally maximize either the parsimony or the likelihood of a scenario involving gains and losses of the general concept manifested in a fuzzy cluster of leaf nodes of the taxonomy. Supplemented with fuzzy c-means clustering, this allows us to obtain meaningful generalizations for fuzzy thematic clusters of Data Science topics using several dozen thousand abstracts from issues of relevant research journals published from 2000 on.
Metadata
Item Type: | Conference or Workshop Item (Paper) |
---|---|
School: | Birkbeck Faculties and Schools > Faculty of Science > School of Computing and Mathematical Sciences |
Depositing User: | Trevor Fenner |
Date Deposited: | 21 Mar 2023 14:03 |
Last Modified: | 09 Aug 2023 12:53 |
URI: | https://eprints.bbk.ac.uk/id/eprint/47941 |
Statistics
Additional statistics are available via IRStats2.