Multi-modal curriculum learning for semi-supervised image classification

Gong, C. and Tao, D. and Maybank, Stephen J. and Liu, W. and Kang, G. and Yang, J. (2016) Multi-modal curriculum learning for semi-supervised image classification. IEEE Transactions on Image Processing 25 (7), pp. 3249-3260. ISSN 1057-7149.

Preview

Text
15093.pdf - Author's Accepted Manuscript
Download (1MB) | Preview

Official URL: http://dx.doi.org/10.1109/TIP.2016.2563981

Abstract

Semi-supervised image classification aims to classify a large quantity of unlabeled images by typically harnessing scarce labeled images. Existing semi-supervised methods often suffer from inadequate classification accuracy when encountering difficult yet critical images, such as outliers, because they treat all unlabeled images equally and conduct classifications in an imperfectly ordered sequence. In this paper, we employ the curriculum learning methodology by investigating the difficulty of classifying every unlabeled image. The reliability and the discriminability of these unlabeled images are particularly investigated for evaluating their difficulty. As a result, an optimized image sequence is generated during the iterative propagations, and the unlabeled images are logically classified from simple to difficult. Furthermore, since images are usually characterized by multiple visual feature descriptors, we associate each kind of features with a teacher, and design a multi-modal curriculum learning (MMCL) strategy to integrate the information from different feature modalities. In each propagation, each teacher analyzes the difficulties of the currently unlabeled images from its own modality viewpoint. A consensus is subsequently reached among all the teachers, determining the currently simplest images (i.e., a curriculum), which are to be reliably classified by the multi-modal learner. This well-organized propagation process leveraging multiple teachers and one learner enables our MMCL to outperform five state-of-the-art methods on eight popular image data sets.

Metadata

Item Type:	Article
Additional Information:	(c) 2016 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other users, including reprinting/ republishing this material for advertising or promotional purposes, creating new collective works for resale or redistribution to servers or lists, or reuse of any copyrighted components of this work in other works.
Keyword(s) / Subject(s):	Curriculum learning, Semi-supervised learning, Multi-modal, Image classification
School:	Birkbeck Faculties and Schools > Faculty of Science > School of Computing and Mathematical Sciences
Depositing User:	Administrator
Date Deposited:	24 May 2016 12:37
Last Modified:	23 Sep 2025 11:34
URI:	https://eprints.bbk.ac.uk/id/eprint/15093

Statistics

DownloadsShow export options

Activity Overview

6 month trend

1,911Downloads

6 month trend

305Hits

Additional statistics are available via IRStats2.

Archive Staff Only (login required)

Edit/View Item