Editing training data for multi-label classification with the k-nearest neighbor rule

Sawsan Kanj, Fahed Abdallah, Thierry Denœux, Kifah Tout

Résultats de recherche: Contribution à un journalArticleRevue par des pairs

41 Citations (Scopus)

Résumé

Multi-label classification allows instances to belong to several classes at once. It has received significant attention in machine learning and has found many real-world applications in recent years, such as text categorization, automatic video annotation and functional genomics, resulting in the development of many multi-label classification methods. Based on labeled examples in the training dataset, a multi-labeled method extracts inherent information to output a function that predicts the labels of unlabeled data. Due to several problems, like errors in the input vectors or in their labels, this information may be wrong and might lead the multi-label algorithm to fail. In this paper, we propose a simple algorithm for overcoming these problems by editing the existing training dataset, and adapting the edited set with different multi-label classification methods. Evaluation on benchmark datasets demonstrates the usefulness and effectiveness of our approach.

langue originaleAnglais
Pages (de - à)145-161
Nombre de pages17
journalPattern Analysis and Applications
Volume19
Numéro de publication1
Les DOIs
étatPublié - 1 févr. 2016
Modification externeOui

Une note bibliographique

Publisher Copyright:
© 2015, Springer-Verlag London.

Contient cette citation