MCut: A Thresholding Strategy for Multi-label Classification

Christine Largeron; Mathias Géry; Christophe Moulin

Communication Dans Un Congrès Année : 2012

MCut: A Thresholding Strategy for Multi-label Classification

(1) , (1) , (1)

Christine Largeron

Fonction : Auteur
PersonId : 5702
IdHAL : christine-largeron
ORCID : 0000-0003-1059-4095
IdRef : 029304121

Laboratoire Hubert Curien

Mathias Géry

Fonction : Auteur
PersonId : 843869

Laboratoire Hubert Curien

Christophe Moulin

Fonction : Auteur

Laboratoire Hubert Curien

Résumé

The multi-label classi cation is a frequent task in pattern recognition, data mining and machine learning. When binary classi ers are not suited, an alternative consists in using a multiclass classi er that provides for each document a score per category and then in applying a thresholding strategy in order to select the set of categories which must be assigned to the document. The common thresholding strategies, such as RCut, PCut and SCut methods, need a training step to determine the value of the threshold. To overcome this limit, we propose in this article a new strategy, called MCut which automatically estimates a value for the threshold. This method, simple to implement, does not have to be trained and it does not need any parametrization. Experimentations performed on two textual corpora: XML Mining 2009 and RCV1 collections, show that the MCut strategy obtains good results compared to those provided by usual thresholding strategies.

Domaines

Apprentissage [cs.LG]

Christine Largeron : Connectez-vous pour contacter le contributeur

https://ujm.hal.science/ujm-00730656

Soumis le : lundi 10 septembre 2012-17:33:27

Dernière modification le : vendredi 24 mars 2023-14:52:56

Dates et versions

ujm-00730656 , version 1 (10-09-2012)

Identifiants

HAL Id : ujm-00730656 , version 1

Citer

Christine Largeron, Mathias Géry, Christophe Moulin. MCut: A Thresholding Strategy for Multi-label Classification. Eleventh International Symposium on Intelligent Data Analysis (IDA 2012), Oct 2012, Elsinki, Finland. pp.173-184. ⟨ujm-00730656⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-ST-ETIENNE IOGS CNRS LAHC PARISTECH UDL

215 Consultations

0 Téléchargements

MCut: A Thresholding Strategy for Multi-label Classification

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager