A Neural Few-Shot Text Classification Reality Check

Thomas Dopierre; Christophe Gravier; Wilfried Logerais

Communication Dans Un Congrès Année : 2021

A Neural Few-Shot Text Classification Reality Check

(1) , (1) , (1)

Thomas Dopierre

Fonction : Auteur
PersonId : 1102945

Laboratoire Hubert Curien

Christophe Gravier

Fonction : Auteur

Laboratoire Hubert Curien

Wilfried Logerais

Fonction : Auteur
PersonId : 1102946

Laboratoire Hubert Curien

Résumé

Modern classification models tend to struggle when the amount of annotated data is scarce. To overcome this issue, several neural fewshot classification models have emerged, yielding significant progress over time, both in Computer Vision and Natural Language Processing. In the latter, such models used to rely on fixed word embeddings before the advent of transformers. Additionally, some models used in Computer Vision are yet to be tested in NLP applications. In this paper, we compare all these models, first adapting those made in the field of image processing to NLP, and second providing them access to transformers. We then test these models equipped with the same transformer-based encoder on the intent detection task, known for having a large number of classes. Our results reveal that while methods perform almost equally on the ARSC dataset, this is not the case for the Intent Detection task, where the most recent and supposedly best competitors perform worse than older and simpler ones (while all are given access to transformers). We also show that a simple baseline is surprisingly strong. All the new developed models, as well as the evaluation framework, are made publicly available 1 .

Domaines

Intelligence artificielle [cs.AI]

Fichier principal

2021.eacl-main.79.pdf (397.06 Ko)

Origine : Fichiers éditeurs autorisés sur une archive ouverte

Christophe Gravier : Connectez-vous pour contacter le contributeur

https://ujm.hal.science/ujm-03267869

Soumis le : mardi 22 juin 2021-17:00:24

Dernière modification le : vendredi 24 mars 2023-14:53:22

Archivage à long terme le : jeudi 23 septembre 2021-19:11:24

Dates et versions

ujm-03267869 , version 1 (22-06-2021)

Identifiants

HAL Id : ujm-03267869 , version 1

Citer

Thomas Dopierre, Christophe Gravier, Wilfried Logerais. A Neural Few-Shot Text Classification Reality Check. 16th Conference of the European Chapter of the Association for Computational Linguistics, Apr 2021, Kyiv (virtual), Ukraine. ⟨ujm-03267869⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-ST-ETIENNE IOGS CNRS PARISTECH UDL

22 Consultations

36 Téléchargements

A Neural Few-Shot Text Classification Reality Check

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager