
String representations and distances in deep Convolutional Neural Networks for image classification

Abstract: Recent advances in image classification mostly rely on the use of powerful local features combined with an adapted image representation. Although Convolutional Neural Network (CNN) features learned from ImageNet were shown to be generic and very efficient, they still lack the flexibility to take into account variations in the spatial layout of visual elements. In this paper, we investigate the use of structural representations on top of pre-trained CNN features to improve image classification. Images are represented as strings of CNN features. Similarities between such representations are computed using two new edit distance variants adapted to the image classification domain. Our algorithms have been implemented and tested on several challenging datasets: 15Scenes, Caltech101, Pascal VOC 2007 and MIT indoor. The results show that our idea of using structural string representations and distances clearly improves the classification performance over standard approaches based on CNN features and SVM with a linear kernel, as well as other recognized methods from the literature.
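To make the idea concrete, the sketch below computes a generic edit distance between two images represented as "strings" of feature vectors. This is an illustrative baseline only: the unit insertion/deletion cost and the (1 - cosine similarity) substitution cost are assumptions for the sake of the example, not the two specific variants proposed in the paper.

```python
# Illustrative sketch: Wagner-Fischer edit distance generalized so that
# the "symbols" of the strings are feature vectors (e.g. CNN features)
# rather than characters. Cost model is assumed, not the paper's variants.
from math import sqrt

def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if a vector is null)."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = sqrt(sum(a * a for a in u))
    nv = sqrt(sum(b * b for b in v))
    return dot / (nu * nv) if nu and nv else 0.0

def string_edit_distance(s, t, indel_cost=1.0):
    """Edit distance between two sequences of feature vectors.

    Substitution cost is 1 - cosine similarity, so substituting a vector
    by a very similar one is almost free; insertions and deletions pay a
    fixed cost (an assumption for this sketch).
    """
    m, n = len(s), len(t)
    d = [[0.0] * (n + 1) for _ in range(m + 1)]
    for i in range(1, m + 1):
        d[i][0] = i * indel_cost
    for j in range(1, n + 1):
        d[0][j] = j * indel_cost
    for i in range(1, m + 1):
        for j in range(1, n + 1):
            sub = 1.0 - cosine(s[i - 1], t[j - 1])
            d[i][j] = min(d[i - 1][j] + indel_cost,   # deletion
                          d[i][j - 1] + indel_cost,   # insertion
                          d[i - 1][j - 1] + sub)      # substitution
    return d[m][n]
```

In a classification pipeline, such a distance could feed a k-nearest-neighbour classifier or be turned into a kernel for an SVM; the paper's contribution lies in edit distance variants tailored to spatial layouts of visual elements, which this generic version does not capture.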
Contributor: Christophe Ducottet
Submitted on: Tuesday, February 16, 2016 - 10:10:14 AM
Last modification on: Sunday, June 26, 2022 - 12:05:12 PM
Long-term archiving on: Tuesday, May 17, 2016 - 10:05:21 AM
Files produced by the author(s)
Cécile Barat, Christophe Ducottet. String representations and distances in deep Convolutional Neural Networks for image classification. Pattern Recognition, Elsevier, 2016, 54, pp.104-115. ⟨10.1016/j.patcog.2016.01.007⟩. ⟨ujm-01274675⟩


