Fisher Linear Discriminant Analysis for Text-Image Combination in Multimedia Information Retrieval

Christophe Moulin; Christine Largeron; Christophe Ducottet; Mathias Géry; Cécile Barat

doi:10.1016/j.patcog.2013.06.003

Article Dans Une Revue Pattern Recognition Année : 2014

Fisher Linear Discriminant Analysis for Text-Image Combination in Multimedia Information Retrieval

(1) , (1) , (1) , (1) , (1)

Christophe Moulin

Fonction : Auteur

Laboratoire Hubert Curien

Christine Largeron

Fonction : Auteur
PersonId : 5702
IdHAL : christine-largeron
ORCID : 0000-0003-1059-4095
IdRef : 029304121

Laboratoire Hubert Curien

Christophe Ducottet

Fonction : Auteur
PersonId : 836866
IdHAL : christophe-ducottet
ORCID : 0000-0002-2812-1918

Laboratoire Hubert Curien

Mathias Géry

Fonction : Auteur
PersonId : 843869

Laboratoire Hubert Curien

Cécile Barat

Fonction : Auteur
PersonId : 844844

Laboratoire Hubert Curien

Résumé

With multimedia information retrieval, combining different modalities - text, image, audio or video provides additional information and generally improves the overall system performance. For this purpose, the linear combination method is presented as simple, flexible and effective. However, it requires to choose the weight assigned to each modality. This issue is still an open problem and is addressed in this paper. Our approach, based on Fisher Linear Discriminant Analysis, aims to learn these weights for multimedia documents composed of text and images. Text and images are both represented with the classical bag-of-words model. Our method was tested over the ImageCLEF datasets 2008 and 2009. Results demonstrate that our combination approach not only outperforms the use of the single textual modality but provides a nearly optimal learning of the weights with an efficient computation. Moreover, it is pointed out that the method allows to combine more than two modalities without increasing the complexity and thus the computing time

Domaines

Recherche d'information [cs.IR] Vision par ordinateur et reconnaissance de formes [cs.CV] Multimédia [cs.MM]

Fichier principal

Moulin2013Fisher.pdf (749.65 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Christophe Ducottet : Connectez-vous pour contacter le contributeur

https://ujm.hal.science/ujm-00866140

Soumis le : jeudi 26 septembre 2013-09:40:24

Dernière modification le : jeudi 11 avril 2024-16:22:18

Archivage à long terme le : vendredi 27 décembre 2013-04:32:26

Dates et versions

ujm-00866140 , version 1 (26-09-2013)

Identifiants

HAL Id : ujm-00866140 , version 1
DOI : 10.1016/j.patcog.2013.06.003

Citer

Christophe Moulin, Christine Largeron, Christophe Ducottet, Mathias Géry, Cécile Barat. Fisher Linear Discriminant Analysis for Text-Image Combination in Multimedia Information Retrieval. Pattern Recognition, 2014, 47 (1), pp.260-269. ⟨10.1016/j.patcog.2013.06.003⟩. ⟨ujm-00866140⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-ST-ETIENNE IOGS CNRS LAHC PARISTECH UDL

190 Consultations

725 Téléchargements

Fisher Linear Discriminant Analysis for Text-Image Combination in Multimedia Information Retrieval

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager