UJM at ImageCLEFwiki 2008

Christophe Moulin; Cecile Barat; Mathias Géry; Christophe Ducottet; Christine Largeron

Communication Dans Un Congrès Année : 2008

UJM at ImageCLEFwiki 2008

(1) , (1) , (1) , (1) , (1)

Christophe Moulin

Fonction : Auteur
PersonId : 854129

Laboratoire Hubert Curien

Cecile Barat

Fonction : Auteur
PersonId : 844844

Laboratoire Hubert Curien

Mathias Géry

Fonction : Auteur
PersonId : 843869

Laboratoire Hubert Curien

Christophe Ducottet

Fonction : Auteur
PersonId : 836866
IdHAL : christophe-ducottet
ORCID : 0000-0002-2812-1918

Laboratoire Hubert Curien

Christine Largeron

Fonction : Auteur
PersonId : 5702
IdHAL : christine-largeron
ORCID : 0000-0003-1059-4095
IdRef : 029304121

Laboratoire Hubert Curien

Résumé

This paper reports our multimedia information retrieval experiments carried out for the ImageCLEF track (ImageCLEFwiki). The task is to answer to user information needs, i.e. queries which may be composed of several modalities (text, image, concept) with ranked lists of relevant documents. The purpose of our experiments is twofold: firstly, our overall aim is to develop a multimedia document model combining text and/or image modalities. Secondly, we aim to compare results of our model using a multimedia query with a text only model. Our multimedia document model is based on a vector of textual and visual terms. The textual terms correspond to words. The visual ones result from local colour descriptors which are automatically extracted and quantized by k-means, leading to an image vocabulary. They represent the colour property of an image region. To perform a query, we compute a similarity score between each document vector (textual + visual terms) and the query using the Okapi method based on the tf.idf approach. We have submitted 6 runs either automatic or manual, using textual, visual or both information. Thanks to these 6 runs, we aim to study several aspects of our model, as the choice of the visual words and local features, the way of combining textual and visual words for a query and the performance improvements obtained when adding visual information to a pure textual model. Concerning the choice of the visual words, results show us that they are significant in some cases where the visualness of the query is meaningful. The conclusion about the combination of textual and visual words is surprising. We obtain worth results when we add directly the text to the visual words. Finally, results also inform that visual information bring complementary relevant documents that were not found with the text query. These initial results are promising and encourage the development of our multimedia model.

Domaines

Apprentissage [cs.LG] Traitement des images [eess.IV] Recherche d'information [cs.IR]

Fichier principal

moulin-paperCLEF2008.pdf (97.62 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Christophe Moulin : Connectez-vous pour contacter le contributeur

https://ujm.hal.science/ujm-00326228

Soumis le : mercredi 20 mai 2009-09:55:04

Dernière modification le : vendredi 24 mars 2023-14:52:51

Archivage à long terme le : vendredi 4 juin 2010-12:06:08

Dates et versions

ujm-00326228 , version 1 (20-05-2009)

Identifiants

HAL Id : ujm-00326228 , version 1

Citer

Christophe Moulin, Cecile Barat, Mathias Géry, Christophe Ducottet, Christine Largeron. UJM at ImageCLEFwiki 2008. ECDL 2008 - Workshop CLEF, Sep 2008, Aarhus, Denmark. ⟨ujm-00326228⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-ST-ETIENNE IOGS CNRS LAHC PARISTECH UDL

279 Consultations

231 Téléchargements

UJM at ImageCLEFwiki 2008

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager