Using Proximity and Tag Weights for Focused Retrieval in Structured Documents

Abstract: Focused information retrieval is concerned with the retrieval of small units of information. In this context, both the structure of the documents and the proximity among query terms have been found useful for improving retrieval effectiveness. In this article, we propose an approach that combines the proximity of the query terms with the tags that mark these terms. Our approach is based on a Fetch and Browse method in which the fetch step is performed with BM25 and the browse step with a structure-enhanced proximity model. In this way, the ranking of a document depends not only on the presence of the query terms within the document but also on the tags that mark these terms. A document therefore tends to be ranked as highly relevant when its query terms are close together and are emphasized by tags. The evaluation of this model on a large XML-structured collection provided by the INEX 2010 XML IR evaluation campaign shows that the use of term proximity and structure improves the retrieval effectiveness of BM25 in the context of focused information retrieval.
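
The two-step idea described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not give the scoring formulas, so the triangular influence function, the multiplicative use of tag weights, the min/max combination over query terms, and every name and parameter below (`bm25_score`, `proximity_score`, `fetch_and_browse`, `tag_weights`, `k`, `top_n`, the tags `title` and `p`) are assumptions chosen only for illustration.

```python
import math
from collections import Counter


def bm25_score(query, doc, docs, k1=1.2, b=0.75):
    """Okapi BM25 score of one tokenized document against a tokenized query."""
    n_docs = len(docs)
    avg_len = sum(len(d) for d in docs) / n_docs
    tf = Counter(doc)
    score = 0.0
    for term in set(query):
        df = sum(1 for d in docs if term in d)
        if df == 0 or tf[term] == 0:
            continue
        idf = math.log((n_docs - df + 0.5) / (df + 0.5) + 1.0)
        norm = tf[term] + k1 * (1 - b + b * len(doc) / avg_len)
        score += idf * tf[term] * (k1 + 1) / norm
    return score


def proximity_score(query, doc, tag_weights, k=5):
    """Assumed proximity model over a list of (token, tag) pairs.

    Each occurrence of a query term spreads a triangular influence of
    half-width k, scaled by the (hypothetical) weight of its tag. At each
    position we keep the strongest influence per term, take the minimum
    over terms (all terms must be nearby), and sum over positions.
    """
    terms = set(query)
    if not terms:
        return 0.0
    total = 0.0
    for x in range(len(doc)):
        per_term = []
        for term in terms:
            best = 0.0
            for i, (token, tag) in enumerate(doc):
                if token == term:
                    weight = tag_weights.get(tag, 1.0)  # untagged default: 1.0
                    best = max(best, weight * max(0.0, (k - abs(x - i)) / k))
            per_term.append(best)
        total += min(per_term)
    return total


def fetch_and_browse(query, tagged_docs, tag_weights, top_n=2, k=5):
    """Fetch the top_n documents with BM25, then re-rank them by proximity."""
    token_docs = [[token for token, _ in d] for d in tagged_docs]
    fetched = sorted(
        range(len(tagged_docs)),
        key=lambda j: bm25_score(query, token_docs[j], token_docs),
        reverse=True,
    )[:top_n]
    return sorted(
        fetched,
        key=lambda j: proximity_score(query, tagged_docs[j], tag_weights, k),
        reverse=True,
    )


# Toy collection: in document 0 the query terms are far apart; in document 1
# they are adjacent and carry a heavier (hypothetical) "title" tag.
filler = [("filler", "p")] * 10
docs = [
    [("xml", "p")] + filler + [("retrieval", "p")],
    [("xml", "title"), ("retrieval", "title")] + filler,
    [("filler", "p")] * 12,
]
ranking = fetch_and_browse(["xml", "retrieval"], docs, {"title": 2.0})
print(ranking)
```

In this toy collection, BM25 alone does not separate documents 0 and 1, since both contain each query term once and have the same length. The assumed browse step ranks document 1 first because its query terms are adjacent and marked by the more heavily weighted tag.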
Document type:
Journal article
Knowledge and Information Systems (KAIS), Springer, 2015, 44 (1), pp.51-76

Cited literature: 47 references

https://hal-ujm.archives-ouvertes.fr/ujm-01016381
Contributor: Mathias Géry
Submitted on: Monday, 30 June 2014 - 10:24:18
Last modified on: Thursday, 26 July 2018 - 01:11:08
Document(s) archived on: Tuesday, 30 September 2014 - 15:06:14

File

2013_KAIS.pdf
Files produced by the author(s)

Identifiers

  • HAL Id: ujm-01016381, version 1

Citation

Michel Beigbeder, Mathias Géry, Christine Largeron. Using Proximity and Tag Weights for Focused Retrieval in Structured Documents. Knowledge and Information Systems (KAIS), Springer, 2015, 44 (1), pp.51-76. 〈ujm-01016381〉

Metrics

Record views: 257

File downloads: 302