Impact précoce du poids des balises pour la recherche d'information ciblée

Abstract : This paper addresses the integration of XML tags in terms weighting function for focused XML Information Retrieval (IR). Our model allows to consider a certain kind of structural information: tags that represent logical structure (title, section, paragraph, etc.) as well as tags related to formatting (bold, italic, center, etc.). We take into account the tags influence by estimating the probability that tags distinguishe relevant terms. Then, these weights are integrated in terms weighting function. Experiments on a large collection during INEX 2008 XML IR evaluation campaign showed improvements on focused retrieval.
Document type :
Conference papers
Complete list of metadatas

https://hal-ujm.archives-ouvertes.fr/ujm-00385200
Contributor : Mathias Géry <>
Submitted on : Tuesday, May 19, 2009 - 7:01:45 PM
Last modification on : Wednesday, July 25, 2018 - 2:05:31 PM
Long-term archiving on : Thursday, June 10, 2010 - 11:22:42 PM

File

coria_final.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : ujm-00385200, version 1

Collections

Citation

Mathias Géry, Christine Largeron, Franck Thollard. Impact précoce du poids des balises pour la recherche d'information ciblée. Conférence en Recherche d'Information et Applications, May 2009, Presqu'île de Giens, France. pp 333-348. ⟨ujm-00385200⟩

Share

Metrics

Record views

144

Files downloads

131