Integrating structure in the probabilistic model for Information Retrieval

Abstract : In databases or in the World Wide Web, many documents are in a structured format (e.g. XML). We propose in this article to extend the classical IR probabilistic model in order to take into account the structure through the weighting of tags. Our approach includes a learning step in which the weight of each tag is computed. This weight estimates the probability that the tag distinguishes the terms which are the most relevant. Our model has been evaluated on a large collection during INEX IR evaluation campaigns.
Document type :
Conference papers
Complete list of metadatas

Cited literature [24 references]  Display  Hide  Download

https://hal-ujm.archives-ouvertes.fr/ujm-00331482
Contributor : Mathias Géry <>
Submitted on : Tuesday, May 19, 2009 - 7:03:00 PM
Last modification on : Wednesday, July 25, 2018 - 2:05:30 PM
Long-term archiving on : Monday, June 7, 2010 - 8:21:49 PM

File

gery-integratingStructure.pdf
Files produced by the author(s)

Identifiers

Collections

Citation

Mathias Géry, Christine Largeron, Franck Thollard. Integrating structure in the probabilistic model for Information Retrieval. IEEE / WIC / ACM International Conference on Web Intelligence, Dec 2008, Sydney, Australia. pp.763-769, ⟨10.1109/WIIAT.2008.346⟩. ⟨ujm-00331482⟩

Share

Metrics

Record views

129

Files downloads

148