Fusion of tf.idf Weighted Bag of Visual Features for Image Classification

Abstract : Image representation using the bag-of-visual-words approach is commonly used in image classification. Features are extracted from images and clustered into a visual vocabulary. Each image can then be represented as a normalized histogram of visual words, much as a textual document is represented as a weighted vector of terms. As a result, text categorization techniques are applicable to image classification. In this paper, our contribution is twofold. First, we propose a suitable term frequency and inverse document frequency (tf.idf) weighting scheme to characterize the importance of visual words. Second, we present a method to fuse different bags of words obtained with different vocabularies. We show that combining our tf.idf normalization with this fusion leads to better classification rates on the SIMPLIcity collection than other normalization methods, other fusion schemes, or other approaches evaluated on the same collection.
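
The pipeline outlined in the abstract can be illustrated with a minimal sketch. The abstract does not give the authors' weighting formula or fusion rule, so the code below uses assumptions throughout: the textbook tf.idf weight (relative term frequency times log(N / df)), L2 normalization, and fusion by simple concatenation. The function names (tfidf_histograms, l2_normalize, fuse) and the toy data are hypothetical, and each image is assumed to be already quantized into a list of visual-word indices.

```python
import math
from collections import Counter


def l2_normalize(vec):
    """Scale a vector to unit Euclidean length (all-zero vectors stay zero)."""
    norm = math.sqrt(sum(v * v for v in vec))
    return [v / norm for v in vec] if norm > 0 else list(vec)


def tfidf_histograms(docs, vocab_size):
    """Turn lists of visual-word indices (one list per image) into
    tf.idf-weighted, L2-normalized histograms.

    Assumption: textbook tf.idf, not necessarily the paper's scheme.
    """
    n_docs = len(docs)
    # Document frequency: number of images containing each visual word.
    df = [0] * vocab_size
    for words in docs:
        for w in set(words):
            df[w] += 1

    vectors = []
    for words in docs:
        counts = Counter(words)
        total = len(words) or 1  # avoid division by zero for empty images
        vec = [0.0] * vocab_size
        for w, c in counts.items():
            tf = c / total                  # relative frequency in this image
            idf = math.log(n_docs / df[w])  # 0 when the word is in every image
            vec[w] = tf * idf
        vectors.append(l2_normalize(vec))
    return vectors


def fuse(*per_vocab_vectors):
    """Fuse representations from several vocabularies by concatenating,
    image by image (an assumed stand-in for the paper's fusion method)."""
    fused = []
    for parts in zip(*per_vocab_vectors):
        row = []
        for part in parts:
            row.extend(part)
        fused.append(row)
    return fused


# Toy data: three images quantized against two hypothetical vocabularies.
words_vocab_a = [[0, 0, 1], [1, 2], [2, 2, 2]]  # vocabulary A has 3 words
words_vocab_b = [[0, 1], [1, 1], [0]]           # vocabulary B has 2 words

hist_a = tfidf_histograms(words_vocab_a, vocab_size=3)
hist_b = tfidf_histograms(words_vocab_b, vocab_size=2)
fused = fuse(hist_a, hist_b)  # one 5-dimensional vector per image
```

The fused vectors could then be passed to any standard classifier. Concatenation is chosen here only because it is the simplest way to combine vocabularies of different sizes.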
Document type :
Conference papers

Cited literature : 23 references

https://hal-ujm.archives-ouvertes.fr/ujm-00501523
Contributor : Christophe Moulin
Submitted on : Friday, August 13, 2010 - 1:44:03 PM
Last modification on : Wednesday, July 25, 2018 - 2:05:30 PM
Long-term archiving on : Tuesday, October 23, 2012 - 12:11:15 PM

File

CBMI_2010.pdf
Publisher files allowed on an open archive

Identifiers

  • HAL Id : ujm-00501523, version 1

Citation

Christophe Moulin, Cécile Barat, Christophe Ducottet. Fusion of tf.idf Weighted Bag of Visual Features for Image Classification. Content Based Multimedia Indexing, Jun 2010, France. pp.124-129. ⟨ujm-00501523⟩
