A. W. Smeulders, M. Worring, S. Santini, A. Gupta, and R. Jain, Content-based image retrieval at the end of the early years, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.22, issue.12, pp.1349-1380, 2000.
DOI : 10.1109/34.895972

M. Lew, N. Sebe, C. Djeraba, and R. Jain, Content-based multimedia information retrieval, ACM Transactions on Multimedia Computing, Communications, and Applications, vol.2, issue.1, pp.1-19, 2006.
DOI : 10.1145/1126004.1126005

M. Flickner, H. Sawhney, J. Ashley, Q. Huang, B. Dom et al., Query by image and video content: The qbic system, IEEE Computer, pp.28-51, 1995.

S. Antani, R. Kasturi, and R. Jain, A survey on the use of pattern recognition methods for abstraction, indexing and retrieval of images and video, Pattern Recognition, vol.35, issue.4, pp.945-965, 2002.
DOI : 10.1016/S0031-3203(01)00086-3

Y. Liu, D. Zhang, G. Lu, and W. Ma, A survey of content-based image retrieval with high-level semantics, Pattern Recognition, vol.40, issue.1, pp.262-282, 2007.
DOI : 10.1016/j.patcog.2006.04.045

R. Datta, D. Joshi, J. Li, and J. Z. Wang, Image retrieval, ACM Computing Surveys, vol.40, issue.2, pp.1-60, 2008.
DOI : 10.1145/1348246.1348248

G. Csurka, C. Dance, L. Fan, J. Willamowski, and C. Bray, Visual categorization with bags of keypoints, ECCV'04 : 8th European Conference on Computer Vision : workshop on Statistical Learning in Computer Vision, pp.59-74, 2004.

S. Lazebnik, C. Schmid, and J. Ponce, Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Volume 2 (CVPR'06), pp.2169-2178, 2006.
DOI : 10.1109/CVPR.2006.68
URL : https://hal.archives-ouvertes.fr/inria-00548585

J. Aucouturier, B. Defreville, and F. Pachet, The bag-of-frames approach to audio pattern recognition: A sufficient model for urban soundscapes but not for polyphonic music, The Journal of the Acoustical Society of America, vol.122, issue.2, p.881, 2007.
DOI : 10.1121/1.2750160

C. Schuldt, I. Laptev, and B. Caputo, Recognizing human actions: a local SVM approach, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004., pp.32-36, 2004.
DOI : 10.1109/ICPR.2004.1334462

R. Yan and A. G. Hauptmann, A review of text and image retrieval approaches for broadcast news video, Information Retrieval, vol.15, issue.1, pp.445-484, 2007.
DOI : 10.1007/s10791-007-9031-y

C. Moulin, C. Barat, C. Lema??trelema??tre, M. Géry, C. Ducottet et al., Combining text, CLEF'09 : 10th workshop of the Cross-Language Evaluation Forum, pp.164-171, 2009.
URL : https://hal.archives-ouvertes.fr/ujm-00432319

P. Atrey, M. Hossain, A. Saddik, and M. Kankanhalli, Multimodal fusion for multimedia analysis: a survey, Multimedia Systems, vol.24, issue.11, pp.345-379, 2010.
DOI : 10.1007/s00530-010-0182-0

B. T. Bartell, G. W. Cottrell, and R. K. Belew, Automatic Combination of Multiple Ranked Retrieval Systems, Proceedings of the 17th annual international ACM SIGIR conference on Research and development in information retrieval, SIGIR '94, pp.173-181, 1994.
DOI : 10.1007/978-1-4471-2099-5_18

J. A. Shaw and E. A. Fox, Combination of Multiple Searches, The Second Text REtrieval Conference (TREC-2, pp.243-252, 1993.

C. C. Vogt and G. W. Cottrell, Fusion via a linear combination of scores, Information Retrieval, vol.1, issue.3, pp.151-173, 1999.
DOI : 10.1023/A:1009980820262

A. Ross and A. Jain, Information fusion in biometrics, Pattern Recognition Letters, vol.24, issue.13, pp.2115-2125, 2003.
DOI : 10.1016/S0167-8655(03)00079-5

A. Rattani, D. R. Kisku, M. Bicego, and M. Tistarelli, Feature Level Fusion of Face and Fingerprint Biometrics, BTAS 2007, First IEEE International Conference on Biometrics: Theory, Applications, and Systems, pp.1-6, 2007.

Y. Fu, L. Cao, G. Guo, and T. S. Huang, Multiple feature fusion by subspace learning, Proceedings of the 2008 international conference on Content-based image and video retrieval, CIVR '08, pp.127-134, 2008.
DOI : 10.1145/1386352.1386373

A. Depeursinge, D. Racoceanu, J. Iavindrasana, G. Cohen, A. Platon et al., Fusing visual and clinical information for lung tissue classification in high-resolution computed tomography, Artificial Intelligence in Medicine, vol.50, issue.1, pp.13-21, 2010.
DOI : 10.1016/j.artmed.2010.04.006

A. Macedonas, S. Fotopoulos, and G. Economou, Improvement of Image Retrieval by Fusing Different Descriptors, Eighth International Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS '07), pp.75-75, 2007.
DOI : 10.1109/WIAMIS.2007.52

S. Wu and F. Crestani, Data fusion with estimated weights, Proceedings of the eleventh international conference on Information and knowledge management , CIKM '02, pp.648-651, 2002.
DOI : 10.1145/584792.584908

R. Nuray and F. Can, Automatic ranking of information retrieval systems using data fusion, Information Processing & Management, vol.42, issue.3, pp.595-614, 2006.
DOI : 10.1016/j.ipm.2005.03.023

J. A. Aslam and M. Montague, Models for metasearch, Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval , SIGIR '01, pp.276-284, 2001.
DOI : 10.1145/383952.384007

M. Montague and J. A. Aslam, Condorcet fusion for improved retrieval, Proceedings of the eleventh international conference on Information and knowledge management , CIKM '02, pp.538-548, 2002.
DOI : 10.1145/584792.584881

C. G. Snoek, Early versus late fusion in semantic video analysis, Proceedings of the 13th annual ACM international conference on Multimedia , MULTIMEDIA '05, pp.399-402, 2005.
DOI : 10.1145/1101149.1101236

P. S. Aleksic, J. J. Williams, Z. Wu, and A. K. Katsaggelos, Audio-visual speech recognition using mpeg-4 compliant visual features, EURASIP Journal on Applied Signal Processing, pp.1213-1227, 2002.

G. Iyengar, H. Nock, and C. Neti, Audio-visual synchrony for detection of monologues in video archives, 2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698), pp.329-332, 2003.
DOI : 10.1109/ICME.2003.1220921

M. Cheung, M. Mak, and S. Kung, A two-level fusion approach to multimodal biometric verification, IEEE Int. Conf. Acoust. Speech, Signal Processing (ICASSP), vol.5, pp.485-488, 2005.

D. N. Iskandar, J. Pehcevski, J. A. Thom, and S. M. Tahaghoghi, Combining image and structured text retrieval, INEX'05, pp.525-539, 2005.
DOI : 10.1007/11766278_40

M. Torjmen, K. Pinel-sauvagnat, and M. Boughanem, Methods for Combining Content-Based and Textual-Based Approaches in Medical Image Retrieval, pp.691-695, 2008.
DOI : 10.1007/978-3-540-30213-1_35

L. A. Alexandre, A. C. Campilho, and M. Kamel, Combining independent and unbiased classifiers using weighted average, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000, pp.2495-2498, 2000.
DOI : 10.1109/ICPR.2000.906120

Y. Wang, T. Tan, and A. K. Jain, Combining Face and Iris Biometrics for Identity Verification, Proceedings of the 4th international conference on Audio-and video-based biometric person authentication, AVBPA'03, pp.805-813, 2003.
DOI : 10.1007/3-540-44887-X_93

R. Fisher, THE USE OF MULTIPLE MEASUREMENTS IN TAXONOMIC PROBLEMS, Annals of Eugenics, vol.59, issue.2, pp.179-188, 1936.
DOI : 10.1111/j.1469-1809.1936.tb02137.x

C. Moulin, C. Largeron, and M. Géry, Impact of Visual Information on Text and Content Based Image Retrieval, S+SSPR'10 :13th international workshop on Structural, Syntactic, and Statistical Pattern Recognition, pp.159-169, 2010.
DOI : 10.1007/978-3-642-14980-1_15
URL : https://hal.archives-ouvertes.fr/hal-00526626

G. Salton, A. Wong, and C. Yang, A vector space model for automatic indexing, Communications of the ACM, vol.18, issue.11, pp.613-620, 1975.
DOI : 10.1145/361219.361220

G. Salton and M. J. Mcgill, Introduction to modern Information Retrieval, 1983.

S. Robertson, S. Walker, M. Hancock-beaulieu, A. Gull, and M. Lau, Okapi at trec-3, TREC-3 : 3rd Text REtrieval Conference, pp.21-30, 1994.

C. Zhai, Notes on the lemur tfidf model, 2001.

D. Lowe, Object recognition from local scale-invariant features, Proceedings of the Seventh IEEE International Conference on Computer Vision, pp.1150-1157, 1999.
DOI : 10.1109/ICCV.1999.790410
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.121.4065

D. Lowe, Distinctive Image Features from Scale-Invariant Keypoints, International Journal of Computer Vision, vol.60, issue.2, pp.91-110, 2004.
DOI : 10.1023/B:VISI.0000029664.99615.94
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.14.4931

S. Tollari and H. Glotin, Web image retrieval on ImagEVAL, Proceedings of the 6th ACM international conference on Image and video retrieval, CIVR '07, pp.65-72, 2007.
DOI : 10.1145/1282280.1282289
URL : https://hal.archives-ouvertes.fr/hal-00199823

S. Tollari, M. Detyniecki, C. Marsala, A. Fakeri-tabrizi, M. Amini et al., Exploiting Visual Concepts to Improve Text-Based Image Retrieval, ECIR'09 : Proceedings of European Conference on Information Retrieval, pp.701-705, 2009.
DOI : 10.1007/11788034_63
URL : https://hal.archives-ouvertes.fr/hal-00402448

C. D. Manning, P. Raghavan, and H. Schütze, Introduction to Information Retrieval, 2008.
DOI : 10.1017/CBO9780511809071

J. Nocedal and S. Wright, Numerical optimization, 1999.
DOI : 10.1007/b98874

R. Fisher, THE USE OF MULTIPLE MEASUREMENTS IN TAXONOMIC PROBLEMS, Annals of Eugenics, vol.59, issue.2, pp.179-188, 1936.
DOI : 10.1111/j.1469-1809.1936.tb02137.x

S. Mika, G. Ratsch, J. Weston, B. Scholkopf, and K. Mullers, Fisher discriminant analysis with kernels, Neural Networks for Signal Processing IX: Proceedings of the 1999 IEEE Signal Processing Society Workshop (Cat. No.98TH8468), pp.41-48, 1999.
DOI : 10.1109/NNSP.1999.788121

I. Jolliffe, Principal Component Analysis, 2002.
DOI : 10.1007/978-1-4757-1904-8

G. Mclachlan and J. Wiley, Discriminant analysis and statistical pattern recognition, 1992.
DOI : 10.1002/0471725293

T. Tsikrika and J. Kludas, Overview of the WikipediaMM Task at ImageCLEF 2008, pp.539-550, 2008.
DOI : 10.1007/978-3-540-73888-6_33

T. Tsikrika and J. Kludas, Overview of the WikipediaMM Task at ImageCLEF 2009, pp.60-71, 2009.
DOI : 10.1007/978-3-642-15751-6_7