Academia.eduAcademia.edu

Outline

ECL-LIRIS at TrecVid 2011: Semantic Indexing

2011, TRECVID

Abstract

This is the first time that our team participate TRECVID. This paper summarizes our approach submitted to Semantic Indexing (SIN) task in TRECVID 2011. Our approach adopts bag-of-features method to transform original visual and audio features into histogram features, using pre-trained codebook. After feature transformation, one-versus-others SVMs with Chi-square kernel are trained. In decision step, averaged probability is calculated as a final score to rank shots. Under this framework, we tested 4 visual features including dense grid SIFT, color SIFT, OLBPC and DAISY together with 1 audio feature consisting of MFCC with delta and acceleration. Our audio visual combination model achieves best results in terms of mean xinfAP. Besides, considering the huge amount of data this year, we employed several speedup strategies such as k-means clustering with GPU acceleration and homogeneous kernel map. All these efforts rank us at the 12 th out of 19 teams in full run and the 13 th out of 27 teams in the light run test.

References (17)

  1. REFERENCES
  2. A. F. Smeaton, P. Over, W. Kraaij, "Evaluation cam- paigns and TRECVid," In Proc. the 8th ACM Interna- tional Workshop on Multimedia Information Retrieval, pp.321-330, 2006
  3. A. F. Smeaton, P. Over, W. Kraaij, "High-Level Feature Detection from Video in TRECVid: a 5-Year Retro- spective of Achievements," in Multimedia Content Analysis, Theory and Applications, pp. 151-174, 2009
  4. D. Gorisse1, F. Precioso, P. Gosselin1, L. Granjon, D. Pellerin, M. Rombaut, H. Bredin, L. Koenig, R. Vieux, B. Mansencal, J. Benois-Pineau, H. Boujut, C. Morand, H. Jé gou, S. Ayache, B. Safadi, Y. Tong, F. Thollard, G. Qué not, M. Cord, A. Benoî t, PLambert, "IRIM at TRECVID 2011: Semantic Indexing and Instance Search," TREVID 2011 notebook paper.
  5. L. Bao, S.-I. Yu, Z.-Z. Lan, A. Overwijk, Q. Jin, B. Langner, M. Garbus, S. Burger, F. Metze, A. Haupt- mann, "Informedia @ TRECVID 2011," TREVID 2011 notebook paper.
  6. C.G.M. Snoek, K.E.A. van de Sande, X. Li, M. Mazloom, Y.-G. Jiang, D.C. Koelma, A.W.M. Smeulders, "The MediaMill TRECVID 2011 Semantic Video Search Engine," TREVID 2011 notebook paper.
  7. D. G. Lowe, "Distinctive image features from scale- invariant keypoints," in International Journal of Com- puter Vision, vol. 60, no. 2, pp. 91-110, 2004
  8. G. J. Burghouts, J. M. Geusebroek. "Performance eval- uation of local colour invariants," in Computer Vision and Image Understanding, 113:48-62, 2009.
  9. T. Ojala, M. Pietikainen, D. Harwood, "A comparative study of texture measures with classification based on feature distribution," in Pattern Recognition 29 (1996) 51-59.
  10. C. Zhu, C.-E. Bichot, L. Chen, "Color orthogonal local binary patterns combination for image region descrip- tion," in Technical Report, LIRIS UMR5205 CNRS, Ecole Centrale de Lyon, 2011.
  11. E. Tola, V. Lepetit, P. Fua, "Daisy: an Efficient Dense Descriptor Applied to Wide Baseline Stereo," IEEE PAMI, vol. 32, no. 5, pp. 815-830, 2010.
  12. C. Zhu, C.-E. Bichot and L. Chen, "Visual object recognition using daisy descriptor," In Proc. IEEE In- ternational Conference on Multimedia and Expo (ICME), Jul. 2011.
  13. S. B. Davis, P. Mermelstein, "Comparison of Paramet- ric Representations for Monosyllabic Word Recognition in Continuously Spoken Sentences," in IEEE Transac- tions on Acoustics, Speech, and Signal Processing, 28(4), pp. 357-366, 1980.
  14. F. Eyben, M. Wöllmer, B. Schuller: "openSMILE -The Munich Versatile and Fast Open-Source Audio Feature Extractor," in Proc. ACM Multimedia (MM), ACM, Fi- renze, Italy, 25.-29.10.2010.
  15. A. Vedaldi, A. Zisserman, "Efficient additive kernels via explicit feature maps," In Proc. CVPR, 2010.
  16. A. Vedaldi, B. Fulkerson, "VLFeat: An open and porta- ble library of computer vision algorithms," 2008.
  17. R.-E. Fan, K.-W. Chang, C.-J. Hsieh, X.-R. Wang, and C.-J. Lin. "LIBLINEAR: A library for large linear clas- sification," in Journal of Machine Learning Research 9(2008), 1871-1874.