Academia.eduAcademia.edu

Outline

COSTA: Co-Occurrence Statistics for Zero-Shot Classification

2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition

https://doi.org/10.1109/CVPR.2014.313

Abstract

In this paper we aim for zero-shot classification, that is visual recognition of an unseen class by using knowledge transfer from known classes. Our main contribution is COSTA, which exploits co-occurrences of visual concepts in images for knowledge transfer. These inter-dependencies arise naturally between concepts, and are easy to obtain from existing annotations or web-search hit counts. We estimate a classifier for a new label, as a weighted combination of related classes, using the co-occurrences to define the weight. We propose various metrics to leverage these cooccurrences, and a regression model for learning a weight for each related class. We also show that our zero-shot classifiers can serve as priors for few-shot learning. Experiments on three multi-labeled datasets reveal that our proposed zero-shot methods, are approaching and occasionally outperforming fully supervised SVMs. We conclude that cooccurrence statistics suffice for zero-shot classification.

References (29)

  1. Z. Akata, F. Perronnin, Z. Harchaoui, and C. Schmid. Label- embedding for attribute-based classification. In CVPR, 2013. 2, 5
  2. E. Bart and S. Ullman. Cross-generalization: Learning novel classes from a single example by feature replacement. In CVPR, 2005. 2
  3. G. Chen, Y. Ding, J. Xiao, and T. Han. Detection evolution with multi-order contextual co-occurrence. In CVPR, 2013. 3
  4. M. Choi, J. Lim, A. Torralba, and A. Willsky. Exploiting hierarchical context on a large database of object categories. In CVPR, 2010. 3, 5
  5. M. Everingham, L. Van Gool, C. Williams, J. Winn, and A. Zisserman. The Pascal visual object classes (VOC) chal- lenge. IJCV, 2010. 2, 5
  6. R.-E. Fan, K.-W. Chang, C.-J. Hsieh, X.-R. Wang, and C.- J. Lin. Liblinear: A library for large linear classification. JMLR, 2008. 4
  7. H. Grabner, J. Gall, and L. V. Gool. What makes a chair a chair? In CVPR, 2011. 1
  8. B. Hariharan, S. Vishwanathan, and M. Varma. Efficient max-margin multi-label classification with applications to zero-shot learning. MLJ, 2012. 3
  9. H. Jégou and O. Chum. Negative evidences and co- occurrences in image retrieval: the benefit of PCA and whitening. In ECCV, 2012. 3
  10. H. Jégou, M. Douze, and C. Schmid. On the burstiness of visual elements. In CVPR, 2009. 3
  11. L. Ladický, C. R. P. Kohli, and P. Torr. Graph cut based inference with co-occurrence statistics. In ECCV, 2010. 3
  12. C. Lampert, H. Nickisch, and S. Harmeling. Attribute- based classification for zero-shot learning of object cate- gories. IEEE Trans. PAMI, 2013. 1, 2, 3, 6, 7
  13. F.-F. Li, R. Fergus, and P. Perona. One-shot learning of ob- ject categories. IEEE Trans. PAMI, 2006. 2
  14. T. Malisiewicz and A. Efros. Beyond categories: the visual memex model for reasoning about object relationships. In NIPS, 2009. 1, 3
  15. T. Mensink, J. Verbeek, and G. Csurka. Tree-structured CRF models for interactive image labeling. IEEE Trans. PAMI, 2012. 3, 5
  16. T. Mensink, J. Verbeek, F. Perronnin, and G. Csurka. Distance-based image classification: Generalizing to new classes at near-zero cost. IEEE Trans. PAMI, 2013. 1, 2
  17. S. Nowak and M. Huiskes. New strategies for image annota- tion: Overview of the photo annotation task at ImageCLEF 2010. In Working Notes of CLEF, 2010. 5
  18. F. Orabona, C. Castellini, B. Caputo, A. E. Fiorilla, and G. Sandini. Model adaptation with least-squares svm for adaptive hand prosthetics. In IEEE ICRA, 2009. 4
  19. M. Rastegari, A. Farhadi, and D. Forsyth. Attribute discov- ery via predictable discriminative binary codes. In ECCV, 2012. 2
  20. M. Rohrbach, M. Stark, and B. Schiele. Evaluating knowl- edge transfer and zero-shot learning in a large-scale setting. In CVPR, 2011. 1, 2
  21. M. Rohrbach, M. Stark, G. Szarvas, I. Gurevych, and B. Schiele. What helps where -and why? semantic relat- edness for knowledge transfer. In CVPR, 2010. 2, 4, 6
  22. M. Sadeghi and A. Farhadi. Recognition using visual phrases. In CVPR, 2011. 3
  23. R. Salakhutdinov, A. Torralba, and J. Tenenbaum. Learning to share visual appearance for multiclass object detection. In CVPR, 2011. 2
  24. J. Sánchez, F. Perronnin, T. Mensink, and J. Verbeek. Im- age classification with the fisher vector: Theory and practice. IJCV, 2013. 3, 4
  25. V. Sharmanska, N. Quadrianto, and C. Lampert. Augmented attribute representations. In ECCV, 2012. 2
  26. T. Tommasi and B. Caputo. The more you know, the less you learn: from knowledge transfer to one-shot learning of object categories. In BMVC, 2009. 2
  27. T. Tommasi, F. Orabona, and B. Caputo. Safety in numbers: Learning categories from few examples with multi model knowledge transfer. In CVPR, 2010. 4
  28. C. Wah, S. Branson, P. Welinder, P. Perona, and S. Belongie. The caltech-ucsd birds-200-2011 dataset. Technical report, Computation & Neural Systems, 2011. 1, 2, 3, 5
  29. J. Xiao, J. Hays, K. A. Ehinger, A. Oliva, and A. Torralba. SUN database: Large-scale scene recognition from abbey to zoo. In CVPR, 2010. 2, 5