COSTA: Co-Occurrence Statistics for Zero-Shot Classification
2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition
https://doi.org/10.1109/CVPR.2014.313Abstract
In this paper we aim for zero-shot classification, that is visual recognition of an unseen class by using knowledge transfer from known classes. Our main contribution is COSTA, which exploits co-occurrences of visual concepts in images for knowledge transfer. These inter-dependencies arise naturally between concepts, and are easy to obtain from existing annotations or web-search hit counts. We estimate a classifier for a new label, as a weighted combination of related classes, using the co-occurrences to define the weight. We propose various metrics to leverage these cooccurrences, and a regression model for learning a weight for each related class. We also show that our zero-shot classifiers can serve as priors for few-shot learning. Experiments on three multi-labeled datasets reveal that our proposed zero-shot methods, are approaching and occasionally outperforming fully supervised SVMs. We conclude that cooccurrence statistics suffice for zero-shot classification.
References (29)
- Z. Akata, F. Perronnin, Z. Harchaoui, and C. Schmid. Label- embedding for attribute-based classification. In CVPR, 2013. 2, 5
- E. Bart and S. Ullman. Cross-generalization: Learning novel classes from a single example by feature replacement. In CVPR, 2005. 2
- G. Chen, Y. Ding, J. Xiao, and T. Han. Detection evolution with multi-order contextual co-occurrence. In CVPR, 2013. 3
- M. Choi, J. Lim, A. Torralba, and A. Willsky. Exploiting hierarchical context on a large database of object categories. In CVPR, 2010. 3, 5
- M. Everingham, L. Van Gool, C. Williams, J. Winn, and A. Zisserman. The Pascal visual object classes (VOC) chal- lenge. IJCV, 2010. 2, 5
- R.-E. Fan, K.-W. Chang, C.-J. Hsieh, X.-R. Wang, and C.- J. Lin. Liblinear: A library for large linear classification. JMLR, 2008. 4
- H. Grabner, J. Gall, and L. V. Gool. What makes a chair a chair? In CVPR, 2011. 1
- B. Hariharan, S. Vishwanathan, and M. Varma. Efficient max-margin multi-label classification with applications to zero-shot learning. MLJ, 2012. 3
- H. Jégou and O. Chum. Negative evidences and co- occurrences in image retrieval: the benefit of PCA and whitening. In ECCV, 2012. 3
- H. Jégou, M. Douze, and C. Schmid. On the burstiness of visual elements. In CVPR, 2009. 3
- L. Ladický, C. R. P. Kohli, and P. Torr. Graph cut based inference with co-occurrence statistics. In ECCV, 2010. 3
- C. Lampert, H. Nickisch, and S. Harmeling. Attribute- based classification for zero-shot learning of object cate- gories. IEEE Trans. PAMI, 2013. 1, 2, 3, 6, 7
- F.-F. Li, R. Fergus, and P. Perona. One-shot learning of ob- ject categories. IEEE Trans. PAMI, 2006. 2
- T. Malisiewicz and A. Efros. Beyond categories: the visual memex model for reasoning about object relationships. In NIPS, 2009. 1, 3
- T. Mensink, J. Verbeek, and G. Csurka. Tree-structured CRF models for interactive image labeling. IEEE Trans. PAMI, 2012. 3, 5
- T. Mensink, J. Verbeek, F. Perronnin, and G. Csurka. Distance-based image classification: Generalizing to new classes at near-zero cost. IEEE Trans. PAMI, 2013. 1, 2
- S. Nowak and M. Huiskes. New strategies for image annota- tion: Overview of the photo annotation task at ImageCLEF 2010. In Working Notes of CLEF, 2010. 5
- F. Orabona, C. Castellini, B. Caputo, A. E. Fiorilla, and G. Sandini. Model adaptation with least-squares svm for adaptive hand prosthetics. In IEEE ICRA, 2009. 4
- M. Rastegari, A. Farhadi, and D. Forsyth. Attribute discov- ery via predictable discriminative binary codes. In ECCV, 2012. 2
- M. Rohrbach, M. Stark, and B. Schiele. Evaluating knowl- edge transfer and zero-shot learning in a large-scale setting. In CVPR, 2011. 1, 2
- M. Rohrbach, M. Stark, G. Szarvas, I. Gurevych, and B. Schiele. What helps where -and why? semantic relat- edness for knowledge transfer. In CVPR, 2010. 2, 4, 6
- M. Sadeghi and A. Farhadi. Recognition using visual phrases. In CVPR, 2011. 3
- R. Salakhutdinov, A. Torralba, and J. Tenenbaum. Learning to share visual appearance for multiclass object detection. In CVPR, 2011. 2
- J. Sánchez, F. Perronnin, T. Mensink, and J. Verbeek. Im- age classification with the fisher vector: Theory and practice. IJCV, 2013. 3, 4
- V. Sharmanska, N. Quadrianto, and C. Lampert. Augmented attribute representations. In ECCV, 2012. 2
- T. Tommasi and B. Caputo. The more you know, the less you learn: from knowledge transfer to one-shot learning of object categories. In BMVC, 2009. 2
- T. Tommasi, F. Orabona, and B. Caputo. Safety in numbers: Learning categories from few examples with multi model knowledge transfer. In CVPR, 2010. 4
- C. Wah, S. Branson, P. Welinder, P. Perona, and S. Belongie. The caltech-ucsd birds-200-2011 dataset. Technical report, Computation & Neural Systems, 2011. 1, 2, 3, 5
- J. Xiao, J. Hays, K. A. Ehinger, A. Oliva, and A. Torralba. SUN database: Large-scale scene recognition from abbey to zoo. In CVPR, 2010. 2, 5