Abstract
Abstract. We introduce a complete pipeline for recognizing and classifying people's clothing in natural scenes. This has several interesting applications, including e-commerce, event and activity recognition, online advertising, etc. The stages of the pipeline combine a number of state-of-the-art building blocks such as upper body detectors, various feature channels and visual attributes. The core of our method consists of a multi-class learner based on a Random Forest that uses strong discriminative learners as decision nodes.
References (29)
- Bay, H., Tuytelaars, T., Van Gool, L.: SURF: Speeded Up Robust Features. ICCV (2006)
- Breiman, L.: Random forests. Machine Learning. (2001) 5-32
- Caruana, R., Karampatziakis, N., Yessenalina, A.: An empirical evaluation of supervised learning methods in high dimensions. ICML (2008)
- Chen, H., Xu, Z.J., Liu, Z.Q., Zhu, S.C.: Composite Templates for Cloth Modeling and Sketching. CVPR (2006)
- Criminisi, A., Shotton, J., Konukoglu, E.: Decision forests for classification, regres- sion, density estimation, manifold learning and semi-supervised learning. Technical Report MSR-TR-2011-114, Microsoft Research (2011)
- Dalal, N., Triggs, B.: Histograms of Oriented Gradients for Human Detection. CVPR (2005)
- Daumé, H.: Frustratingly easy domain adaptation. Annual meeting-association for computational linguistics. Volume 45. (2007) 256
- Deng, J., Dong, W., Socher, R., Li, L.J., Li, K., Fei-Fei, L.: Imagenet: A large-scale hierarchical image database. CVPR (2009)
- Eichner, M., Ferrari, V.: CALVIN Upper-body detector for detection in still images 10. Fan, R.E., Chang, K.W., Hsieh, C.J., Wang, X.R., Lin, C.J.: LIBLINEAR: A library for large linear classification. JMLR 9 (2008)
- Farhadi, A., Endres, I., Hoiem, D., Forsyth, D.: Describing objects by their at- tributes. CVPR (2009)
- Ferrari, V., Zisserman, A.: Learning visual attributes. NIPS (2008)
- Gallagher, A.C.: Clothing cosegmentation for recognizing people. CVPR (2008)
- Hu, Z., Yan, H., Lin, X.: Clothing segmentation using foreground and background estimation based on the constrained Delaunay triangulation. Pattern Recognition 41 (2008)
- Joachims, T.: Transductive inference for text classification using support vector machines. ICML (1999)
- Kumar, N., Berg, A.C., Belhumeur, P.N., Nayar, S.K.: Attribute and simile clas- sifiers for face verification. ICCV (2009)
- Lampert, C., Nickisch, H., Harmeling, S.: Learning to detect unseen object classes by between-class attribute transfer. CVPR (2009)
- Lazebnik, S., Schmid, C., Ponce, J.: Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories. CVPR (2006)
- Leistner, C., Saffari, A., Santner, J., Bischof, H.: Semi-Supervised Random Forests. ICCV (2009)
- Liu, S., Song, Z., Liu, G., Xu, C., Lu, H., Yan, S.: Street-to-Shop: Cross-Scenario Clothing Retrieval via Parts Alignment and Auxiliary Set. CVPR (2012)
- Ojala, T., Pietikainen, M., Harwood, D.: Performance evaluation of texture mea- sures with classification based on Kullback discrimination of distributions. ICOR (1994)
- Pan, S.J., Yang, Q.: A survey on transfer learning. TKDE (2010)
- Shechtman, E., Irani, M.: Matching Local Self-Similarities across Images and Videos. CVPR (2007)
- Song, Z., Wang, M., Hua, X.s., Yan, S.: Predicting occupation via human clothing and contexts. ICCV (2011)
- Sorokin, A., Forsyth, D.: Utility data annotation with amazon mechanical turk. Workshop on Internet Vision. (2008)
- Stark, M., Goesele, M., Schiele, B.: A shape-based object class model for knowledge transfer. ICCV (2009)
- Wang, N., Ai, H.: Who Blocks Who: Simultaneous clothing segmentation for group- ing images. ICCV (2011)
- Wang, X., Zhang, T.: Clothes search in consumer photos via color matching and attribute learning. MM, ACM Press (2011)
- Yamaguchi, K., Kiapour, H., Ortiz, L., Berg, T.L.: Parsing Clothing in Fashion Photographs. CVPR (2012)
- Yao, B., Khosla, A., Fei-Fei, L.: Combining randomization and discrimination for fine-grained image categorization. CVPR (2011)