Academia.eduAcademia.edu

Outline

Occlusion-robust Detector Trained with Occluded Pedestrians

Proceedings of the 7th International Conference on Pattern Recognition Applications and Methods

https://doi.org/10.5220/0006569200860094

Abstract

Pedestrian detection has achieved a remarkable progress in recent years, but challenges remain especially when occlusion happens. Intuitively, occluded pedestrian samples contain some characteristic occlusion appearance features that can help to improve detection. However, we have observed that most existing approaches intentionally avoid using samples of occluded pedestrians during the training stage. This is because such samples will introduce unreliable information, which affects the learning of model parameters and thus results in dramatic performance decline. In this paper, we propose a new framework for pedestrian detection. The proposed method exploits the use of occluded pedestrian samples to learn more robust features for discriminating pedestrians, and enables better performances on pedestrian detection, especially for the occluded pedestrians (which always happens in many real applications). Compared to some recent detectors on Caltech Pedestrian dataset, with our proposed method, detection miss rate for occluded pedestrians are significantly reduced.

References (26)

  1. Angelova, A., Krizhevsky, A., Vanhoucke, V., Ogale, A. S., and Ferguson, D. (2015). Real-time pedestrian detec- tion with deep network cascades. In BMVC, pages 32-1.
  2. Benenson, R., Mathias, M., Timofte, R., and Van Gool, L. (2012). Pedestrian detection at 100 frames per second. In Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on, pages 2903-2910. IEEE.
  3. Benenson, R., Mathias, M., Tuytelaars, T., and Van Gool, L. (2013). Seeking the strongest rigid detector. In Pro- ceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 3666-3673.
  4. Dalal, N. and Triggs, B. (2005). Histograms of oriented gra- dients for human detection. In Computer Vision and Pattern Recognition, 2005. CVPR 2005. IEEE Com- puter Society Conference on, volume 1, pages 886- 893. IEEE.
  5. Dollár, P., Appel, R., Belongie, S., and Perona, P. (2014). Fast feature pyramids for object detection. IEEE Transactions on Pattern Analysis and Machine Intel- ligence, 36(8):1532-1545.
  6. Dollár, P., Belongie, S. J., and Perona, P. (2010). The fastest pedestrian detector in the west. In BMVC, volume 2, page 7.
  7. Dollár, P., Tu, Z., Perona, P., and Belongie, S. (2009). Inte- gral channel features.
  8. Dollar, P., Wojek, C., Schiele, B., and Perona, P. (2012). Pedestrian detection: An evaluation of the state of the art. IEEE transactions on pattern analysis and ma- chine intelligence, 34(4):743-761.
  9. Enzweiler, M., Eigenstetter, A., Schiele, B., and Gavrila, D. M. (2010). Multi-cue pedestrian classification with partial occlusion handling. In Computer vision and pattern recognition (CVPR), 2010 IEEE Conference on, pages 990-997. IEEE.
  10. Felzenszwalb, P. F., Girshick, R. B., McAllester, D., and Ramanan, D. (2010). Object detection with discrim- inatively trained part-based models. IEEE transac- tions on pattern analysis and machine intelligence, 32(9):1627-1645.
  11. Friedman, J., Hastie, T., Tibshirani, R., et al. (2000). Addi- tive logistic regression: a statistical view of boosting (with discussion and a rejoinder by the authors). The annals of statistics, 28(2):337-407.
  12. Mathias, M., Benenson, R., Timofte, R., and Van Gool, L. (2013). Handling occlusions with franken-classifiers. In Proceedings of the IEEE International Conference on Computer Vision, pages 1505-1512.
  13. Nam, W., Dollár, P., and Han, J. H. (2014). Local decorre- lation for improved detection. Eprint Arxiv.
  14. Ohn-Bar, E. and Trivedi, M. M. (2016). To boost or not to boost? on the limits of boosted trees for object detec- tion. In Pattern Recognition (ICPR), 2016 23rd Inter- national Conference on, pages 3350-3355. IEEE.
  15. Ouyang, W. and Wang, X. (2013a). Joint deep learning for pedestrian detection. In Proceedings of the IEEE International Conference on Computer Vision, pages 2056-2063.
  16. Ouyang, W. and Wang, X. (2013b). Single-pedestrian de- tection aided by multi-pedestrian detection. In Pro- ceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 3198-3205.
  17. Ouyang, W., Zeng, X., and Wang, X. (2016). Partial oc- clusion handling in pedestrian detection with a deep model. IEEE Transactions on Circuits and Systems for Video Technology, 26(11):2123-2137.
  18. Paisitkriangkrai, S., Shen, C., and van den Hengel, A. (2016). Pedestrian detection with spatially pooled fea- tures and structured ensemble learning. IEEE trans- actions on pattern analysis and machine intelligence, 38(6):1243-1257.
  19. Tang, S., Andriluka, M., and Schiele, B. (2014). Detection and tracking of occluded people. International Jour- nal of Computer Vision, 110(1):58-69.
  20. Tian, Y., Luo, P., Wang, X., and Tang, X. (2015). Pedes- trian detection aided by deep learning semantic tasks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 5079-5087.
  21. Viola, P. and Jones, M. J. (2004). Robust real-time face detection. International journal of computer vision, 57(2):137-154.
  22. Walk, S., Majer, N., Schindler, K., and Schiele, B. (2010). New features and insights for pedestrian detection. In Computer vision and pattern recognition (CVPR), 2010 IEEE conference on, pages 1030-1037. IEEE.
  23. Wang, X., Han, T. X., and Yan, S. (2009). An hog-lbp human detector with partial occlusion handling. In Computer Vision, 2009 IEEE 12th International Con- ference on, pages 32-39. IEEE.
  24. Wojek, C., Walk, S., Roth, S., and Schiele, B. (2011). Monocular 3d scene understanding with explicit oc- clusion reasoning. In Computer Vision and Pat- tern Recognition (CVPR), 2011 IEEE Conference on, pages 1993-2000. IEEE.
  25. Yang, B., Yan, J., Lei, Z., and Li, S. Z. (2015). Convolu- tional channel features. In Proceedings of the IEEE in- ternational conference on computer vision, pages 82- 90.
  26. Zhang, S., Benenson, R., and Schiele, B. (2015). Filtered channel features for pedestrian detection. In Computer Vision and Pattern Recognition (CVPR), 2015 IEEE Conference on, pages 1751-1760. IEEE.