Academia.eduAcademia.edu

Outline

Robust Real-Time Face Detection

Abstract

This paper describes a face detection framework that is capable of processing images extremely rapidly while achieving high detection rates. There are three key contributions. The first is the introduction of a new image representation called the " Integral Image " which allows the features used by our detector to be computed very quickly. The second is a simple and efficient classifier which is built using the AdaBoost learning algorithm (Freund and Schapire, 1995) to select a small number of critical visual features from a very large set of potential features. The third contribution is a method for combining classifiers in a " cascade " which allows background regions of the image to be quickly discarded while spending more computation on promising face-like regions. A set of experiments in the domain of face detection is presented. The system yields face detection performance comparable to the best previous systems (Sung and a conventional desktop, face detection proceeds at 15 frames per second.

References (22)

  1. Amit, Y. and Geman, D. 1999. A computational model for visual selection. Neural Computation, 11:1691-1715.
  2. Crow, F. 1984. Summed-area tables for texture mapping. In Proceed- ings of SIGGRAPH, 18(3):207-212.
  3. Fleuret, F. and Geman, D. 2001. Coarse-to-fine face detection. Int. J. Computer Vision, 41:85-107.
  4. Freeman, W.T. and Adelson, E.H. 1991. The design and use of steer- able filters. IEEE Transactions on Pattern Analysis and Machine Intelligence, 13(9):891-906.
  5. Freund, Y. and Schapire, R.E. 1995. A decision-theoretic generaliza- tion of on-line learning and an application to boosting. In Com- putational Learning Theory: Eurocolt 95, Springer-Verlag, pp. 23-37.
  6. Greenspan, H., Belongie, S., Gooodman, R., Perona, P., Rakshit, S., and Anderson, C. 1994. Overcomplete steerable pyramid filters and rotation invariance. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.
  7. Itti, L., Koch, C., and Niebur, E. 1998. A model of saliency-based visual attention for rapid scene analysis. IEEE Patt. Anal. Mach. Intell., 20(11):1254--1259.
  8. John, G., Kohavi, R., and Pfeger, K. 1994. Irrelevant features and the subset selection problem. In Machine Learning Conference Proceedings.
  9. Osuna, E., Freund, R., and Girosi, F. 1997a. Training support vector machines: An application to face detection. In Proceed- ings of the IEEE Conference on Computer Vision and Pattern Recognition.
  10. Osuna, E., Freund, R., and Girosi, F. 1997b. Training support vector machines: an application to face detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.
  11. Papageorgiou, C., Oren, M., and Poggio, T. 1998. A general frame- work for object detection. In International Conference on Com- puter Vision.
  12. Quinlan, J. 1986. Induction of decision trees. Machine Learning, 1:81-106.
  13. Roth, D., Yang, M., and Ahuja, N. 2000. A snowbased face detector. In Neural Information Processing 12.
  14. Rowley, H., Baluja, S., and Kanade, T. 1998. Neural network-based face detection. IEEE Patt. Anal. Mach. Intell., 20:22-38.
  15. Schapire, R.E., Freund, Y., Bartlett, P., and Lee, W.S. 1997. Boost- ing the margin: A new explanation for the effectiveness of voting methods. In Proceedings of the Fourteenth International Confer- ence on Machine Learning.
  16. Schapire, R.E., Freund, Y., Bartlett, P., and Lee, W.S. 1998. Boost- ing the margin: A new explanation for the effectiveness of voting methods. Ann. Stat., 26(5):1651-1686.
  17. Schneiderman, H. and Kanade, T. 2000. A statistical method for 3D object detection applied to faces and cars. In International Conference on Computer Vision.
  18. Simard, P.Y., Bottou, L., Haffner, P., and LeCun, Y. (1999). Boxlets: A fast convolution algorithm for signal processing and neural net- works. In M. Kearns, S. Solla, and D. Cohn (Eds.), Advances in Neural Information Processing Systems, vol. 11, pp. 571- 577.
  19. Sung, K. and Poggio, T. 1998. Example-based learning for view- based face detection. IEEE Patt. Anal. Mach. Intell., 20:39-51.
  20. Tieu, K. and Viola, P. 2000. Boosting image retrieval. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recog- nition.
  21. Tsotsos, J., Culhane, S., Wai, W., Lai, Y., Davis, N., and Nuflo, F. 1995. Modeling visual-attention via selective tuning. Artificial Intelligence Journal, 78(1/2):507-545.
  22. Webb, A. 1999. Statistical Pattern Recognition. Oxford University Press: New York.