Academia.eduAcademia.edu

Outline

Image Segmentation in Video Sequences

Abstract

\Background subtraction" is an old technique for nding moving objects in a video sequence|for example, cars driving on a freeway. The idea is that subtracting the current image from a time-averaged background image will leave only nonstationary objects. It is, however, a crude approximation to the task of classifying each pixel of the current image; it fails with slow-moving objects and does not distinguish shadows from moving objects. The basic idea of this paper is that we can classify each pixel using a model of how that pixel looks when it is part of di erent classes. We learn a mixtureof-Gaussians classi cation model for each pixel using an unsupervised technique|an e cient, incremental version of EM. Unlike the standard image-averaging approach, this automatically updates the mixture component for each class according to likelihood of membership; hence slow-moving objects are handled perfectly. Our approach also identi es and eliminates shadows much more e ectively than other techniques such as thresholding. Application of this method as part of the Roadwatch tra c surveillance project is expected to result in signi cant improvements in vehicle identi cation and tracking.

References (9)

  1. Dempster et al., 1977] A. Dempster, N. Laird, and D. Rubin. Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society, 39 (Se- ries B):1{38, 1977.
  2. Huang et al., 1994] T. Huang, D. Koller, J. Malik, G. Ogasawara, B. Rao, S. Russell, and J. Weber. Automatic symbolic tra c scene analysis using belief networks. In Proceedings of the Twelfth National Conference on Arti cial Intelligence (AAAI-94), pages 966{972, Seattle, Washington, August 1994. AAAI Press.
  3. Jepson, 1993] M.J. Jepson, A.; Black. Mixture models for optical ow computation. In CVPR, pages 760{1, 1993.
  4. Kilger, 1992] M. Kilger. A shadow handler in a video-based real-time tra c monitoring system. In IEEE Workshop on Applications of Computer Vision, 1992.
  5. Koller et al., 1994] D. Koller, J. Weber, T. Huang, J. Malik, G. Ogasawara, B. Rao, and S. Russell. Towards robust automatic tra c scene analysis in real-time. In Proceedings of the International Conference on Pattern Recognition, Israel, November 1994. McLachlan and Krishnan, 1997] G. J. McLachlan and T. Krishnan. The EM Algorithm and Extensions. Wiley Interscience, 1997.
  6. Michalopoulos, 1991] P. Michalopoulos. Vehicle detection video through image processing: the Autoscope system. IEEE Trans. on Vehicular Technology, 40:21|29, 1991. Neal and Hinton, 1993] R. M. Neal and G. E. Hinton. A new view of the EM algorithm that justi es incremental and other variants. unpublished manuscript, 1993.
  7. Nowlan, 1991] S. J. Nowlan. Soft Competitive Adaptation: Neural Network Learning Al- gorithms based on Fitting Statistical Mixtures. PhD thesis, School of Computer Science, Carnegie Mellon University, 1991.
  8. Samadani, 1995] R. Samadani. A nite mixtures algorithm for nding proportions in sar images. IEEE Trans. on Image Processing, 4:1182{6, 1995.
  9. Sawhney, 1996] S. Sawhney, H.S.; Ayer. Compact representations of videos through domi- nant and multiple motion estimation. PAMI, 18:814{30, 1996.