On the sustained tracking of human motion
2008, 2008 8th IEEE International Conference on Automatic Face & Gesture Recognition
https://doi.org/10.1109/AFGR.2008.4813456Abstract
In this paper, we propose an algorithm for sustained tracking of humans, where we combine frame-to-frame articulated motion estimation with a per-frame body detection algorithm. The proposed approach can automatically recover from tracking error and drift. The frame-to-frame motion estimation algorithm replaces traditional dynamic models within a filtering framework. Stable and accurate per-frame motion is estimated via an image-gradient based algorithm that solves a linear constrained least squares system. The per-frame detector learns appearance of different body parts and 'sketches' expected gradient maps to detect discriminant pose configurations in images. The resulting online algorithm is computationally efficient and has been widely tested on a large dataset of sequences of drivers in vehicles. It shows stability and sustained accuracy over thousands of frames.
References (23)
- Y. Bar-Shalom. Tracking and data association. Academic Press Professional, 1987.
- C. Bregler and J. Malik. Tracking people with twists and exponential maps. IEEE International Conference on Com- puter Vision and Pattern Recognition, 1998.
- A. Datta, Y. Sheikh, and T. Kanade. Linear motion estima- tion for systems of articulated planes. IEEE International Conference on Computer Vision and Pattern Recognition, 2008.
- A. Fathi and G. Mori. Human pose estimation using motion exemplars. IEEE International Conference on Computer Vi- sion, 2007.
- P. Felzenszwalb and D. Huttenlocher. Efficient matching of pictorial structures. IEEE International Conference on Com- puter Vision and Pattern Recogition, 2000.
- M. Fischler and R. Elschlager. The representation and matching of pictorial images. IEEE Transactions on Com- puters, 1973.
- D. Forsyth, O. Arikan, L. Ikemoto, J. O'Brien, and D. Ra- manan. Computational studies of human motion: Part 1, tracking and motion synthesis. Foundations and Trends in Computer Graphics and Vision, 2006.
- D. Forsyth and J. Ponce. Computer vision -a modern ap- proach. Prentice Hall, 2003.
- D. Gavrila. The visual analysis of human movement: A sur- vey. Computer Vision and Image Understanding, 1999.
- R. Hartley and A. Zisserman. Multiple view geometry in computer vision. Cambridge University Press, 2000.
- M. Isard and A. Blake. Contour tracking by stochastic propa- gation of conditional density. European Conference on Com- puter Vision, 1996.
- S. Ju, M. Black, and Y. Yacoob. Cardboard people: A pa- rameterized model of articulated image motion. Automatic Face and Gesture Recognition, 1996.
- T. Moeslund, A. Hilton, and V. Krger. A survey of advances in vision-based human motion capture and analysis. Com- puter Vision and Image Understanding, 2006.
- G. Mori, X. Ren, A. Efros, and J. Malik. Recovering human body configurations: Combining segmentation and recogni- tion. IEEE International Conference on Computer Vision and Pattern Recognition, 2004.
- V. Pavlovic, james Rehg, and J. MacCormick. Learning switching linear models of human motion. Neural Informa- tion Processing Systems, 2000.
- V. Pavolvić, J. Rehg, T.-J. Cham, and K. Murphy. A dynamic bayesian network approach to figure tracking using learned dynamic models. IEEE International Conference on Com- puter Vision, 1999.
- D. Ramanan, D. Forsyth, and A. Zisserman. Strike a pose: Tracking people by finding stylized poses. IEEE Interna- tional Conference on Computer Vision and Pattern Recogni- tion, 2005.
- L. Ren, A. Patrick, A. Efros, J. Hodgins, and J. Rehg. A data-driven approach to quantifying natural human motion. ACM Transactions on Graphics, 2005.
- R. Rosales and S. Sclaroff. Learning body pose via special- ized maps. Neural Information Processing Systems, 2002.
- Y. Sheikh, M. Sheikh, and M. Shah. Exploring the space of human motions. IEEE International Conference on Com- puter Vision, 2005.
- H. Sidenbladh, M. Black, and L. Sigal. Implicit probabilistic models of human motion for synthesis and tracking. Euro- pean Conference on Computer Vision, 2002.
- L. Sigal and M. Black. Predicting 3d people from 2d pic- tures. Conference on Articulated Motion and Deformable Objects, 2006.
- C. Wren and A. Pentland. Dynamic models of human mo- tion. IEEE Proceedings of Automatic Face and Gesture Recognition, 1998.