Human activity recognition for domestic robots
https://doi.org/10.1007/978-3-319-07488-7_27Abstract
Capabilities of domestic service robots could be further improved, if the robot is equipped with an ability to recognize activities performed by humans in its sensory range. For example in a simple scenario a floor cleaning robot can vacuum the kitchen floor after recognizing human activity “cooking in the kitchen”. Most of the complex human activities can be sub divided into simple activities which can later used for recognize complex activities. Activities like “take meditation” can be sub divided into simple activities like “opening pill container” and “drinking water”. However, even recognizing simple activities are highly challenging due to the similarities between some inter activities and dissimilarities of intra activities which are performed by different people, body poses and orientations. Even a sim- ple human activity like “drinking water” can be performed while the subject is in different body poses like sitting, standing or walking. Therefore building machine learning techniques to recognize human activities with such complexities is non trivial. To address this issue, we propose a human activity recognition technique that uses 3D skeleton features produced by a depth camera. The algorithm incor- porates importance weights for skeleton 3D joints according to the activity being performed. This allows the algorithm to ignore the confusing or irrelevant features while relying on informative features. Later these joints were ensembled together to train Dynamic Bayesian Networks (DBN), which is then used to infer human activi- ties based on likelihoods. The proposed activity recognition technique is tested on a publicly available dataset and UTS experiments with overall accuracies of 85% and 90%
References (16)
- Brand, M., Oliver, N., Pentland, A.: Coupled hidden Markov models for complex action recog- nition. Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition 0, 994-999 (1997). DOI 10.1109/CVPR.1997.609450
- Demiris, Y., Meltzoff, A.: The robot in the crib: A developmental analysis of imitation skills in infants and robots. Infant Child Dev 17(1), 43-53 (2008)
- Dempster, A.P., Laird, N.M., Rubin, D.B.: Maximum likelihood from incomplete data via the em algorithm. JOURNAL OF THE ROYAL STATISTICAL SOCIETY, SERIES B 39(1), 1-38 (1977)
- Hartigan, J.A., Wong, M.A.: A K-means clustering algorithm. Applied Statistics 28, 100-108 (1979)
- J. Stork L. Spinello, J.S.K.O.A.: Audio-based human activity recognition using non- markovian ensemble voting. In: Proc. of IEEE International Symposium on Robot and Human Interactive Communication (RoMan) (2012)
- Lopes, M., Melo, F.S., Montesano, L.: Affordance-based imitation learning in robots. In: IEEE/RSJ International Conference on Intelligent Robots and Systems, pp. 1015-1021. USA (2007)
- Martinez-Contreras, F., Orrite-Urunuela, C., Herrero-Jaraba, E., Ragheb, H., Velastin, S.A.: Recognizing Human Actions Using Silhouette-based HMM. 2009 Sixth IEEE International Conference on Advanced Video and Signal Based Surveillance pp. 43-48 (2009)
- Murphy, K.: Dynamic bayesian networks: Representation, inference and learning. Ph.D. the- sis, UC Berkeley, Computer Science Division (2002)
- Murphy, K.P.: The bayes net toolbox for matlab. Computing Science and Statistics 33, 2001 (2001)
- Nefian, A.V., Liang, L., Pi, X., Liu, X., Murphy, K.: Dynamic Bayesian Networks for Audio- Visual Speech Recognition. EURASIP Journal on Advances in Signal Processing 2002(11), 1274-1288 (2002). DOI 10.1155/S1110865702206083
- Nefian, A.V., Liang, L., Pi, X., Xiaoxiang, L., Mao, C., Murphy, K.: A coupled hmm for audio- visual speech recognition. In: in International Conference on Acoustics, Speech and Signal Processing (CASSP?02, pp. 2013-2016 (2002)
- Piyathilaka, L., Kodagoda, S.: Gaussian mixture based hmm for human activity recognition uisng 3d skeleton features. 8th IEEE Conference on Industrial Electronics and Applications (2013)
- Rabiner, L.: A tutorial on hidden markov models and selected applications in speech recogni- tion. Proceedings of the IEEE 77(2), 257 -286 (1989). DOI 10.1109/5.18626
- Sung, J., Ponce, C., Selman, B., Saxena, A.: Human activity detection from rgbd images. In: Plan, Activity, and Intent Recognition, vol. WS-11-16. AAAI (2011)
- Theodoridis, T., Agapitos, A., Hu, H., Lucas, S.M.: Ubiquitous robotics in physical human action recognition: A comparison between dynamic anns and gp. In: ICRA, pp. 3064-3069. IEEE (2008)
- Wu, Y., Yuan, J., Liu, Z., Wang, J.: Mining actionlet ensemble for action recognition with depth cameras. 2012 IEEE Conference on Computer Vision and Pattern Recognition 0, 1290- 1297 (2012). DOI http://doi.ieeecomputersociety.org/10.1109/CVPR.2012.6247813