Evaluation of Sift and Surf for Vision Based Localization
2016, The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences
https://doi.org/10.5194/ISPRSARCHIVES-XLI-B3-685-2016Abstract
Vision based localization is widely investigated for the autonomous navigation and robotics. One of the basic steps of vision based localization is the extraction of interest points in images that are captured by the embedded camera. In this paper, SIFT and SURF extractors were chosen to evaluate their performance in localization. Four street view image sequences captured by a mobile mapping system, were used for the evaluation and both SIFT and SURF were tested on different image scales. Besides, the impact of the interest point distribution was also studied. We evaluated the performances from for aspects: repeatability, precision, accuracy and runtime. The local bundle adjustment method was applied to refine the pose parameters and the 3D coordinates of tie points. According to the results of our experiments, SIFT was more reliable than SURF. Apart from this, both the accuracy and the efficiency of localization can be improved if the distribution of feature points are well constrained for SIFT.
References (35)
- Agarwal, S., Mierle, K. and Others, n.d. Ceres solver. http: //ceres-solver.org.
- Ballesta, M., Gil, A., Reinoso, O. and Mozos, O. M., 2007. Eval- uation of interest point detectors for visual slam. International Journal of Factory Automation, Robotics and Soft Computing 4, pp. 86-95.
- Bay, H., Ess, A., Tuytelaars, T. and Van Gool, L., 2008. Speeded- up robust features (surf). Computer vision and image understand- ing 110(3), pp. 346-359.
- Bellavia, F., Fanfani, M. and Colombo, C., 2015. Selective vi- sual odometry for accurate auv localization. Autonomous Robots pp. 1-11.
- Cvii, I. and Petrovi, I., 2015. Stereo odometry based on careful feature selection and tracking. In: Mobile Robots (ECMR), 2015
- European Conference on, pp. 1-6.
- Davison, A., 2003. Real-time simultaneous localisation and map- ping with a single camera. In: Computer Vision, 2003. Proceed- ings. Ninth IEEE International Conference on, pp. 1403-1410 vol.2.
- Eudes, A. and Lhuillier, M., 2009. Error propagations for local bundle adjustment. In: Computer Vision and Pattern Recognition, 2009. CVPR 2009. IEEE Conference on, IEEE, pp. 2411-2418.
- Gauglitz, S., Höllerer, T. and Turk, M., 2011. Evaluation of in- terest point detectors and feature descriptors for visual tracking. International journal of computer vision 94(3), pp. 335-360.
- Grabner, M., Grabner, H. and Bischof, H., 2006. Fast approxi- mated sift. In: Computer Vision-ACCV 2006, Springer, pp. 918- 927.
- Jiang, Y., Xu, Y. and Liu, Y., 2013. Performance evaluation of feature detection and matching in stereo visual odometry. Neuro- computing 120, pp. 380-390.
- Juan, L. and Gwun, O., 2009. A comparison of sift, pca-sift and surf. International Journal of Image Processing (IJIP) 3(4), pp. 143-152.
- Khan, N. Y., McCane, B. and Wyvill, G., 2011. Sift and surf performance evaluation against various image deformations on benchmark dataset. In: Digital Image Computing Techniques and Applications (DICTA), 2011 International Conference on, IEEE, pp. 501-506.
- Kitt, B., Geiger, A. and Lategahn, H., 2010. Visual odometry based on stereo image sequences with ransac-based outlier re- jection scheme. In: Intelligent Vehicles Symposium (IV), 2010 IEEE, pp. 486-492.
- Kneip, L., Scaramuzza, D. and Siegwart, R., 2011. A novel parametrization of the perspective-three-point problem for a di- rect computation of absolute camera position and orientation. In: Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on, IEEE, pp. 2969-2976.
- Lowe, D. G., 2004. Distinctive image features from scale- invariant keypoints. International journal of computer vision 60(2), pp. 91-110.
- Luxen, M., 2003. Variance component estimation in performance characteristics applied to feature extraction procedures. In: Pat- tern Recognition, Springer, pp. 498-506.
- Mikolajczyk, K. and Schmid, C., 2005. A performance evaluation of local descriptors. Pattern Analysis and Machine Intelligence, IEEE Transactions on 27(10), pp. 1615-1630.
- Moulon, P., Monasse, P. and Marlet, R., 2013. Adaptive structure from motion with a contrario model estimation. In: Computer Vision-ACCV 2012, Springer, pp. 257-270.
- Mouragnon, E., Lhuillier, M., Dhome, M., Dekeyser, F. and Sayd, P., 2006. Real time localization and 3d reconstruction. In: Com- puter Vision and Pattern Recognition, 2006 IEEE Computer So- ciety Conference on, Vol. 1, IEEE, pp. 363-370.
- Muja, M. and Lowe, D. G., 2009. Fast approximate nearest neigh- bors with automatic algorithm configuration. In: International Conference on Computer Vision Theory and Application VISS- APP'09), INSTICC Press, pp. 331-340.
- Murillo, A. C., Guerrero, J. J. and Sagues, C., 2007. Surf features for efficient robot localization with omnidirectional images. In: Robotics and Automation, 2007 IEEE International Conference on, pp. 3901-3907.
- Nistér, D., 2004. An efficient solution to the five-point relative pose problem. Pattern Analysis and Machine Intelligence, IEEE Transactions on 26(6), pp. 756-770.
- Nistér, D., Naroditsky, O. and Bergen, J., 2004. Visual odometry. In: Computer Vision and Pattern Recognition, 2004. CVPR 2004. Proceedings of the 2004 IEEE Computer Society Conference on, Vol. 1, IEEE, pp. I-652.
- Paparoditis, N., Papelard, J.-P., Cannelle, B., Devaux, A., Soheil- ian, B., David, N. and HOUZAY, E., 2012. Stereopolis ii: A multi-purpose and multi-sensor 3d mobile mapping system for street visualisation and 3d metrology. Revue franc ¸aise de pho- togrammétrie et de télédétection (200), pp. 69-79.
- Qu, X., Soheilian, B. and Paparoditis, N., 2015. Vehicle local- ization using mono-camera and geo-referenced traffic signs. In: 2015 IEEE Intelligent Vehicles Symposium (IV), pp. 605-610.
- Saleem, S., Bais, A. and Sablatnig, R., 2012. A performance evaluation of sift and surf for multispectral image matching. In: Image Analysis and Recognition, Springer, pp. 166-173.
- Schmid, C., Mohr, R. and Bauckhage, C., 2000. Evaluation of interest point detectors. International Journal of computer vision 37(2), pp. 151-172.
- Schmidt, A., Kraft, M. and Kasiński, A., 2010. An evaluation of image feature detectors and descriptors for robot navigation. In: Computer Vision and Graphics, Springer, pp. 251-259.
- Se, S., Lowe, D. and Little, J., 2001. Vision-based mobile robot localization and mapping using scale-invariant features. In: Robotics and Automation, 2001. Proceedings 2001 ICRA. IEEE International Conference on, Vol. 2, pp. 2051-2058 vol.2.
- Strasdat, H., Montiel, J. M. and Davison, A. J., 2012. Visual slam: why filter? Image and Vision Computing 30(2), pp. 65-77.
- Triggs, B., McLauchlan, P. F., Hartley, R. I. and Fitzgibbon, A. W., 2000. Bundle adjustmenta modern synthesis. In: Vision algorithms: theory and practice, Springer, pp. 298-372.
- Valgren, C. and Lilienthal, A. J., 2010. Sift, surf & sea- sons: Appearance-based long-term localization in outdoor envi- ronments. Robot. Auton. Syst. 58(2), pp. 149-156.
- Yang, Y., Song, Y., Zhai, F., Fan, Z., Meng, Y. and Wang, J., 2009. A high-precision localization algorithm by improved sift key-points. In: Image and Signal Processing, 2009. CISP '09. 2nd International Congress on, pp. 1-6.
- Zhang, Z., Deriche, R., Faugeras, O. and Luong, Q.-T., 1995. A robust technique for matching two uncalibrated images through the recovery of the unknown epipolar geometry. Artificial intelli- gence 78(1), pp. 87-119.