2d Image Features Detector and Descriptor Selection Expert System

Ibon Merino

doi:10.5121/CSIT.2019.91206

2019, 8th International Conference on Natural Language Processing (NLP 2019)



11 pages


Abstract

Detection and description of keypoints from an image is a well-studied problem in Computer Vision. Some methods, such as SIFT, SURF or ORB, are computationally very efficient. This paper proposes a solution for a particular case study on object recognition of industrial parts based on hierarchical classification. Reducing the number of instances leads to better performance, which is precisely what hierarchical classification aims for. We demonstrate that this method performs better than using a single method such as ORB, SIFT or FREAK, despite being somewhat slower.

FAQs

What key benefits does the proposed expert system provide for industrial object recognition?

The paper demonstrates that the expert system significantly enhances recognition accuracy, achieving F1 scores of 0.94 compared to 0.845 using traditional methods. This improvement is attributed to hierarchical classification and tailored recognition pipelines for similar object groups.

How does the proposed hierarchical recognition method improve over classical feature-based methods?

The hierarchical method shows superior performance with an average F1 score increase, achieving 0.94 on the proprietary dataset versus 0.843 on the Caltech-101 dataset. This approach effectively clusters objects by recognition behavior, optimizing pipeline selection.

What evaluation technique was employed to assess the performance of recognition pipelines?

The study utilized Leave-One-Out Cross-Validation (LOOCV), conducting |X| iterations to generate a confusion matrix for performance assessment. This method allowed for a detailed score calculation using F1 metrics for each instance.
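As a rough illustration of the LOOCV procedure described above, the following minimal sketch holds out each sample once, predicts its label from the remaining samples, and accumulates a confusion matrix. The function names, the 1-D toy features and the nearest-neighbour recognizer are assumptions made for this example; they stand in for a real detector/descriptor/matcher pipeline and are not taken from the paper.

```python
from collections import defaultdict


def loocv_confusion(samples, labels, recognize):
    """Run |X| leave-one-out iterations and count (true, predicted) pairs.

    `recognize(train_x, train_y, x)` is a hypothetical callable that
    returns a predicted label for the held-out sample `x`.
    """
    confusion = defaultdict(int)
    for i in range(len(samples)):
        # Hold out sample i; everything else acts as the model.
        train_x = samples[:i] + samples[i + 1:]
        train_y = labels[:i] + labels[i + 1:]
        predicted = recognize(train_x, train_y, samples[i])
        confusion[(labels[i], predicted)] += 1
    return dict(confusion)


def nearest_neighbour(train_x, train_y, x):
    """Toy recognizer (assumption): label of the closest 1-D training value."""
    best = min(range(len(train_x)), key=lambda j: abs(train_x[j] - x))
    return train_y[best]


samples = [0.0, 0.1, 0.2, 5.0, 5.1, 5.2]
labels = ["a", "a", "a", "b", "b", "b"]
confusion = loocv_confusion(samples, labels, nearest_neighbour)
```

Each key of `confusion` is a `(true label, predicted label)` pair, so the counts always sum to the number of samples.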

Which industrial recognition methods were compared against the proposed expert system?

The study compared classical methods such as SIFT, SURF, and ORB within diverse recognition pipelines. Notably, the expert system outperformed ORB, which achieved an F1 score of 0.845, demonstrating better recognition performance in similar industrial contexts.

What impact does varying the number of views per instance have on recognition accuracy?

The findings indicate that increasing the number of views improves performance, with the benefit most visible when going from 10 to 50 views per instance. Higher view counts consistently yield better F1 scores across the tested subsets.

Figures (9)

need to be recognized (test). Different kinds of methods can be used to match features but, mainly, distance-based techniques are used. These techniques make use of different distances (L2, Hamming, ...) to find the closest feature to the one that needs to be labeled. Those two features (the test feature and the one closest to it) are considered a match. In order to discard ambiguous features, we use Lowe's ratio test [14] to define whether two features are a 'good match'. Assuming $f_t$ is the feature to be recognized, and $f_{j1}$ and $f_{j2}$ are its two closest features from the model, then $(f_t, f_{j1})$ is a good match if:
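The condition is not reproduced in this caption. In its usual form, Lowe's ratio test accepts the nearest model feature only when its distance to the test feature is smaller than a fixed fraction of the distance to the second-nearest one. The sketch below illustrates that usual form with plain L2 distances; the function names and the threshold of 0.75 are assumptions for this example, and the value used in the paper may differ.

```python
import math


def l2(a, b):
    """Euclidean distance between two equal-length descriptors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def ratio_test_match(test_feature, model_features, ratio=0.75, dist=l2):
    """Return the index of the nearest model feature if it passes the
    ratio test, otherwise None.

    `ratio` is an assumed threshold, not a value taken from the paper.
    """
    if len(model_features) < 2:
        return None  # the test needs two neighbours to compare
    ranked = sorted(range(len(model_features)),
                    key=lambda j: dist(test_feature, model_features[j]))
    j1, j2 = ranked[0], ranked[1]
    d1 = dist(test_feature, model_features[j1])
    d2 = dist(test_feature, model_features[j2])
    # Good match: the nearest neighbour is clearly closer than the runner-up.
    return j1 if d1 < ratio * d2 else None


model = [(0.0, 0.0), (10.0, 0.0), (0.0, 10.0)]
clear = ratio_test_match((1.0, 0.0), model)      # close to one model feature only
ambiguous = ratio_test_match((5.0, 0.0), model)  # equidistant from two features
```

A test feature that lies equally far from two model features is rejected, which is the ambiguity the ratio test is meant to filter out.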

Table 1: Example of a confusion matrix for 3 instances. precision and the recall. The mean of all the $F_1$'s, $\bar{F}_1$ (5), is used for calculating $\Psi^*$.
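To make the scoring concrete, the sketch below derives per-instance precision, recall and $F_1$ from a confusion matrix and then averages the $F_1$ values. The dictionary layout keyed by `(true, predicted)` pairs, the function names and the example counts are assumptions for illustration only; they are not the values of Table 1.

```python
def f1_scores(confusion, classes):
    """Per-class F1 from a confusion matrix keyed by (true, predicted)."""
    scores = {}
    for c in classes:
        tp = confusion.get((c, c), 0)
        predicted_c = sum(confusion.get((t, c), 0) for t in classes)
        actual_c = sum(confusion.get((c, p), 0) for p in classes)
        precision = tp / predicted_c if predicted_c else 0.0
        recall = tp / actual_c if actual_c else 0.0
        denom = precision + recall
        # F1 is the harmonic mean of precision and recall.
        scores[c] = 2 * precision * recall / denom if denom else 0.0
    return scores


def mean_f1(scores):
    """Unweighted mean of the per-class F1 values."""
    return sum(scores.values()) / len(scores)


# Made-up counts for three instances A, B and C (not from the paper).
confusion = {
    ("A", "A"): 2, ("A", "B"): 1,
    ("B", "B"): 3,
    ("C", "C"): 2, ("C", "B"): 1,
}
scores = f1_scores(confusion, ["A", "B", "C"])
average = mean_f1(scores)
```

Under this scheme, candidate pipelines could be ranked by `average`, with the highest value picked as the best one.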

The first step of the hierarchical recognition is to recognize the typology with $\Psi^*$. Given the predicted typology $t$, $\Psi^*_t$ is used to recognize the instance $y$ of the object. We call the hierarchical recognition Y. Figure 2 shows a scheme of the hierarchical recognition for clarification.
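The two-stage scheme can be sketched as a simple dispatch: one recognizer predicts the typology, and the recognizer associated with that typology then predicts the instance. Everything named here (`hierarchical_recognize`, the toy dictionary "images", the typology and instance labels) is hypothetical and only stands in for real recognition pipelines.

```python
from typing import Callable, Dict, Hashable, Tuple

Recognizer = Callable[[dict], Hashable]


def hierarchical_recognize(
    image: dict,
    typology_pipeline: Recognizer,
    instance_pipelines: Dict[Hashable, Recognizer],
) -> Tuple[Hashable, Hashable]:
    """Predict the typology first, then the instance within that typology."""
    typology = typology_pipeline(image)
    # Only the pipeline selected for the predicted typology is run.
    instance = instance_pipelines[typology](image)
    return typology, instance


# Toy stand-ins (assumptions): an "image" is a dict of precomputed attributes.
def by_shape(image: dict) -> str:
    return image["shape"]


per_typology = {
    "round": lambda image: "washer" if image["size"] < 5 else "flange",
    "flat": lambda image: "plate",
}

result = hierarchical_recognize({"shape": "round", "size": 3}, by_shape, per_typology)
```

Because the second stage only has to separate instances within one typology, each per-typology recognizer faces fewer candidate instances than a single flat recognizer would.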

Table 4: Time in seconds that each pipeline needs to recognize a piece.

Table 5: $F_1$'s of the $\Psi^*$'s for each test (Caltech-101). p stands for number of parts and ¢ for number of pictures per part.

Figure 5: 6 random examples of images from the Caltech-101 dataset. The classes are: Face, Leopard, Motorbike, Airplane, Accordion and Anchor.

References (28)

[1] A. E. Abdel-Hakim and A. A. Farag. CSIFT: A SIFT descriptor with color invariant characteristics. In 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06), volume 2, pages 1978-1983. IEEE, 2006.
[2] A. Alahi, R. Ortiz, and P. Vandergheynst. FREAK: Fast Retina Keypoint. In 2012 IEEE Conference on Computer Vision and Pattern Recognition, pages 510-517. IEEE, June 2012.
[3] H. Bay, T. Tuytelaars, and L. Van Gool. SURF: Speeded Up Robust Features. In Computer Vision - ECCV 2006, pages 404-417. 2006.
[4] M. Calonder, V. Lepetit, C. Strecha, and P. Fua. BRIEF: Binary robust independent elementary features. In European Conference on Computer Vision, pages 778-792, 2010.
[5] L. Fei-Fei, R. Fergus, and P. Perona. Learning generative visual models from few training examples: An incremental Bayesian approach tested on 101 object categories. Computer Vision and Image Understanding, 106(1):59-70, 2007.
[6] C. Goutte and E. Gaussier. A probabilistic interpretation of precision, recall and F-score, with implication for evaluation. In European Conference on Information Retrieval, pages 345-359, 2005.
[7] M. Heikkilä, M. Pietikäinen, and C. Schmid. Description of interest regions with local binary patterns. Pattern Recognition, 42(3):425-436, March 2009.
[8] P. Indyk and R. Motwani. Approximate nearest neighbors: Towards removing the curse of dimensionality. In Proceedings of the Thirtieth Annual ACM Symposium on Theory of Computing.
[9] Y. Ke and R. Sukthankar. PCA-SIFT: A more distinctive representation for local image descriptors. In Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2004), volume 2, pages 506-513. IEEE, 2004.
[10] R. Kohavi. A study of cross-validation and bootstrap for accuracy estimation and model selection. In IJCAI, volume 14, pages 1137-1145. Montreal, Canada, 1995.
[11] S. Leutenegger, M. Chli, and R. Y. Siegwart. BRISK: Binary Robust Invariant Scalable Keypoints. In 2011 International Conference on Computer Vision, pages 2548-2555, November 2011.
[12] W. Liao. Region Description Using Extended Local Ternary Patterns. In 2010 20th International Conference on Pattern Recognition, pages 1003-1006, August 2010.
[13] D. G. Lowe. Object recognition from local scale-invariant features. In Proceedings of the Seventh IEEE International Conference on Computer Vision, volume 2, pages 1150-1157, September 1999.
[14] D. G. Lowe. Distinctive Image Features from Scale-Invariant Keypoints. International Journal of Computer Vision, 60(2):91-110, November 2004.
[15] J. MacQueen. Some methods for classification and analysis of multivariate observations. In Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, Volume 1: Statistics, pages 281-297. University of California Press, 1967.
[16] E. Mair, G. D. Hager, D. Burschka, M. Suppa, and G. Hirzinger. Adaptive and generic corner detection based on the accelerated segment test. In European Conference on Computer Vision, pages 183-196. Springer, 2010.
[17] K. Mikolajczyk and C. Schmid. A performance evaluation of local descriptors. IEEE Transactions on Pattern Analysis and Machine Intelligence, 27(10):1615-1630, October 2005.
[18] M. Muja and D. G. Lowe. Fast approximate nearest neighbors with automatic algorithm configuration. VISAPP, 2(331-340):2, 2009.
[19] S. Murala, R. P. Maheshwari, and R. Balasubramanian. Local Tetra Patterns: A New Feature Descriptor for Content-Based Image Retrieval. IEEE Transactions on Image Processing, 21(5):2874-2886, May 2012.
[20] L. Nanni, S. Brahnam, and A. Lumini. A local approach based on a local binary patterns variant texture descriptor for classifying pain states. Expert Systems with Applications, 37(12):7888-7894, 2010.
[21] T. Ojala, M. Pietikäinen, and D. Harwood. A comparative study of texture measures with classification based on featured distributions. Pattern Recognition, 29(1):51-59, January 1996.
[22] M. Pietikäinen, A. Hadid, G. Zhao, and T. Ahonen. Local Binary Patterns for Still Images. In Computer Vision Using Local Binary Patterns, Computational Imaging and Vision, pages 13-47. Springer London, 2011.
[23] J. T. Robinson. The K-D-B-tree: A search structure for large multidimensional dynamic indexes. In Proceedings of the 1981 ACM SIGMOD International Conference on Management of Data, SIGMOD '81, pages 10-18, 1981.
[24] E. Rosten and T. Drummond. Fusing points and lines for high performance tracking. In Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1, pages 1508-1515 Vol. 2. IEEE, 2005.
[25] E. Rublee, V. Rabaud, K. Konolige, and G. Bradski. ORB: An efficient alternative to SIFT or SURF. In 2011 International Conference on Computer Vision, pages 2564-2571, November 2011.
[26] E. Tola, V. Lepetit, and P. Fua. DAISY: An Efficient Dense Descriptor Applied to Wide-Baseline Stereo. IEEE Transactions on Pattern Analysis and Machine Intelligence, 32(5):815-830, May 2010.
[27] L. Wang and D. C. He. Texture classification using texture spectrum. Pattern Recognition, 23(8):905-910, 1990.
