Academia.eduAcademia.edu

Scale Invariant Feature Transform

description3,615 papers
group128 followers
lightbulbAbout this topic
Scale Invariant Feature Transform (SIFT) is an algorithm used in computer vision to detect and describe local features in images. It identifies keypoints that are invariant to scale and rotation, enabling robust matching between different images despite changes in viewpoint or illumination.
lightbulbAbout this topic
Scale Invariant Feature Transform (SIFT) is an algorithm used in computer vision to detect and describe local features in images. It identifies keypoints that are invariant to scale and rotation, enabling robust matching between different images despite changes in viewpoint or illumination.

Key research themes

1. How can local features be designed to achieve robust scale and rotation invariance for object recognition and image matching?

This research area focuses on the development and improvement of local image features that remain invariant under changes in scale, rotation, translation, illumination, and partial affine projection, enabling robust object recognition and image matching in cluttered or complex scenes. Achieving such invariance is critical as it allows recognition systems to reliably identify objects regardless of viewing conditions and occlusions.

Key finding: Introduced the seminal Scale Invariant Feature Transform (SIFT) framework where local features are detected as maxima/minima of Difference-of-Gaussians in scale space, providing invariance to translation, scale, and rotation,... Read more
Key finding: Proposed SURF, a detector and descriptor that approximates or outperforms SIFT in terms of repeatability, distinctiveness, and robustness to scale and rotation, while achieving significantly faster computation through... Read more
Key finding: Provided a detailed theoretical foundation and implementation of SIFT including detection of interest points from scale-space extrema of Difference-of-Gaussians approximating the scale-normalized Laplacian, which is proven to... Read more
Key finding: Analyzed how varying the parameter controlling the number of sublevels per octave in the SIFT algorithm affects the number of detected keypoints and matching performance. Found that increasing this parameter up to an optimal... Read more

2. What advances have been made in incorporating scale invariance within convolutional neural networks for feature learning and recognition?

This theme explores novel methods to embed local scale invariance directly into convolutional neural networks (CNNs), addressing the challenge that vanilla CNNs lack intrinsic scale equivariance, thus requiring data-intensive learning of multiple scale filters. Such embedded scale invariance reduces overfitting, improves data efficiency, and enhances robustness to scale variations in visual classification tasks.

Key finding: Proposed Scale-Invariant CNNs (SI-ConvNets) that apply filters at multiple scales within each convolutional layer combined with max-pooling over scales to produce locally scale-invariant feature representations without... Read more
Key finding: Introduced a novel scale-steerable filter basis, the log-radial harmonics, enabling filters to be steered exactly in scale via linear combinations of basis functions within CNNs. The resulting scale-steered CNNs (SS-CNN)... Read more

3. How can scale invariance be extended and utilized in hyperspectral image analysis and remote sensing applications?

This area investigates the extraction of scale-invariant features from hyperspectral and multispectral images, which comprise multiple spectral bands or channels. Such features are essential for robust image registration, matching, and classification under varying spectral and spatial conditions, including different illumination, sensor characteristics, and object materials. Integrating spatial and spectral invariance improves the robustness and accuracy of remote sensing and agricultural monitoring systems.

Key finding: Examined multi-scale segmentation and feature descriptors for remote sensing image classification, revealing that using multiple scales improves classification accuracy but scale importance varies and descriptors exhibit... Read more
Key finding: Conducted an extensive evaluation of three different SIFT and RANSAC-based multispectral image registration techniques applied to close-range agricultural images with crops and weeds. Found the highest accuracy occurred when... Read more

All papers in Scale Invariant Feature Transform

In digital image manipulation the images are converted into desired image by various tools. Detection of such kind of manipulation with digital images especially the cloning is the main area of this work. Earlier the cloning detection is... more
Human face recognition is one of the research areas in the current era of the research. It is one the widely used biometric technique for identification and verification of the human face. There are many challenges to face recognition... more
Avoiding false-positive recognition of objects is a topical problem for specific areas, such as detecting traffic signs for visually impaired pedestrians, fire emergency signs inside buildings, and construction safety signs. Existing... more
This paper presents a method for extracting distinctive invariant features from ear images that can be used to perform reliable matching between different views of an ear. It shows the extraction of features from an ear image and also the... more
We introduce and compare two algorithms related to ego-motion, applicable to a robot using a panoramic visual sensor in an unknown environment. The first method, computationally cheap, extends a family of bio-inspired navigation systems... more
IntelliVision is a production-ready platform for video analytics and face authentication supporting people counting, vehicle counting and automatic number plate recognition (ANPR), parking analysis, emergency flow detection, lobby/crowd... more
Excessive amounts of image spam cause many problems to e-mail users. Since image spam is difficult to detect using conventional text-based spam approach, various image processing techniques have been proposed. In this paper, we present an... more
We present a two-stage, geometry-aware approach for matching SIFT-like features in a fast and reliable manner. Our approach first uses a small sample of features to estimate the epipolar geometry between the images and leverages it for... more
Vision based localization is widely investigated for the autonomous navigation and robotics. One of the basic steps of vision based localization is the extraction of interest points in images that are captured by the embedded camera. In... more
Remote sensing of the structural and spectral traits of vegetation is being transformed by structure from motion (SFM) algorithms that combine overlapping images to produce three-dimensional (3D) red-green-blue (RGB) point clouds.... more
The new advance in photogrammetry using the automatic procedures such as the famous algorithm which was proposed by David Lowe (Lowe, 2004) features descriptors and matching (SIFT) and then the recent development of external orientation... more
Facial feature tracking is a key component of imaging ballistocardiography (BCG) where accurate quantification of the displacement of facial keypoints is needed for good heart rate estimation. Skin feature tracking enables video-based... more
Detection of objects in cluttered scenes is a basic challenge that has only recently been widely undertaken by computer vision systems. This paper proposes a novel method how to detect a particular object in cluttered scenes, given a... more
We consider the image retrieval problem of finding the images in a dataset that are most similar to a query image. Our goal is to reduce the number of vector operations and memory for performing a search without sacrificing accuracy of... more
Distinguishing shot boundaries and Gradual shot change happens to be critical research area in the field of video retrieval, summarization and segmentation. Identification of Video shot boundaries is generally a significant and first step... more
Detecting shot boundary and gradual shot change happens to the prominent research problem in the field of video retrieval or indexing. Video shot boundary detection (SBD) is commonly an important and first step for indexing, and... more
Many fundamental importance questions of Internet behavior are still remained unexplored. This paper considers the basic engineering problem "on which length scale time-invariant traffic characteristics become visible, or do TCP attractor... more
We propose a contextual framework for 2D image matching and registration using an ensemble feature. Our system is beneficial for registering image pairs that have captured the same scene but have large visual discrepancies between them.... more
COntains research objectives.U. S. Air Force (Electronic Systems Division) under Contract AF 19(628)-3325National Science Foundation (Grant G-16526)National Institutes of Health (Grant MH-04737-03)National Institutes of Health (Grant... more
This paper proposes an efficient approach known as F-SIFT, for unconstrained iris recognition. The acquired iris image is preprocessed to find the annular region underlying inner and outer iris boundary. From the annular iris image,... more
Iris is the Optimum Biometric-trait present in Biometrics Security. Our emphasis on this paper is to obtain efficient, fast and robust algorithm set for iris detection. There are number of algorithms proposed for the efficient result but... more
This paper proposes an efficient three fold stratified SIFT matching for iris recognition. The objective is to filter wrongly paired conventional SIFT matches. In Strata I, the keypoints from gallery and probe iris images are paired using... more
Automatic image annotation is to associate each image a set of keywords and describing the visual content of the image using an automatic system without any human intervention, many approaches have been proposed for the realization of... more
Facial feature tracking is a key component of imaging ballistocardiography (BCG) where accurate quantification of the displacement of facial keypoints is needed for good heart rate estimation. Skin feature tracking enables video-based... more
SIFT-GSI is proposed to establish accurate SIFT feature correspondences for IR face recognition. A smooth spatial mapping function for the underlying correct matches is estimated by using GSI. The proposed method can establish accurate... more
In digital images, edges can be detected by applying mathematical techniques in order to identify the locations in which the brightness of an image unexpectedly changes. Edges are generally curved lines segmented into sharp adjustments in... more
Stereo vision is one of the interesting research topics in the computer vision field. Two cameras are used to generate a disparity map, resulting in the depth estimation. Camera calibration is the most important step in stereo vision. The... more
Background: Soybean, a vital global oilseed crop, faces significant challenges from pest infestations and related damage that compromise productivity. Early and accurate detection of pest damage is critical for sustainable agriculture.... more
Feature analysis is the extraction and comparison of signals from multimedia data, which can subsequently be semantically analyzed. Feature analysis is the foundation of many multimedia computing tasks such as object recognition, image... more
In this paper, a robust hybrid watermarking method based on discrete wavelet transform (DWT), discrete cosine transform (DCT), and scale-invariant feature transformation (SIFT) is proposed. Indeed, it is of prime interest to develop... more
Image Forgery means manipulation of digital image to conceal meaningful information of the image. The detection of forged image is driven by need of authenticity and to maintain integrity of an image. A copy move forgery detection theme... more
The need for automatic understanding and examination of data increased with the tremendous growth of video and imaging databases. The change of identity, feelings and attitudes of a person's face always play a key role in terms of... more
Ahuna Mons is a 4 km particular geologic feature on the surface of Ceres, of possibly cryovolcanic origin. The special characteristics of Ahuna Mons are also interesting in regard of its surrounding area, especially for the big crater... more
Hyperspectral images can offer a lot of clarity by blending both spectral and spatial data. Details are for the researcher. A multidimensional paper in this paper, the hyperspectral image mosaic solution, was suggested to properly... more
This paper introduces a new feature-based image registration algorithm which registers images by finding rotation and scale invariant features and matches them using an evidence accumulation process based on the Generalized Hough... more
Log-phase K562 cells (2 ϫ 10 7 ) were electroporated (260 V, 960 F; Bio-Rad Gene Pulser with capacitance extender) with 50 g of ScaI linearized p␥͞GFP͞3MRE and 5 g of supercoiled pCMV͞CD20 plasmid in 0.4 ml of RPMI 1640 medium. Thirty-six... more
The automatic detection and tracking of human body parts in color images is highly sensitive to appearance features such as illumination, skin color and clothes. As a result, the use of depth images has been shown to be an attractive... more
In Image processing, mosaicing image is the combine two or more images of the same scene into one image. Image stitching "mosaicing" is the process of assembling images of the same scene into a large image. Image... more
Hyperspectral images (HI) and Light Detection and Ranging (LiDAR) provide high resolution radiometric and geometric information for monitoring forests at individual tree crown (ITC) level. It has many important applications for... more
Vehicle detection and classification are the most significant and challenging activities of an intelligent traffic monitoring system. Traditional methods are highly computationally expensive and also impose restrictions when the mode of... more
The proposed system is a portable camera based visual assistance prototype for blind people to identify currency notes and also helps them to read printable texts from the handheld objects. To read printable texts, an efficient algorithm... more
This paper proposes a novel hardware design method of scale-invariant feature transform (SIFT) algorithm for implementation on field-programmable gate array (FPGA). To reduce the computing costs, Gaussian kernels are calculated offline... more
Now a days sharing food related photos on social media has become a trend and people are looking for the interested food dishes and the restaurants. So detecting the food items, classifying them and analyzing have been the topic of... more
We propose in this paper a novel multimodal approach to automatically predict the visual concepts of images through an effective fusion of visual and textual features. It relies on a Selective Weighted Late Fusion (SWLF) scheme which, in... more
Synthesizing the image of a 3-D scene as it would be captured by a camera from an arbitrary viewpoint is a central problem in Computer Graphics. Given a complete 3-D model, it is possible to render the scene from any viewpoint. The... more
In this paper, we have proposed an efficient and computationally inexpensive approach toward two mainstreams of image recognition, i.e, face recognition and person identification. Our proposed model is invariant to pose, expression, scale,
In this paper we propose a method for person identification. The proposed method is invariant to illumination, scale, pose, camera exposure and translation of the head. In order to make the model illumination invariant, a linear transform... more
In this paper, we have proposed an efficient and computationally inexpensive approach toward two mainstreams of image recognition, i.e, face recognition and person identification. Our proposed model is invariant to pose, expression, scale,
The paper proposes an efficient and accurate model for face recognition using an attentive local feature descriptor extracted from Convolutional Neural Network referred to as DEep Local Feature (DELF). The algorithm mentioned formerly is... more
Download research papers for free!