Sparse Representation For Image Prediction
2007
https://doi.org/10.5281/ZENODO.40459…
5 pages
Abstract
Publication in the conference proceedings of EUSIPCO, Poznan, Poland, 2007
Related papers
Signal Processing, 2015
Motivated by the fact that signals tend to have a representation biased towards their own classes, we propose a novel Sparse Representation-based Classifier (SRC) named the Class-Specific Sparse Representation-based Classifier (CSSRC), which incorporates class information into the representation learning. Unlike conventional SRC algorithms, CSSRC defines each class as a group and then impels these groups to compete to represent the test sample. To achieve this property, CSSRC imposes an L1-norm constraint on the classes to forcibly select the most relevant ones, and introduces an L2-norm constraint on the samples belonging to the same class to ensure that all homogeneous samples are sufficiently exploited for representation. Since CSSRC is a typical structured sparse representation problem, it can be solved efficiently by convex optimization. Seven popular visual and audio signal databases are employed for evaluation. The results demonstrate its effectiveness in comparison with state-of-the-art classifiers.
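The class-wise competition described above is essentially a group-lasso fit followed by a minimum-residual decision. A minimal sketch of that idea, not the authors' implementation: the function names and the ISTA-style solver are illustrative assumptions.

```python
import numpy as np

def group_soft_threshold(x, t):
    """Block soft-thresholding: shrinks a whole coefficient group toward zero."""
    norm = np.linalg.norm(x)
    if norm <= t:
        return np.zeros_like(x)
    return (1 - t / norm) * x

def cssrc_predict(X_groups, y, lam=0.1, n_iter=200):
    """Classify y by a group-sparse representation over class-wise training
    blocks X_groups (list of d x n_c arrays), then pick the class whose block
    explains y with the smallest residual. Solved here with proximal gradient
    (ISTA) on the group-lasso objective."""
    X = np.hstack(X_groups)
    sizes = [g.shape[1] for g in X_groups]
    bounds = np.cumsum([0] + sizes)
    a = np.zeros(X.shape[1])
    L = np.linalg.norm(X, 2) ** 2          # Lipschitz constant of the gradient
    for _ in range(n_iter):
        grad = X.T @ (X @ a - y)
        z = a - grad / L
        for k in range(len(sizes)):
            s = slice(bounds[k], bounds[k + 1])
            a[s] = group_soft_threshold(z[s], lam / L)
    residuals = [np.linalg.norm(y - X_groups[k] @ a[bounds[k]:bounds[k + 1]])
                 for k in range(len(sizes))]
    return int(np.argmin(residuals))
```

The L1 norm over group norms forces whole classes in or out of the representation, while the L2 norm within a group spreads the weight over all homogeneous samples, which is the structure the abstract describes.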
IJCV, 1996
We study the problem of how to detect "interesting objects" appearing in a given image, I. Our approach is to treat it as a function approximation problem based on an over-redundant basis, and also to account for occlusions, where the basis superposition principle is no longer valid. Since the basis (a library of image templates) is over-redundant, there are infinitely many ways to decompose I. We are motivated to select a sparse/compact representation of I, and to account for occlusions and noise. We then study a greedy and iterative "weighted Lp Matching Pursuit" strategy, with 0 < p < 1. We use an Lp result to compute a solution, i.e., to select the best template at each stage of the pursuit.
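The greedy pursuit strategy referenced here builds on the plain Matching Pursuit of Mallat and Zhang. A minimal unweighted sketch (illustrative names; standard L2 correlations rather than the paper's weighted Lp selection):

```python
import numpy as np

def matching_pursuit(D, y, n_atoms=5):
    """Greedy matching pursuit: at each step pick the dictionary atom most
    correlated with the residual, accumulate its coefficient, and subtract
    the atom's contribution. D is a d x K dictionary with unit-norm columns."""
    residual = y.astype(float).copy()
    coef = np.zeros(D.shape[1])
    for _ in range(n_atoms):
        corr = D.T @ residual                 # correlations with the residual
        k = int(np.argmax(np.abs(corr)))      # best-matching atom
        coef[k] += corr[k]
        residual -= corr[k] * D[:, k]
    return coef, residual
```

For an orthonormal dictionary the pursuit recovers the exact expansion; for over-redundant dictionaries it gives the greedy sparse approximation the abstract refers to.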
2012
Sparse representations account for most or all of the information of a signal by a linear combination of a few elementary signals called atoms, and have increasingly become recognized as providing high performance for applications as diverse as noise reduction, compression, inpainting, compressive sensing, pattern classification, and blind source separation. In this dissertation, we learn the sparse representations of high-dimensional signals for various learning and vision tasks, including image classification, single-image super-resolution, compressive sensing, and graph learning. Based on the bag-of-features (BoF) image representation in a spatial pyramid, we first transform each local image descriptor into a sparse representation; these sparse representations are then summarized into a fixed-length feature vector over different spatial locations across different spatial scales by max pooling. The proposed generic image feature representation properly handles the large in-class variance problem in image classification, and experiments on object recognition, scene classification, face recognition, gender recognition, and handwritten digit recognition all lead to state-of-the-art performance on the benchmark datasets. We cast the image super-resolution problem as one of recovering a high-resolution image patch for each low-resolution image patch, based on recent sparse signal recovery theories, which state that, under mild conditions, a high-resolution signal can be recovered from its low-resolution version if the signal has a sparse representation in terms of some dictionary. We jointly learn the dictionaries for high- and low-resolution image patches and enforce them to share common sparse representations for better recovery. Furthermore, we employ image features and enforce patch-overlapping constraints to improve prediction accuracy. Experiments show that the algorithm leads to surprisingly good results.
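The max-pooling step over a spatial pyramid can be sketched as follows; this is a simplified illustration, not the dissertation's code, and the grid levels and function names are assumptions.

```python
import numpy as np

def spatial_pyramid_pool(codes, positions, levels=(1, 2, 4)):
    """Max-pool sparse codes over a spatial pyramid: at each level the image
    is split into level x level cells, each cell keeps the element-wise max
    of the codes falling in it, and all cell vectors are concatenated into
    one fixed-length feature.
    codes: n x K sparse codes; positions: n x 2 coordinates in [0, 1)."""
    feats = []
    for L in levels:
        cells = np.zeros((L, L, codes.shape[1]))
        idx = np.minimum((positions * L).astype(int), L - 1)  # cell indices
        for (r, c), code in zip(idx, codes):
            cells[r, c] = np.maximum(cells[r, c], code)
        feats.append(cells.reshape(-1))
    return np.concatenate(feats)
```

The result has length K * (1 + 4 + 16) for the default three levels, i.e., a fixed-length vector regardless of how many local descriptors the image produced.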
Graph construction is critical for graph-oriented algorithms designed for the purposes of data clustering, subspace learning, and semi-supervised learning. We model the graph construction problem, including neighbor selection and …
In this paper, the application of sparse representation (factorization) of signals over an overcomplete basis (dictionary) to signal classification is discussed. Searching for the sparse representation of a signal over an overcomplete dictionary is achieved by optimizing an objective function that includes two terms: one that measures the signal reconstruction error and another that measures the sparsity. This objective function works well in applications where signals need to be reconstructed, such as coding and denoising. On the other hand, discriminative methods, such as linear discriminant analysis (LDA), are better suited for classification tasks. However, discriminative methods are usually sensitive to corruption in signals because they lack the crucial properties needed for signal reconstruction. In this paper, we present a theoretical framework for signal classification with sparse representation. The approach combines the discrimination power of discriminative methods with the reconstruction property and the sparsity of the sparse representation, which enables one to deal with signal corruptions: noise, missing data, and outliers. The proposed approach is therefore capable of robust classification with a sparse representation of signals. The theoretical results are demonstrated on signal classification tasks, showing that the proposed approach outperforms the standard discriminative methods and the standard sparse representation in the case of corrupted signals.
The sparse representation-based classification algorithm has been used for human face recognition, but the image database was restricted to frontal faces with only slight illumination and expression changes, and cropping and normalization of the faces had to be done beforehand. This paper uses a sparse representation-based algorithm for generic image classification with intra-class variations and background clutter. A hierarchical framework based on the sparse representation is developed that flexibly combines different global and local features. Experiments with the hierarchical framework on 25 object categories selected from the Caltech101 dataset show that exploiting the advantage of local features within the hierarchical framework improves classification performance, and that the framework is robust to image occlusions, background clutter, and viewpoint changes.
The goal in sparse coding is to seek a linear basis representation where each image is represented by a small number of active coefficients. The learning algorithm involves adapting a basis vector set while imposing a low-entropy, or sparse, prior on the output coefficients. Sparse coding applied to natural images has been shown to extract wavelet-like structure [9, 4]. However, our experience in using sparse coding for extracting multi-scale structure in object-specific ensembles, such as face images or images of a gesturing hand, has been negative. In this paper we highlight three points about the reliability of sparse coding for extracting the desired structure: (1) using an overcomplete representation, (2) projecting data into a low-dimensional subspace before attempting to resolve the sparse structure, and (3) applying the sparsity constraint on the basis elements, as opposed to the output coefficients.
Sparse representations with learned dictionaries have been successful in several image analysis applications. In this paper, we propose and analyze the framework of ensemble sparse models, and demonstrate their utility in image restoration and unsupervised clustering. The proposed ensemble model approximates the data as a linear combination of approximations from multiple weak sparse models. Theoretical analysis of the ensemble model reveals that even in the worst case, the ensemble can perform better than any of its constituent individual models. The dictionaries corresponding to the individual sparse models are obtained using either random example selection or boosted approaches. Boosted approaches learn one dictionary per round such that the dictionary learned in a particular round is optimized for the training examples having high reconstruction error in the previous round. Results with compressed recovery show that the ensemble representations lead to better performance compared to using a single dictionary obtained with the conventional alternating minimization approach. The proposed ensemble models are also used for single-image super-resolution, and we show that they perform comparably to recent approaches. In unsupervised clustering, experiments show that the proposed model performs better than baseline approaches on several standard datasets.
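The ensemble idea above rests on combining approximations from several weak sparse models. A toy sketch with uniform weights (hypothetical function names; the paper's boosted weighting is replaced here by a plain average):

```python
import numpy as np

def weak_sparse_approx(D, y, n_atoms=3):
    """One weak model: greedy OMP-style approximation using a few atoms of
    dictionary D (d x K, unit-norm columns)."""
    residual = y.astype(float).copy()
    support = []
    for _ in range(n_atoms):
        k = int(np.argmax(np.abs(D.T @ residual)))   # most correlated atom
        if k not in support:
            support.append(k)
        sub = D[:, support]
        coef, *_ = np.linalg.lstsq(sub, y, rcond=None)  # refit on support
        residual = y - sub @ coef
    return sub @ coef

def ensemble_approx(dictionaries, y, n_atoms=3):
    """Ensemble sparse model: combine the approximations produced by several
    weak dictionaries, here with uniform weights."""
    approx = [weak_sparse_approx(D, y, n_atoms) for D in dictionaries]
    return np.mean(approx, axis=0)
```

Averaging several weak approximations hedges against any single poorly chosen dictionary, which is the intuition behind the worst-case guarantee the abstract mentions.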
2018
We consider the problem of classifier design via sparse representation based on a constrained subspace model. We argue that the data points in the linear span of the training samples should be constrained in order to yield a more accurate approximation to the corresponding data manifold. For this purpose, the constrained set of data points is formulated as a union of affine subspaces in the form of affine hulls spanned by training samples. We argue that the intrinsic dimension of the affine subspaces should equal that of the data manifold. Thus, a classifier based on this model attains a high classification accuracy similar to that of the conceptual NM (Nearest Manifold) classifier. Based on this model, we connect the dots among several classical classifiers, including NN (Nearest Neighbor), NFL (Nearest Feature Line), NS (Nearest Subspace), and the recently emerged state-of-the-art SRC (Sparse Representation Classifier), and interpret the mechanism of SRC and Yang's variant of SRC from the constrained subspace perspective. Experiments on the Extended Yale B database for image classification corroborate our claims and demonstrate that the proposed classifier, called NCSC-CSR, achieves higher classification accuracy and robustness.
Computing Research Repository, 2008
In this paper an extension of the sparse decomposition problem is considered and an algorithm for solving it is presented. In this extension, it is known that one of the shifted versions of a signal s (not necessarily the original signal itself) has a sparse representation on an overcomplete dictionary, and we are looking for the sparsest representation among …
2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2012
The success of sparse reconstruction-based classification algorithms largely depends on the choice of overcomplete basis (dictionary). Existing methods either use the training samples as the dictionary elements or learn a dictionary by optimizing a cost function with an additional discriminating component. While the former requires a good number of training samples per class and is not suitable for video signals, the latter adds instability and more computational load. This paper presents a sparse reconstruction-based classification algorithm that mitigates these difficulties. We argue that learning class-specific dictionaries, one per class, is a natural approach to discrimination. We describe each training signal by an error vector consisting of the reconstruction errors the signal produces w.r.t. each dictionary. This representation is robust to noise and occlusion and is also highly discriminative. The efficacy of the proposed method is demonstrated by high accuracy on image-based species and face recognition and video-based action recognition.
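The error-vector description can be sketched with a simple matching-pursuit residual per class dictionary. This is illustrative only; the function names and the pursuit used to compute each reconstruction error are assumptions.

```python
import numpy as np

def error_vector(dictionaries, y, n_atoms=3):
    """Describe signal y by its reconstruction error w.r.t. each
    class-specific dictionary (one greedy sparse approximation per class);
    the resulting error vector is the discriminative feature."""
    errs = []
    for D in dictionaries:
        residual = y.astype(float).copy()
        for _ in range(n_atoms):
            corr = D.T @ residual
            k = int(np.argmax(np.abs(corr)))
            residual -= corr[k] * D[:, k]       # matching-pursuit step
        errs.append(np.linalg.norm(residual))
    return np.array(errs)

def classify(dictionaries, y):
    """Nearest-dictionary rule: the class whose dictionary reconstructs y
    best (smallest error) wins."""
    return int(np.argmin(error_vector(dictionaries, y)))
```

A signal from class c should be well approximated by dictionary c and poorly by the others, so the error vector is small in exactly one coordinate, which is what makes it discriminative.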
