Content Coverage and Redundancy Removal in Video Summarization
Intelligent Analysis of Multimedia Information
https://doi.org/10.4018/978-1-5225-0498-6.CH013…
Abstract
Over the past decade, research in the field of Content-Based Video Retrieval Systems (CBVRS) has attracted much attention, as it encompasses processing of all the other media types, i.e., text, image, and audio. Video summarization is one of its most important applications, as it potentially enables efficient and faster browsing of large video collections. A concise version of a video is often required due to constraints on viewing time, storage, communication bandwidth, and power. Thus, the task of video summarization is to effectively extract the most important portions of the video without sacrificing its semantic information. The results of video summarization can be used in many CBVRS applications, such as semantic indexing, video surveillance, copied-video detection, etc. However, the quality of the summarization task depends on two basic aspects: content coverage and redundancy removal. These two aspects are both important and contradictory to each other. This chapter aim...
Related papers
Lecture Notes in Computer Science, 2007
Video summarization approaches have various fields of application, particularly in organizing, browsing, and accessing large video databases. In this paper we propose and evaluate two novel approaches for video summarization, one based on spectral methods and the other on ant-tree clustering. The overall summary creation process is broken down into two steps: detection of similar scenes and extraction of the most representative ones. While clustering approaches are used for scene segmentation, the post-processing logic merges video scenes into a subset of user-relevant scenes. In the case of the spectral approach, representative scenes are extracted following the logic that important parts of the video are related to high motion activity of segments within scenes. In the alternative approach, we estimate a subset of relevant video scenes using ant-tree optimization, and in a supervised scenario certain scenes of no interest to the user are recognized and excluded from the summary. An experimental evaluation validating the feasibility and robustness of these approaches is presented.
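The two-step pipeline this abstract describes (group similar scenes, then pick the most representative scene per group) can be sketched as follows. This is an illustrative stand-in only: it uses a plain k-means loop rather than the paper's spectral or ant-tree methods, and a motion-activity maximum as the representativeness rule; `scene_features` and `motion` are assumed per-scene vectors.

```python
import numpy as np

def summarize(scene_features, motion, k):
    """Toy two-step summary: (1) group scenes into k clusters by
    feature similarity (plain k-means, not the paper's methods),
    (2) pick from each cluster the scene with the highest motion
    activity, following the abstract's selection logic."""
    rng = np.random.default_rng(0)
    centers = scene_features[rng.choice(len(scene_features), k, replace=False)]
    for _ in range(10):  # fixed number of k-means iterations
        labels = np.argmin(
            ((scene_features[:, None] - centers[None]) ** 2).sum(-1), axis=1)
        for c in range(k):
            if (labels == c).any():
                centers[c] = scene_features[labels == c].mean(0)
    summary = []
    for c in range(k):
        idx = np.where(labels == c)[0]
        if idx.size:
            summary.append(int(idx[np.argmax(motion[idx])]))
    return sorted(summary)
```

With four scenes forming two clear groups, the sketch returns the highest-motion scene of each group.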
2013
Digital storage plays a major role in almost all of our routine applications in today's technology-driven world. It holds a large amount of data, including videos, extracted features, alerts, statistics, etc. Designing systems to manage this extensive data and make it easily accessible for query and search is a very challenging and potentially rewarding problem. However, the vast majority of research in video indexing has taken place in the field of multimedia, in particular for authored or produced video such as news or movies, and for spontaneous and broadcast video such as sporting events. This paper mainly focuses on the analysis of video using shot boundary detection methods. Shot boundary detection is the fundamental step in content-based video analysis. It is also a major research issue, since it serves as an important parameter in the video retrieval process. The analysis in this paper covers two different methods: 1. block-based histogram difference and 2....
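The block-based histogram difference method named above can be sketched roughly as follows: split each grayscale frame into a grid of blocks, compare per-block intensity histograms between consecutive frames, and declare a cut when the average difference is large. The block count, bin count, and threshold here are illustrative assumptions, not values from the paper.

```python
import numpy as np

def shot_boundaries(frames, blocks=4, bins=16, thresh=0.5):
    """Block-based histogram difference shot detection (sketch).
    `frames` is a list of 2-D grayscale arrays with values in [0, 255];
    a boundary is flagged at index i when the mean normalized per-block
    histogram difference between frames i-1 and i exceeds `thresh`."""
    cuts = []
    for i in range(1, len(frames)):
        a, b = frames[i - 1], frames[i]
        h, w = a.shape
        bh, bw = h // blocks, w // blocks
        diffs = []
        for r in range(blocks):
            for c in range(blocks):
                pa = a[r * bh:(r + 1) * bh, c * bw:(c + 1) * bw]
                pb = b[r * bh:(r + 1) * bh, c * bw:(c + 1) * bw]
                ha, _ = np.histogram(pa, bins=bins, range=(0, 256))
                hb, _ = np.histogram(pb, bins=bins, range=(0, 256))
                # normalized L1 histogram distance in [0, 1]
                diffs.append(np.abs(ha - hb).sum() / (2 * pa.size))
        if np.mean(diffs) > thresh:
            cuts.append(i)
    return cuts
```

Blockwise comparison makes the measure more robust to local motion than a single global histogram, which is the usual motivation for this family of methods.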
IEEE Access
The fast progress in digital technology has sparked the generation of voluminous data from different social media platforms like Instagram, Facebook, YouTube, etc. Other sources, such as news, CCTV videos, sports, and entertainment, generate large data as well. Lengthy videos typically contain a significant number of duplicate occurrences that are uninteresting to the viewer. Eliminating this unnecessary information and concentrating only on the crucial events is far more advantageous. This produces a summary of a lengthy video, which can save viewers time and enable better memory management. The highlights of a lengthy video are condensed into a video summary. Video summarization is an essential topic today, since many industries have CCTV cameras installed for purposes such as monitoring, security, and tracking. Because surveillance videos are recorded 24 hours a day, enormous amounts of memory and time are required if one wishes to trace any incident or person from a full day's video. A summary generated from multiple views is far more challenging, so more study and advancement in MVS is required. The conceptual basis of video summarization approaches is thoroughly addressed in this paper, which also discusses applications and technology challenges in single-view and multi-view summarization. INDEX TERMS Video summarization survey, video sequence, single view summarization (SVS), multi view summarization (MVS), big data.
A Unified Framework for Video Summarization, Browsing and Retrieval, 2006
In this paper, we present a semantic summarization algorithm that interfaces with metadata and works in the compressed domain, in particular on MPEG-1 and MPEG-2 videos. In enabling a summarization algorithm through high-level semantic content, we address two major problems. First, we present the facility in the DVA system that allows semiautomatic creation of this metadata. Second, we address the main point of the system, which is the use of this metadata to filter out frames, creating an abstract of a video based on a Boolean condition set by the user. Our video summary quality survey indicates that the proposed method performs satisfactorily.
Samriddhi - A Journal of Physical Sciences, Engineering and Technology, 2023
As technology progresses, a gigantic amount of video data is generated day by day. Processing such huge videos takes time and requires increased storage and computational power. It is often more convenient for the user to watch a summary or highlight rather than a complete, time-consuming video. So, a fully automated solution is required to extract important segments from a video. Researchers have proposed multiple approaches and techniques for summarizing videos, which resolve the problem of long videos and summarize them according to the video type. This paper presents a survey and comparative evaluation of video summarization techniques across several domains. Primarily, these methods are classified into categories based on the techniques they use. Then an overview of some of the latest literature is presented, together with the datasets and evaluation approaches used. The review also considers domain-specific directions and concludes by presenting the benefits and difficulties associated with current video summarization techniques.
Multimedia Systems, 2003
Video is increasingly the medium of choice for a variety of communication channels, resulting primarily from increased levels of networked multimedia systems. One way to keep our heads above the video sea is to provide summaries in a more tractable format. Many existing approaches are limited to exploring important low-level feature related units for summarization. Unfortunately, the semantics, content and structure of the video do not correspond to low-level features directly, even with closed-captions, scene detection, and audio signal processing. The drawbacks of existing methods are the following:
Automatic video summarization is indispensable for fast browsing and efficient management of large video libraries. In this paper, we introduce an image feature that we refer to as the heterogeneity image patch (HIP) index. The proposed HIP index provides a new entropy-based measure of the heterogeneity of patches within any picture. By evaluating this index for every frame in a video sequence, we generate a HIP curve for that sequence. We exploit the HIP curve in solving two categories of video summarization applications: key frame extraction and dynamic video skimming. Under the key frame extraction framework, a set of candidate key frames is selected from abundant video frames based on the HIP curve. Then, a proposed patch-based image dissimilarity measure is used to create an affinity matrix of these candidates. Finally, a set of key frames is extracted from the affinity matrix using a min-max-based algorithm. Under video skimming, we propose a method to measure the distance between a video and its skimmed representation. The video skimming problem is then mapped into an optimization framework and solved by minimizing a HIP-based distance for a set of extracted excerpts. The HIP framework is pixel-based and does not require semantic information or complex camera motion estimation. Our simulation results are based on experiments performed on consumer videos and are compared with state-of-the-art methods. It is shown that the HIP approach outperforms other leading methods, while maintaining low complexity.
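The core idea of an entropy-based per-frame heterogeneity score can be sketched in the spirit of the HIP index, though this is not the authors' exact formulation: compute the Shannon entropy of the intensity histogram within each non-overlapping patch and average over the frame. The patch size and bin count below are illustrative assumptions.

```python
import numpy as np

def patch_entropy(frame, patch=8, bins=16):
    """Illustrative heterogeneity score in the spirit of the HIP
    index (not the paper's exact definition): the mean Shannon
    entropy of intensity histograms over non-overlapping patches
    of a 2-D grayscale frame with values in [0, 255]."""
    h, w = frame.shape
    ents = []
    for r in range(0, h - patch + 1, patch):
        for c in range(0, w - patch + 1, patch):
            hist, _ = np.histogram(frame[r:r + patch, c:c + patch],
                                   bins=bins, range=(0, 256))
            p = hist / hist.sum()
            p = p[p > 0]  # drop empty bins before taking logs
            ents.append(-(p * np.log2(p)).sum())
    return float(np.mean(ents))
```

A flat frame scores zero while a textured frame scores higher, so plotting this value over time yields a curve from which candidate key frames could be selected, as the abstract describes.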
Expert Systems with Applications, 2013
The advances in computer and network infrastructure, together with the fast evolution of multimedia data, have drawn growing attention to digital video. The scientific community has increased the amount of research into new technologies with a view to improving digital video utilization: its archiving, indexing, accessibility, acquisition, storage, and even its processing and usability. All these aspects of video utilization entail extracting all the important information of a video, especially when metadata information is lacking. The main goal of this paper is the construction of a system that automatically generates and provides all the essential information, both in visual and textual form, of a video. Using the visual or the textual information, a user can on the one hand locate a specific video and on the other hand rapidly comprehend its basic points and, generally, its main concept without needing to watch the whole of it. The visual information of the system emanates from a video summarization method, while the textual one derives from a keyword-based video annotation approach. The annotation technique is based on the keyframes that constitute the video abstract; therefore, the first part of the system consists of the new video summarization method. According to the proposed video abstraction technique, each frame of the video is initially described by Compact Composite Descriptors (CCDs) and a visual word histogram. Afterwards, the proposed approach utilizes the Self-Growing and Self-Organized Neural Gas (SGONG) network to classify the frames into clusters. The extraction of a representative key frame from every cluster leads to the generation of the video abstract. The most significant advantage of the video summarization approach is its ability to dynamically calculate the appropriate number of final clusters.
Subsequently, a new video annotation method is applied to the generated video summary, leading to the automatic generation of keywords capable of describing the semantic content of the given video. This approach is based on the recently proposed N-closest Photos Model (NCP). Experimental results on several videos are presented not only to evaluate the proposed system but also to indicate its effectiveness.
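The distinctive claim above, that the number of clusters (and hence key frames) is determined dynamically rather than fixed in advance, can be illustrated with a much simpler stand-in than the SGONG network, which is not reimplemented here: a single online pass that opens a new cluster whenever a frame's feature vector lies farther than a radius from every existing centroid. The `radius` parameter and the first-frame-per-cluster key frame rule are assumptions for this sketch.

```python
import numpy as np

def adaptive_keyframes(frame_feats, radius=1.0):
    """Sketch of dynamic cluster-count selection (a simple stand-in
    for the SGONG network): an online pass opens a new cluster when
    a frame is farther than `radius` from every existing centroid;
    the first frame of each cluster serves as its key frame."""
    centroids, counts, keyframes = [], [], []
    for i, f in enumerate(frame_feats):
        if centroids:
            d = [np.linalg.norm(f - c) for c in centroids]
            j = int(np.argmin(d))
        if not centroids or d[j] > radius:
            centroids.append(f.astype(float))  # open a new cluster
            counts.append(1)
            keyframes.append(i)
        else:
            counts[j] += 1
            centroids[j] += (f - centroids[j]) / counts[j]  # running mean
    return keyframes
```

The number of returned key frames adapts to how varied the footage is, which is the property the abstract highlights as the method's main advantage.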
