Academia.eduAcademia.edu

Video Compression

description4,849 papers
group2,271 followers
lightbulbAbout this topic
Video compression is the process of reducing the size of video files by encoding and decoding data to minimize redundancy and optimize storage and transmission efficiency, while maintaining acceptable quality levels. This technique is essential for streaming, storage, and bandwidth management in digital media.
lightbulbAbout this topic
Video compression is the process of reducing the size of video files by encoding and decoding data to minimize redundancy and optimize storage and transmission efficiency, while maintaining acceptable quality levels. This technique is essential for streaming, storage, and bandwidth management in digital media.

Key research themes

1. How can optimized quantization and entropy encoding improve compression efficiency without compromising video quality in modern video codecs?

This research theme focuses on enhancing video compression by designing and applying optimized quantization matrices combined with advanced entropy encoding techniques. Such improvements aim to achieve higher compression rates while preserving or even improving the decoded video quality, particularly relevant for standards like HEVC that incorporate quantization matrices as coding tools.

Key finding: The authors propose an entropy encoding method integrating an optimized quantization matrix (WE-OQM) within HEVC, achieving up to 35.29% better performance than standard entropy encoding and 62.5% better than weighted entropy... Read more
Key finding: This paper introduces an advanced context modeling scheme for coding quantized transform coefficients tailored for large transform blocks (up to 128x128), extending beyond the CABAC scheme in H.264/AVC. By adaptively... Read more
Key finding: The researchers propose DST-7 and DCT-8 transform approximations and fast computation algorithms leveraging the relationship with DCT-2, suitable for Versatile Video Coding (VVC). These transforms, combined with Multiple... Read more

2. What roles do advanced coding frameworks and adaptive partitioning techniques play in balancing compression efficiency, computational complexity, and real-time processing in next-generation video codecs?

This theme investigates novel codec architectures and control mechanisms, including distributed video coding, low complexity enhancement layers, and dynamic frame partitioning, to optimize the trade-offs between compression performance, computational cost, and latency for real-time or bandwidth-limited applications.

Key finding: This study proposes the QUAM scheme that selectively drops zero quantized transform blocks in distributed residual video coding (DRVC), reducing bit planes that require channel coding. This reduces encoder and decoder... Read more
Key finding: LCEVC, designed as a low-complexity enhancement layer over existing codecs (AVC, HEVC, EVC, VVC), shows bitrate savings up to 40% with adaptive streaming at full resolution. Verification tests employing objective and... Read more
Key finding: The authors introduce a dynamic Tile and Rectangular Slice (TRS) frame partitioning approach for VVC encoding that jointly optimizes multi-thread encoding time and encoding quality loss. Using spatial content and prior frame... Read more

3. How do emerging AI-driven and diffusion-based methods enhance perceptual quality and compression efficiency beyond classical video coding standards?

This theme explores the integration of deep learning, neural network architectures, and diffusion models in video compression, aiming to surpass traditional codec limitations by leveraging learned priors and advanced perceptual models to achieve improved compression ratios and visual perceptual quality.

Key finding: This paper presents a novel diffusion-based video compression approach that utilizes denoising diffusion generative models fine-tuned via low-rank adaptations (LoRA) to compress video frames with very low-quality guidance... Read more
Key finding: The authors design an end-to-end trainable deep learning compression network integrating optical flow estimation for motion compensation with convolutional and recurrent neural networks. Incorporating optical flow improves... Read more

All papers in Video Compression

The Discrete Cosine Transform is statistically optimal transform for first order Markov signals, which is widely used in image and video coding. However, in intra frame coding of H.264/AVC, it is knowm that after directional intra... more
Motion Estimation (ME) is the most computationally intensive part in the whole video compression process. The ME algorithms can be divided into full search ME (FS) and fast ME (FME). The FS is not suitable for high definition (HD) frame... more
This work presents a 9/7 tap biorthogonal wavelet design with a minimal degree of complexity. There are two phases in the design process. In the first stage, coarse dyadic wavelet coefficients that approximate the CDF 9/7 filter are... more
Traditionally, sign languages used by the deaf can be learned from teachers, books or video tapes. Digital multimedia together with CD ROM technology now enables production of interactive textbooks and dictionaries for sign languages on... more
The tremendous growth of digital data has led to a high necessity for compressing applications either to minimize memory usage or transmission speed. Despite of the fact that many techniques already exist, there is still space and need... more
An extended version of low-complexity IP Core for image/video transformations based on the CORDIC architecture is presented. This IP core is able to perform quantized 8×8 IDCT and quantized 8×8/4×4 H.264-inverse integer transforms on a... more
Robot and sensor networks are needed for safety, security, and rescue applicationssuch as port security and reconnaissance during a disaster. These applications rely on realtimetransmission of images, which generally saturate the... more
Distributed video coding (DVC) is a recent paradigm which aims at transferring part of the coding complexity from the encoder to the decoder. The performance of such a coding scheme strongly depends on the capacity to estimate correlation... more
Motion estimation (ME) methods based on differential techniques provide useful information for video analysis, and moreover it is relatively easy to embed into them regularity constraints enforcing for example, contour preservation. On... more
In this paper, we address the problem of a temporal signature c onservation, arising in multimedia communications. Speci cally, we are interested in the problem of the ow reconstitution and variable end-to-end delay compensation at the... more
We propose an efficient and accurate wavelet-based noise estimation method for white Gaussian noise in video sequences. The proposed method analyzes the distribution of spatial and temporal gradients in the video sequence in order to... more
A new vector quantisation (VQ) algorithm based on a strategy similar to the cache in traditional computers is proposed and simulated. The method can significantly reduce the redundancy in VQ indices. According to the simulated rcsults,... more
This paper investigates rhe algorithmic complexiq of arithmetic coding in the new H264 video codzng standard and proposes a coprocessor to reduce it by more ihan an order of magnitude. The coprocessor is based on an innovative algOFlfhFi2... more
While computing power and transmission bandwidth have both been steadily increasing over the last few years, bandwidth rather than processing power remains the primary bottleneck for many complex multimedia applications involving... more
Wider dissemination of medical digital video libraries is affected by two correlated factors, resource effective content compression that directly influences its diagnostic credibility. It has been proved that it is possible to meet these... more
Most current research in the domain of image compression focuses solely on achieving state of the art compression ratio, but that is not always usable in today's workflow due to the constraints on computing resources. Constant market... more
During the online edition of video material for television over data networks a major concern is the accumulation of delay over the production chain. Before a video signal of a camera arrives at a remote studio to be edited live, it... more
Gaming on phones, tablets and laptops is very popular. Cloud gaming -where remote servers perform game execution and rendering on behalf of thin clients that simply send input and display output frames -promises any device the ability to... more
This paper presents Kahawai 1 , a system that provides high-quality gaming on mobile devices, such as tablets and smartphones, by offloading a portion of the GPU computation to server-side infrastructure. In contrast with previous... more
This paper presents MAUI, a system that enables fine-grained energy-aware offload of mobile code to the infrastructure. Previous approaches to these problems either relied heavily on programmer support to partition an application, or they... more
The Portuguese coastline, like many other worldwide coastlines, is often submitted to several types of extreme events resulting in erosion, thus, acquisition of high quality field measurements has become a common concern. The nearshore... more
ABSTRACT. In this paper we show the obtained results when using the Discrete Cosine Transform (DCT) into the compression and decompression of digital images; for which, the image is divided in square blocks of nxn. Later these blocks are... more
In this paper, we propose an interactive DTV design that converts non-interactive broadcast DTV streams into interactive ones for multiple simultaneous viewers. To enable viewing interactivity, we show that it is critical to organize data... more
Scalable multimedia data transmission are subject to specific constraints such as the Quality of Service (QoS) of sensitivity classes and the transmission rate (yielding a maximum size of each frame to send). Many scalable source decoders... more
This paper is among the first works to document experimental results for application-aware H.264 Scalable Video Coding (SVC) support over Wireless LANs. Application-aware support is achieved by introducing a bandwidth throttling device,... more
In this paper, we share our experience of designing single-chip multiprocessor controller for advanced multimedia application. We cover the architecture design and validation, the encountered problems and our solutions and we will provide... more
The paper considers the source description problem with average distortion and per-symbol reproduction cost constraints. The source description cost-distortion function is then defined as the minimum of a weighted sum of the rate and the... more
This is a repository copy of Selecting stimuli parameters for video quality studies based on perceptual similarity distances.
Motion Estimation is key role in the world of video compression. Compression is a technique for compacting a video data used for easily transmitting over a network and reducing a size for storage. Motion Estimation is computationally... more
High Efficiency Video Coding (HEVC) is the most recent video coding standard to achieve a higher coding performance than the previous H.264/AVC. In order to accomplish this improved coding performance, HEVC adopted several advanced coding... more
Mobile sign language video conversations can become unintelligible if high video transmission rates cause network congestion and delayed video. In an effort to understand the perceived lower limits of intelligible sign language video... more
We describe our system called MobileASL for real-time video communication on the current U.S. mobile phone network. The goal of MobileASL is to enable Deaf people to communicate with Sign Language over mobile phones by compressing and... more
The current recommended video transmission standards, Telecommunication Standardization Sector (ITU-T) Q.26/16, of 25 frames per second at 100 kilobits per second or higher make mobile sign language video communication less accessible... more
Mobile sign language video conversations can become unintelligible if high video transmission rates cause network congestion and delayed video. In an effort to understand the perceived lower limits of intelligible sign language video... more
In presenting this dissertation in partial fulfillment of the requirements for the doctoral degree at the University of Washington, I agree that the Library shall make its copies freely available for inspection. I further agree that... more
Decomposing video frames into coherent two-dimensional motion layers is a powerful method for representing videos. Such a representation provides an intermediate description that enables applications such as object tracking, video... more
This paper presents an analysis of the scalability of the parallel video decoding on heterogeneous many core architectures. As benchmark, we use a highly parallel H.264/AVC video decoder that generates a large number of independent tasks.... more
This paper proposes a method of transmitting video streaming data based on downsampling-upsampling pyramidal decomposition. By implementing an octal tree decomposition of the frame cubes, prior to transforming them into hypercubes, the... more
* This work was supported in part by the NSF grants CDA-9617310.
-for compression of full-motion video -data rate: 9-40kbps -applications: interactive multimedia and video telephony Format Video Parameters Compressed bit rate SIF 352x240 at 30Hz 1.2-3 Mbps → MPEG-1 CCIR 601 720x486 at 30Hz 5-10 Mbps →... more
Mobile phones have become essential part of modern life and continue to change the way people communicate with each other. The camera phones have become ubiquitous in the recent past and video communication services such as videophone are... more
Digital Video Cassette (DVC) is a quickly proliferating new standard for real-time digital video recording. DVC is presently used both in consumer and professional applications. In this paper we describe the DVC principles, data... more
I151 P. J. Movlan. "Matrices with positive urincioal minors." Linear [ 4 ( 1 + d e t A ) 2 -(all + a 2 2 ) 2 1982. . .
Content inappropriate for children on Internet television is a serious problem in today's multimedia world. There are numerous methods which are used to control the content of the transmitted television programmes. However, these... more
The current conditional access techniques for digital TV systems depend on encrypting MPEG-2 data packets at the bitstream level. Known methods, using visual distortion, can not be used in digital systems because they produce video... more
The motion-estimation search range required for interframe encoding with the MPEG-2 video compression standard depends on a number of factors, including video content, video resolution, elapsed time between reference and predicted... more
Inclusion of the latest research work to a real life applications and achieving adaptive video signal postprocessing is a complex and timely task. The proposed framework breaks the filtering process into several distinct phases and... more
En el presente trabajo se describe un método para transmitir video codificado en H.264/AVC. La transferencia de datos se realiza adaptando el envío de datos al desempeño de la red para garantizar una transmisión continua. La transferencia... more
Download research papers for free!