3D Video Compression

description31 papers

group401 followers

lightbulbAbout this topic

3D video compression is the process of reducing the data size of three-dimensional video content while maintaining visual quality. This technique employs algorithms to efficiently encode spatial and temporal information, enabling effective storage and transmission of 3D videos across various platforms and devices.

lightbulbAbout this topic

Key research themes

1. How can video coding standards be extended to efficiently compress 3D and multiview video formats?

This research area focuses on the development and standardization of video coding extensions to support multiview and 3D video, particularly integrating depth information alongside texture to enable high-quality stereoscopic and autostereoscopic displays. Efficient exploitation of inter-view redundancy and depth-texture correlations is vital to achieve substantial bitrate savings for immersive video applications.

Overview of the Multiview and 3D Extensions of High Efficiency Video Coding

by Soodabeh 1992

2016

Key finding: Introduces MV-HEVC and 3D-HEVC as extensions of the HEVC standard, where MV-HEVC reuses single-layer decoders and exploits inter-view references for 20-30% bitrate savings over simulcast. 3D-HEVC further improves coding... Read more

articleView Paper downloadDownload

HEVC Based Frame Interleaved Coding Technique for Stereo and Multi-View Videos

by Pooneh Bagheri Zadeh

2023, Information

Key finding: Presents HEVC-FISMVC, a single-layer HEVC-based interleaved frame coding approach that maximizes temporal, inter-view, and combined correlations by treating stereo/multiview video as an interleaved monoscopic stream,... Read more

articleView Paper downloadDownload

Effect of inter-camera angles on the performance of an H.264/AVC based multi-view video codec

by Hany H . M . Said and

2013

Key finding: Analyzes how varying inter-camera angles impact coding efficiency of an H.264/AVC multi-view codec using different three-camera combinations from the Breakdancers dataset. Results indicate that multi-view coding outperforms... Read more

articleView Paper downloadDownload

Design and Evaluation of 3D Video System Based on H.264 View Coding

by Lakis Christodoulou

2022

Key finding: Demonstrates exploiting human visual system characteristics to enable asymmetric compression of stereo views using H.264, showing that unequal quality compression guided by eye dominance can maintain perceptual 3D quality... Read more

articleView Paper downloadDownload

keyboard_arrow_downShow more

2. What methods are effective for compressing stereoscopic and multi-picture object (MPO) images leveraging inter-view redundancy?

This theme explores approaches for stereoscopic image compression that exploit the high redundancy between left and right views, focusing on MPO formats and adaptive compression strategies that balance storage savings and visual quality. Evaluations consider traditional disparity map-based methods as well as adaptive independent coding enhanced by inter-view information sharing.

A Benchmark Evaluation of Adaptive Image Compression for Multi Picture Object Stereoscopic Images

by Alessandro Ortis

2022, Journal of Imaging

Key finding: Evaluates adaptive stereoscopic image compression where the two views are independently compressed at different quality factors, followed by an enhancement step exploiting inter-view redundancy. Experimental results highlight... Read more

articleView Paper downloadDownload

Subjective study on compressed asymmetric stereoscopic video

by Moncef Gabbouj

2024

Key finding: Through subjective testing, shows that resolution-asymmetric stereoscopic video where one view is downsampled by 1/2 horizontally and vertically yields perceptual quality comparable to symmetric full-resolution stereo at... Read more

articleView Paper downloadDownload

3D22MX: Performance Subjective Evaluation of 3D/Stereoscopic Image Processing and Analysis

by Oswaldo Matamoros

2023, Mathematics

Key finding: Develops a 3D/stereoscopic image database, degradation software, and conducts psychophysical experiments for quality assessment specifically tailored for stereo images. The study reveals that conventional 2D image quality... Read more

articleView Paper downloadDownload

keyboard_arrow_downShow more

3. How can compression of 3D point clouds and 3D meshes be optimized for immersive and real-time applications?

This theme covers the burgeoning research into compressing 3D spatial data representations such as point clouds and meshes, critical for augmented reality, tele-immersion, and interactive 3D applications. It emphasizes techniques addressing the high data volume while maintaining visual fidelity and the perceptual quality of reconstructed content, and the challenges posed by real-time processing and noisy reconstructed data.

3D Point Cloud Compression

by Marius Preda

2021, The 24th International Conference on 3D Web Technology

Key finding: Provides a structured survey categorizing 3D point cloud compression methods into 1D traversal, 2D projection-based, and direct 3D approaches. Highlights the challenge of compressing unstructured point clouds carrying... Read more

articleView Paper downloadDownload

Subjective Visual Quality Assessment of Immersive 3D Media Compressed by Open-Source Static 3D Mesh Codecs

by Petros Drakoulis

2021, EasyChair Preprints

Key finding: Presents subjective studies evaluating compression artifacts on 3D meshes generated from real-time RGB-D sensors versus high-quality scans. Results indicate that irregular and noisy meshes from real-time reconstruction are... Read more

articleView Paper downloadDownload

4. Can emerging deep learning and diffusion-based methods improve video compression beyond traditional codecs?

This research area investigates the integration of deep learning techniques, including neural networks and diffusion models, in video compression frameworks aiming at end-to-end optimized, perceptually enhanced coding. These methods promise to overcome limitations of classical codecs by leveraging learned priors and generative modeling to better exploit spatio-temporal redundancies and enable novel functionalities like view synthesis with reduced bitrates.

OpenDMC: An Open-Source Library and Performance Evaluation for Deep-learning-based Multi-frame Compression

by Shangkun Sun

2025, Proceedings of the 31st ACM International Conference on Multimedia

Key finding: Introduces OpenDMC, the first open-source cross-platform deep learning video compression library, encompassing classical and recent end-to-end neural methods such as DVC. Benchmarks report promising rate-distortion efficiency... Read more

articleView Paper downloadDownload

Diffusion-Based Compression

by Bryan Westcott

2024

Key finding: Proposes a novel diffusion-based video compression scheme leveraging denoising diffusion generative models as powerful priors, combined with low-quality encoded guidance data via finetuned low-rank adaptations (e.g., LoRA).... Read more

articleView Paper downloadDownload

Low Computational Coding-Efficient Distributed Video Coding: Adding a Decision Mode to Limit Channel Coding Load

by Manzoor Ahmed Hashmani

2024, Entropy

Key finding: Develops a transform-domain distributed residual video coding architecture implementing a Quantized Transform Decision Mode (QUAM) to skip zero transform blocks and thus reduce the overall channel coding and decoding... Read more

articleView Paper downloadDownload

keyboard_arrow_downShow more

All papers in 3D Video Compression

Diffusion-Based Compression

by Bryan Westcott

2024

This document presents a novel diffusion-based video compression technique. We leverage the inherent expressiveness, photorealism and 3D awareness of denoising diffusion generative AI models as a powerful general-purpose prior that only... more

descriptionView Paper arrow_downwardDownload

Depth map compression using multi-resolution graph-based transform for depth-image-based rendering

by Oscar Au

2024, 2012 19th IEEE International Conference on Image Processing

Depth map compression is important for efficient network transmission of 3D visual data in texture-plus-depth format, where the observer can synthesize an image of a freely chosen viewpoint via depth-image-based rendering (DIBR) using... more

descriptionView Paper arrow_downwardDownload

Mixed-resolution HEVC based multiview video codec for low bitrate transmission

by Akbar Sheikh Akbari

2024, Multimedia Tools and Applications

There has been increasing demand for multiview video transmission over band limited channel over past years and various techniques have been proposed to fulfil this need. In this paper, a High Efficiency Video Codec (HEVC) based spatial... more

descriptionView Paper arrow_downwardDownload

Efficient streaming of stereoscopic depth-based 3D videos

by Ghassan Alregib

2024, Visual Information Processing and Communication IV

In this paper, we propose a method to extract depth from motion, texture and intensity. We first analyze the depth map to extract a set of depth cues. Then, based on these depth cues, we process the colored reference video, using texture,... more

descriptionView Paper arrow_downwardDownload

Fast and Efficient Lenslet Image Compression

by Antonio Pinheiro

2023, ArXiv

Light field imaging is characterized by capturing brightness, color, and directional information of light rays in a scene. This leads to image representations with huge amount of data that require efficient coding schemes. In this paper,... more

descriptionView Paper arrow_downwardDownload

Evaluation of 4D Light Field Compression Methods

by David Barina

2023, Computer Science Research Notes

Light field data records the amount of light at multiple points in space, captured e.g. by an array of cameras or by a light-field camera that uses microlenses. Since the storage and transmission requirements for such data are tremendous,... more

descriptionView Paper arrow_downwardDownload

Mixed-resolution HEVC based multiview video codec

by Ah-Lian Kor

2023

The aim of the Leeds Beckett Repository is to provide open access to our research, as required by funder policies and permitted by publishers and copyright law. The Leeds Beckett repository holds a wide range of publications, each of... more

descriptionView Paper arrow_downwardDownload

Fusion of colour and depth partitions for depth map coding

by R. Morros

2023

3D video coding includes the use of multiple color views and depth maps associated to each view. An adequate coding of depth maps should be adapted to the characteristics of depth maps: smooth regions and sharp edges. In this paper a... more

descriptionView Paper arrow_downwardDownload

Sistem Klasifikasi Penumpang Bus Trans Padang Berdasarkan Pakaian Menggunakan Metode Image Processing

by Muhammad Naufal

2023, CHIPSET

Public transportation is one of the things that is often used by everyone. One type of public transportation that is often used in Indonesia is the Bus Rapid Transit (BRT) or commonly known as the busway. The city of Padang, West Sumatra... more

descriptionView Paper arrow_downwardDownload

Adaptive Quantization Parameter Cascading in HEVC Hierarchical Coding

by Chang Wen Chen

2023, IEEE transactions on image processing : a publication of the IEEE Signal Processing Society

The state-of-the-art High Efficiency Video Coding (HEVC) standard adopts a hierarchical coding structure to improve its coding efficiency. This allows for the Quantization Parameter Cascading (QPC) scheme that assigns Quantization... more

COMPARISON I OF DIFFERENT D-Q AND R-Q MODELS TABLE II

Fig. 1. An example of an RA prediction structure with a GOP size of 8. influenced by its reference frames in a lower layer. This RD dependency was studied in [28] for HEVC default RA setting. An approximately linear inter-layer D dependency was observed as fe: es

According to [37], the convergence is quadratic if A), B; and I; are sufficiently smooth and the initial point is close to one of the roots of the equations. In this work, we set xeE(0) = {0;0;...;0; 1} and the iteration is stopped when |J~'£(xg(1)) |? < 0.01 - (L + 1), which yields an average error of AQ) to be approximately less than 0.1. The iteration process usually takes less than 10 times. Note that there exists a very small possibility that J~! does not exist if

Fig. 2. Examples of Y-PSNR improvements at the same bit rates. Horizontal axis: Bit rate; Vertical axis: Y-PSNR improvement. (a) Traffic in Class A. (b) Kimono in Class B. (c) RaceHorses in Class C. (d) RaceHorses in Class D. (e) BasketballDrillText in Class F.

Fig. 3. An example of an LD prediction structure with a GOP size of 4. COMPARISON OF DIFFERENT METHODS WITH HEVC LD

COMPARISON OF DIFFERENT METHODS WITH HEVC LP Fig. 4. The proposed QPC method adapts to different @ values.

Fig. 5. The proposed QPC method adapts to different SI and TI values. (a) & versus SI/O. (b) & versus TI/O.

Fig. 6. The proposed QPC method adapts in timeline.

PERCENTAGES OF USING THE NEAREST REFERENCES IN HEVC RA TABLE I which is similar to that in [28]. The coding performance of our conclusion will be examined by HEVC default multiple reference settings in Section V.

COMPARISON II OF DIFFERENT D-Q AND R-Q MODELS TABLE III

COMPARISON OF DIFFERENT METHODS WITH HEVC RA TABLE IV where C, and C> are two constants. Hence, we determine O; as the quantization step size that is least different from 01 in logarithmic domain. Qp, is then set as the corresponding Qp of Q). Specially, to avoid quality fluctuation between GOPs, we clip the Qps of all layers to be within [Q p—3, QO p+3l, where Q p is the average Qp of all layers calculated by the static Qp scheme.

COMPUTATIONAL COMPLEXITIES OF NEWTON-RAPHSON METHOD IN TERMS AVENUMITERPERGOP, AVETIMEPERITER (15) AND TOTITERTIMEPPM

CONTRIBUTION OF ADAPTATION IN THE PROPOSED METHOD TABLE VII AveNumlterPerGOP, represents the average number of iterations when perform QPC in one GOP; AveTimePerlter, represents the average time (in terms of ys) per iteration; and TotIterTimePPM, represents the total iteration time in terms of part per million of the overall encoding time. The test was run on a 2011 Laptop with a Quad-Core i17-2.70GHz CPU and 8GB memory. The measure results are summarized in Table V. It can be seen that on average, less than 10 times of Newton-Raphson iterations are required to perform QPC and the average time of each iteration is less than 1 ws. As the Qp increase, the AveNumlIterPerGOP slightly increases. Besides, the total time cost of these iterations is less than | part per mil- lion of the overall encoding time. Therefore, the computation overhead of Newton-Raphson method is negligible. proposed approach is negligible. Investigations of the Newton- Raphson method also show that, less than 10 iterations are required for most cases, which ensures a low computational complexity. Therefore, the coding time differences between the original HEVC and the proposed approaches are mostly determined by the changing of Qps, which may be positive or negative. Therefore, our scheme may even reduce the overall coding time in some cases.

descriptionView Paper arrow_downwardDownload

Sistem Klasifikasi Penumpang Bus Trans Padang Berdasarkan Pakaian Menggunakan Metode Image Processing

by Muhammad Rizky Naufal

2023, CHIPSET

descriptionView Paper arrow_downwardDownload

Fast and Efficient Lenslet Image Compression

by Antonio Pinheiro

2023, ArXiv

descriptionView Paper arrow_downwardDownload

Computationally Efficient Light Field Image Compression Using a Multiview HEVC Framework

by Waqas Ahmad

2023, IEEE Access

The acquisition of the spatial and angular information of a scene using light field (LF) technologies supplement a wide range of post-processing applications, such as scene reconstruction, refocusing, virtual view synthesis, and so forth.... more

FIGURE 7. The rate-distortion comparison of the proposed scheme with graph learning scheme [29] and JPEG Pleno anchor scheme for selected LF images.

TABLE 1. Selected LF images from JPEG Pleno datasets.

TABLE 2. The reduction in number of times SAD is computed using the proposed motion-search optimization.

FIGURE 1. Block diagram of the overall proposed framework. The input from plenoptic cameras or MCSs is converted into MPVS and transferred as input to the MV-HEVC based LF coding framework. A hierarchical organization of frames is introduced in MV-HEVC, and modifications are proposed in reference picture management, rate-allocation, and ME schemes.

FIGURE 2. Hierarchical organization for horizontal parallax only LF data (17 views). The central view at index 8 is of high importance, and it is placed in the first level. Views with indexes {0,4,12,16} are placed in the second level, views with indexes {2,6,10,14} are placed in the third level, and the remaining views are placed in the last level. The same hierarchical organization is applied to 2D LF views.

FIGURE 3. Rate-Distortion comparison for three different input formats (10-bit YUV444, 8-bit YUV444, and 8-bit YUV420 format).

FIGURE 5. The impact of the number of reference pictures on rate-distortion curves for Lytro, Stanford, and HDCA LF images.

FIGURE 6. The rate-distortion analysis between reference motion-search (RMS) and the proposed LF motion-search (LMS) for the “Bikes” LF image.

TABLE 3. BD-PSNR gain of the proposed scheme relative to graph learning scheme [29] and JPEG Pleno anchor schemes. of 2.4 dB for the “Tarot cards” LF image and 2.2 dB for the ““Set2”’ LF image. In all the RD comparisons, the proposed scheme performs equally better in all the tested bitrates. The RD information indicates that gains in compression effi- ciency for plenoptic images are higher compared for MCS. The SAIs of plenoptic images contain high angular correla- tion, which is a consequence of the narrow baseline; also in addition, the small disparity among SAIs contributes to less overhead of MVs. In an alternative perspective, the proposed coding scheme relates the disparity information to the frame- per-second (FPS) analogy of video acquisition systems. The lower disparity in the captured SAIs represents high FPS, which in turn, reflects high correlation in neighboring SAIs. On the other hand, high disparity relates to low FPS and lower correlation among neighboring views.

descriptionView Paper arrow_downwardDownload

Fast Multi-View Image Rendering Method Based on Reverse Search for Matching

by Frank Yan

2023, Optik

In a traditional multi-view image generation algorithm, partial image information might be lost at the pixel mapping step during the 3D image acquisition. A lower hardware cost and shorter operation time can be realized if an effective... more

descriptionView Paper arrow_downwardDownload

Bayesian Early Mode Decision Technique for View Synthesis Prediction-Enhanced Multiview Video Coding

by Shakeel Ahmad

2023, IEEE Signal Processing Letters

View synthesis prediction (VSP) is a coding mode that predicts video blocks from synthesised frames. It is particularly useful in a multi-camera setup with large inter-camera distances. Adding a VSP-based SKIP mode to a standard Multiview... more

descriptionView Paper arrow_downwardDownload

HEVC Based Frame Interleaved Coding Technique for Stereo and Multi-View Videos

by Akbar Sheikh Akbari

2023, Information

The standard HEVC codec and its extension for coding multiview videos, known as MV-HEVC, have proven to deliver improved visual quality compared to its predecessor, H.264/MPEG-4 AVC’s multiview extension, H.264-MVC, for the same frame... more

descriptionView Paper arrow_downwardDownload

Harnessing Multi-View Perspective of Light Fields for Low-Light Imaging

by kranthi kumar

2023, IEEE Transactions on Image Processing

Light Field (LF) offers unique advantages such as post-capture refocusing and depth estimation, but low-light conditions severely limit these capabilities. To restore low-light LFs we should harness the geometric cues present in different... more

descriptionView Paper arrow_downwardDownload

Perceptual Awareness Rate Control for Multi-View Video Encoder in Stereoscopic Display

by Pei-Jun Lee

2023, Journal of Display Technology

Multi-view video coding technique has been widely applied to 3D stereoscopic display. This paper proposes a rate control algorithm for multi-view video codec which is a tradeoff method between the data compression ratio and picture... more

descriptionView Paper arrow_downwardDownload

PBPAIR: Probability Based Power Aware Intra Refresh A New Energy-efficient Error-resilient Encoding Scheme

by Nalini Venkatasubramanian

2023

Error resilient encoding in video communication is becoming increasingly important due to data transmission over unreliable channels. In this paper, we propose a new power-aware error resilient coding scheme based on network error probability and user expectation in video communication using mobile handheld devices. By considering both image content and network conditions, we can achieve a fast recoverable and energy-efficient error resilient coding scheme. More importantly, our approach allows system designers to evaluate various operating points in terms of error resilient level and energy consumption over a wide range of system operating conditions. We have implemented our scheme on an H.263 video codec algorithm, compared it with the previous AIR, GOP and PGOP coding schemes, and measured energy consumption and video quality on the IPAQ and Zaurus PDAs. Our experimental results show that our approach reduces energy consumption by 34%, 24% and 17% compared with AIR, GOP and PGOP schemes respectively, while incurring only a small fluctuation in the compressed frame size. In addition, our experimental results prove that our approach allows faster error recovery than the previous AIR, GOP and PGOP approaches. We believe our error resilient coding scheme is therefore eminently applicable for video communication on energy-constrained wireless mobile handheld devices. Recent advances in technology enable mobile handheld devices to be equipped with wireless interfaces and there will be growing demand for high quality mobile multimedia communications. However, wireless multimedia communications in the mobile handheld environment face several challenges, including high error rate, bandwidth variations, and limitations of the mobile devices such as battery lifetime constraints and the low CPU computation capability. To overcome the bandwidth limitation, there are several existing video coding techniques developed, for example, H.263 and MPEG, to compress raw video sequences to encoded bitstreams. These video encoding techniques exploit spatial and temporal correlation to achieve a high compression ratio, but they are usually unaware about the device status and the network conditions during the coding process. Therefore, multimedia data encoding requires a large amount of information, leading to high computation and communication energy consumption, and transmitting multimedia data over wireless networks can be very unreliable due to packet loss. This problem should be solved with the reasonable compression efficiency with high error resiliency considering resource constraints, which is a crucial factor for the real-time multimedia communication over error prone and lossy network using mobile handheld devices. Video communication over unreliable networking environments is challenging since data loss and corruption from several reasons such as traffic congestion and physical channel failure affect video quality severely unless a guaranteed quality of service (QoS) is available between the source and the destination. Also, the spatio-temporal prediction encoding and variable length coding (VLC) of the source coding cause error propagation. Since spatiotemporal prediction requires the previous frame to reconstruct the current frame, a single error can lead to consecutive errors in the following frames. Likewise, because of VLC, a single bit error causes the decoder to lose a synchronization point that makes the following bits useless. Therefore, a variety of techniques have been proposed to enhance the resilience of the video data encoding against the packet errors [1, 2]. The most well recognized method is to insert intra-coding 1 to mitigate the effect of error propagation in a predictive video compression algorithm [3, 4, 5, 6]. However, inserting intra-coding influences compression efficiency adversely since it tends to increase total length of the encoded bitstream. From this observation, the prior studies on error resilient video encoding mainly tried to find out a solution that maximizes bitstream robustness with low bit rate. Meanwhile, as mobile devices increasingly have video communication functionality, low power encoding has become important. Several encoding schemes have been proposed to reduce energy consumption for multimedia applications [9, 10, 11, 12]. However, these studies dealt with either error resilience or low power issues independently. We believe it is critical for both issues to be addressed together, especially in the context of energy-constrained mobile devices. In this paper, we propose a new energy-efficient, error-resilient encoding scheme. Especially, we note the dual role of intra-coding: not only does intra-coding improve error resilience, but it also contributes to reducing encoding energy consumption since it does not require motion estimation (which is the most power consuming operation in a predictive video compression algorithm). Indeed, the system designer will therefore need to evaluate the trade-off between the error resiliency level, compression efficiency, and power consumption. In this paper, we focus our attention on this tradeoff. Specifically, we (i) propose PBPAIR (Probability Based Power Aware Intra Refresh), a new energy-efficient and error-resilient encoding scheme, based on the network condition and the image content, (ii) implement our scheme as well as other existing error resilient encoding schemes on an H.263 codec, (iii) extensively compare with other error resilient encoding schemes in the context of error resiliency vs. encoding efficiency (both bit rate and energy consumption), and (iv) evaluate the trade-offs between the error resiliency level, compression efficiency, and energy consumption on top of real implementation platform. Our performance results indicate that PBPAIR saves as much as 17% to 34% energy compared with other error resilient techniques allowing faster error recovery than the previous approaches. This paper is organized as follows. In the next section, we briefly review previous work on error resilient

descriptionView Paper arrow_downwardDownload

Locally linear embedding-based prediction for 3D holoscopic image coding using HEVC

by Sergio Faria

2023, 2014 22nd European Signal Processing Conference (EUSIPCO)

Holoscopic imaging is a prospective acquisition and display solution for providing true 3D content and fatigue-free 3D visualization. However, efficient coding schemes for this particular type of content are needed to enable proper... more

descriptionView Paper arrow_downwardDownload

Light Field Image Coding Using High-Order Intrablock Prediction

by Sergio Faria

2023, IEEE Journal of Selected Topics in Signal Processing

This paper proposes a two-stage high order intra block prediction method for light field image coding. This method exploits the spatial redundancy in lenslet light field images by predicting each image block, through a geometric... more

descriptionView Paper arrow_downwardDownload

Light field HEVC-based image coding using locally linear embedding and self-similarity compensated prediction

by Sergio Faria

2023, 2016 IEEE International Conference on Multimedia & Expo Workshops (ICMEW)

Light field imaging is a promising new technology that allows the user not only to change the focus and perspective after taking a picture, as well as to generate 3D content, among other applications. However, light field images are... more

descriptionView Paper arrow_downwardDownload

Road Surface Crack Detection using a Light Field Camera

by Paulo Correia

2023, 2018 26th European Signal Processing Conference (EUSIPCO)

During traditional road surveys, inspectors capture images of pavement surface using cameras that produce 2D images, which can then be automatically processed to get a road surface condition assessment. This paper proposes a novel crack... more

descriptionView Paper arrow_downwardDownload

The Effect of Depth Compression on Multiview Rendering Quality

by Peter With

2022, 2008 3DTV Conference: The True Vision - Capture, Transmission and Display of 3D Video

DOI to the publisher's website. • The final author version and the galley proof are versions of the publication after peer review. • The final published version features the final layout of the paper including the volume, issue and page... more

descriptionView Paper arrow_downwardDownload

Perceptual depth compression for stereo applications

by Kari Pulli

2022, Computer Graphics Forum

Conventional depth video compression uses video codecs designed for color images. Given the performance of current encoding standards, this solution seems efficient. However, such an approach suffers from many issues stemming from... more

descriptionView Paper arrow_downwardDownload

Optimal Mode Selection of Disparity-Compensated Wavelet Lifting for Multi-View Image Coding

by Amares Kaewpunya

2022, TENCON 2006 - 2006 IEEE Region 10 Conference

This paper presents the optimal mode selection of disparity-compensated (DC) wavelet lifting in a multi-view image coding framework. The optimal mode is a combination of macroblock (MB) coding and block partitioning mode of DC wavelet... more

descriptionView Paper arrow_downwardDownload

Automatic View Synthesis by Image-Domain-Warping

by Aljosa Smolic

2022, IEEE Transactions on Image Processing

Today, stereoscopic 3D (S3D) cinema is already mainstream, and almost all new display devices for the home support S3D content. S3D distribution infrastructure to the home is partly already established in form of 3D Blu-ray discs, video... more

descriptionView Paper arrow_downwardDownload

Enhanced motion cues for automatic depth extraction for 2D-to-3D video conversion

by Gustavo Moreira Alves

2022, 2014 International Telecommunications Symposium (ITS)

In this paper we present two methods of depth extraction for 2D-to-3D video conversion. One for a scene captured with a static camera and other for the case of a moving camera, both using information from the motion present on the scene.... more

descriptionView Paper arrow_downwardDownload

A Potential Heuristic-based Block Matching Algorithms for Motion Estimation in Video Compression

by MOHAMAD TAIB MISKON

2022, Journal of Telecommunication, Electronic and Computer Engineering

Motion estimation (ME) is one of the element keys in video compression that takes up to 60% in processing time. Block matching algorithm (BMA) is a technique that is used to reduce the computational complexity of ME algorithm due to its... more

descriptionView Paper arrow_downwardDownload

Block Compressive Sensing Single-View Video Reconstruction Using Joint Decoding Framework for Low Power Real Time Applications

by Syed Hasnain Adil

2022, Applied Sciences

Several real-time visual monitoring applications such as surveillance, mental state monitoring, driver drowsiness and patient care, require equipping high-quality cameras with wireless sensors to form visual sensors and this creates an... more

descriptionView Paper arrow_downwardDownload

JPEG Pleno: Standardizing a coding framework and tools for plenoptic imaging modalities

by C. Pagliari

2022

JPEG Pleno is an upcoming standard from the ISO/IEC JTC 1/SC 29/WG 1 (JPEG) Committee. It aims to provide a standard framework for coding new imaging modalities derived from representations inspired by the plenoptic function. The image... more

descriptionView Paper arrow_downwardDownload

Light Field Compression With Homography-Based Low-Rank Approximation

by Reuben A. Farrugia

2022, IEEE Journal of Selected Topics in Signal Processing

This paper describes a light field compression scheme based on a novel homography-based low rank approximation method called HLRA. The HLRA method jointly searches for the set of homographies best aligning the light field views and for... more

descriptionView Paper arrow_downwardDownload

Multiview Depth-Image Compression Using an Extended H.264 Encoder

by Peter With

2022, Lecture Notes in Computer Science

This paper presents a predictive-coding algorithm for the compression of multiple depth-sequences obtained from a multi-camera acquisition setup. The proposed depth-prediction algorithm works by synthesizing a virtual depth-image that... more

descriptionView Paper arrow_downwardDownload

Incorporating Depth-Image Based View-Prediction into H.264 for Multiview-Image Coding

by Peter With

2022, 2007 IEEE International Conference on Image Processing

We investigate the coding of multiview images obtained from a set of multiple cameras. To exploit the interview correlation, two viewprediction tools have been implemented and used in parallel: a blockbased motion compensation scheme and... more

descriptionView Paper arrow_downwardDownload

System architecture for free-viewpoint video and 3D-TV

by Peter With

2022, IEEE Transactions on Consumer Electronics

descriptionView Paper arrow_downwardDownload

Fast and Efficient Lenslet Image Compression

by Manuela Pereira

2022, ArXiv

descriptionView Paper arrow_downwardDownload

A graph learning approach for light field image compression

by Bayu Octa

2022, Applications of Digital Image Processing XLI

In recent years, light field imaging has attracted the attention of the academic and industrial communities thanks to its enhanced rendering capabilities that allow to visualise contents in a more immersive and interactive way. However,... more

descriptionView Paper arrow_downwardDownload

Threshold-free pattern-based low bit rate video coding

by Manoranjan Paul

2022, 2008 15th IEEE International Conference on Image Processing

Pattern-based video coding (PVC) has already established its superiority over H.264 in low bit rate areas because of an extra pattern-mode to segment out the arbitrary shape of the moving region in macroblock. To determine the... more

descriptionView Paper arrow_downwardDownload

Efficient H.264/AVC Video Encoder Where Pattern Is Used as Extra Mode for Wide Range of Video Coding

by Manoranjan Paul

2022, Lecture Notes in Computer Science

Pattern-based video coding representing moving regions in macroblock has very good potential for improved coding efficiency over existing standard H.264/AVC in the very low bit-rate range. However, the coding efficiency diminishes... more

descriptionView Paper arrow_downwardDownload

A real time generic variable pattern selection algorithm for very low bit-rate video coding

by Manoranjan Paul

2022, Proceedings 2003 International Conference on Image Processing (Cat. No.03CH37429)

The selection of an optimal regular-shaped pattern set for very low bit-rate video coding, focusing on moving regions has been the objective of much recent research in order to try and improve bit-rate eficieciency. Selecting the optimal... more

descriptionView Paper arrow_downwardDownload

A new real-time pattern selection algorithm for very low bit-rate video coding focusing on moving regions

by Manoranjan Paul

2022, 2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698)

A new real-time pattern selection algorithm for very low bit-rate video coding focusing on moving regions.

descriptionView Paper arrow_downwardDownload

Threshold-free pattern-based low bit rate video coding

by Manoranjan Paul

2022, 2008 15th IEEE International Conference on Image Processing

descriptionView Paper arrow_downwardDownload

An Optimal Content-Based Pattern Generation Algorithm

by Manoranjan Paul

2022, IEEE Signal Processing Letters

Very low bit-rate video coding algorithms using predefined regular-shaped patterns to segment out moving objects at macroblock level have exhibited good potential for improved coding efficiency when embedded in the H.264 standard as an... more

descriptionView Paper arrow_downwardDownload

Video coding using arbitrarily shaped block partitions in globally optimal perspective

by Manoranjan Paul

2022, EURASIP Journal on Advances in Signal Processing

Algorithms using content-based patterns to segment moving regions at the macroblock (MB) level have exhibited good potential for improved coding efficiency when embedded into the H.264 standard as an extra mode. The content-based pattern... more

descriptionView Paper arrow_downwardDownload

Advanced very low bit rate video coding using preferential pattern selection algorithms

by Manoranjan Paul

2022

In the context of very low hit-rate video coding, pre-defined fixed pattern representations of moving regions in blocked-based motion estimation and compensation has become increasingly attractive over H.264 as the former represents an MB... more

descriptionView Paper arrow_downwardDownload

A Potential Heuristic-based Block Matching Algorithms for Motion Estimation in Video Compression

by ASSCOCIATE PROFESSOR TS. DR. HAMIDAH JANTAN JANTAN

2022

Dcomp and Dpsnr Comparison of the MDS and MARPS Algorithms for all Motion Types Figure 1: Performance Comparisons of DS and MDS Algorithms in term of Number of Computations of Foreman Sequence

Figure 2: Performance Comparisons of ARPS and MARPS Algorithms in term of Number of Computations of Claire Sequence

Performance of Block Matching Algorithms in terms of no. of Computations and PSNR of High Motion Type Performance of Block Matching Algorithms in terms of no. of Computations and PSNR of Medium Motion Type

Performance of Block Matching Algorithms in terms of no. of Computations and PSNR of Low Motion Type Table 4 shows the percentage degradation of the number of computations and PSNR values of the proposed MDS and MARPS algorithms as compared to those of DS and ARPS algorithms respectively for all motion type (H, M and L represent for high, medium and low motion type respectively). The sign (-) in Dcomp and Dpsnr indicate a loss values in their performances. MDS has reduced the number of computations by 7.56% for high motion sequence, 0.16% for medium motion sequence and 0.40% for low motion sequence. MARPS is considered as the fastest algorithm because it has greatly reduced the number of computations by 19.32%, 18.76% and 17.96% for low, medium and high motion sequence respectively. MDS and MARPS also provide good PSNR values with small degradation ratio for all motion types.

descriptionView Paper arrow_downwardDownload

A Potential Heuristic-based Block Matching Algorithms for Motion Estimation in Video Compression

by ASSCOCIATE PROFESSOR TS. DR. HAMIDAH JANTAN JANTAN

2022

descriptionView Paper arrow_downwardDownload

CSRN 2901-WSCG 2019 proceedings, Part I

by David Barina

2022

descriptionView Paper arrow_downwardDownload

Rate-distortion analysis of multiview coding in a DIBR framework

by Hamid Pourreza

2022, annals of telecommunications - annales des télécommunications

Depth image based rendering techniques for multiview applications have been recently introduced for efficient view generation at arbitrary camera positions. Encoding rate control has thus to consider both texture and depth data. Due to... more

descriptionView Paper arrow_downwardDownload

A Modified Inter-view Prediction Scheme for Multiview Video Coding to Improve View’s Interactivity

by Ayman Hamdan

2022, Proceedings of the International Conference on Computer Vision Theory and Applications

In this paper, we present a modified interview prediction multiview video coding (MVC) scheme form the perspective of viewer interactivity. This latter requires a high transmission bite-rate, when a viewer requests some view(s).... more

descriptionView Paper arrow_downwardDownload

3D Video Compression

Key research themes

1. How can video coding standards be extended to efficiently compress 3D and multiview video formats?

2. What methods are effective for compressing stereoscopic and multi-picture object (MPO) images leveraging inter-view redundancy?

3. How can compression of 3D point clouds and 3D meshes be optimized for immersive and real-time applications?

4. Can emerging deep learning and diffusion-based methods improve video compression beyond traditional codecs?

Related Topics

All papers in 3D Video Compression