Academia.eduAcademia.edu

Outline

A Technical Overview of VP9--the Latest Open-Source Video Codec

2015, SMPTE Motion Imaging Journal

https://doi.org/10.5594/J18499

Abstract

Google has recently finalized a next generation open-source video codec called VP9, as part of the libvpx repository of the WebM project (http://www.webmproject.org/). Starting from the VP8 video codec released by Google in 2010 as the baseline, various enhancements and new tools were added, resulting in the next-generation bit-stream VP9. The bit-stream was finalized with the exception of essential bug-fixes, in June 2013. Prior to the release however, all technical developments in fact were being conducted openly in the public experimental branch of the repository for many months. This paper provides a brief technical overview of the coding tools included in VP9, along with coding performance comparisons with other state-of-the-art video codecs -namely H.264/AVC and HEVC -on standard test sets. While a completely fair comparison is impossible to conduct because of the limitations of the respective encoder implementations, the tests show VP9 to be quite competitive with main-stream state-of-the-art codecs.

FAQs

sparkles

AI

What coding efficiencies are achieved through VP9's prediction block-size enhancements?add

VP9 introduces super-block sizes up to 64x64, enhancing coding efficiency by leveraging 13 endpoint block sizes, with recursive breakdown down to 4x4.

How does VP9's entropy coding improve decoding performance compared to older codecs?add

VP9 utilizes a hybrid of forward and backward adaptation for entropy coding, allowing efficient symbol tracking with reduced computational burden.

What unique features does VP9 offer for error resilience during video transmission?add

VP9 has an 'error_resilient_mode' that resets entropy contexts at each frame, allowing continued decoding post frame loss, with a marginal performance drop of 4-5%.

How does VP9 accommodate coding at varying resolutions during encoding?add

VP9 supports alternate resolution reference frames, allowing coding at different resolutions on the fly, with scaling achieved during block prediction.

What distinction does VP9's loop filtering provide for different transform block sizes?add

VP9 employs a loop filter to mitigate blocking artifacts efficiently across various transform sizes (4x4 to 32x32), optimizing visual quality at different block sizes.

References (5)

  1. Gary J. Sullivan, Jens-Rainer Ohm, Woo-Jin Han, and Thomas Wiegand, "Overview of the High Efficiency Video Coding (HEVC) Standard," IEEE Trans. on Circuits and Systems for Video Tech., vol. 22, No. 12, Dec 2012.
  2. Thomas Wiegand, Gary J. Sullivan, Gisle Bjøntegaard; Ajay Luthra. "Overview of the H.264/AVC Video Coding Standard," IEEE Trans. on Circuits and Systems for Video Technology, vol. 13 No. 7, Jan 2011.
  3. J. Bankoski, J. Koleszar, L. Quillio, J. Salonen, P. Wilkins, Y. Xu, VP8 Data Format and Decoding Guide, RFC 6386, http://datatracker.ietf.org/doc/rfc6386/
  4. J. Han, A. Saxena, and K. Rose, "Towards jointly optimal spatial prediction and adaptive transform in video/image coding," Proc. IEEE Int. Conf. on Acoustics, Speech and Signal Processing (ICASSP), pp. 726-729, March 2010.
  5. Frank Bossen, Davin Flynn, Karsten Suhring, HM Software Manual, https://hevc.hhi.fraunhofer.de/svn/svn_HEVCSoftware/branches/HM-9.2-dev/doc/software- manual.pdf