Skip to main content

ISO/IEC 14496-10:2008

Date published:

Information technology — Coding of audio-visual objects — Part 10: Advanced Video Coding

ISO/IEC 14496-10:2008 was developed jointly with the ITU-T in response to the growing need for higher compression of moving pictures for various applications such as digital storage media, television broadcasting, Internet streaming, and real-time audiovisual communication. It is also designed to enable the use of the coded video representation in a flexible manner for a wide variety of network environments. It is designed to be generic in the sense that it serves a wide range of applications, bit rates, resolutions, qualities and services. The use of ISO/IEC 14496-10:2008 allows motion video to be manipulated as a form of computer data and to be stored on various storage media, transmitted and received over existing and future networks and distributed on existing and future broadcasting channels. In the course of creating ISO/IEC 14496-10:2008, requirements from a wide variety of applications have been considered, necessary algorithmic elements have been developed, and these have been integrated into a single syntax. Hence, ISO/IEC 14496-10:2008 will facilitate video data interchange among different applications.

The coded representation specified in the syntax is designed to enable a high compression capability with minimal degradation of image quality. The algorithm is not ordinarily lossless, as the exact source sample values are typically not preserved through the encoding and decoding processes. A number of syntactical features with associated decoding processes are defined that can be used to achieve highly efficient compression, and individual selected regions can be sent without loss. The expected encoding algorithm (not specified in ISO/IEC 14496-10:2008) can select between inter and intra coding for block-shaped regions of each picture. Inter coding uses motion vectors for block-based inter-picture prediction to exploit temporal statistical dependencies between different pictures. Intra coding uses spatial prediction modes to exploit spatial statistical dependencies in the source signal within a single picture. Motion vectors and intra prediction modes may be associated with a variety of block sizes in a picture. The residual signal remaining after intra or inter prediction is then processed using a spatial transform to remove spatial correlation within each transform block. The transformed blocks are then quantised. Quantisation is an irreversible process that forms an approximation that can be represented using a reduced number of bits while incurring some loss of fidelity. Finally, the motion vectors or intra prediction modes are combined with the quantised transform coefficient information and encoded using either context-adaptive variable length codes or context-adaptive binary arithmetic coding.

Annexes A through E and G contain normative requirements and are an integral part of ISO/IEC 14496-10:2008. Annex A defines eleven profiles (Baseline, Main, Extended, High, High 10, High 4:2:2, High 4:4:4 Predictive, High 10 Intra, High 4:2:2 Intra, High 4:4:4 Intra, and CAVLC 4:4:4 Intra), each being tailored to a group of application domains, and also defines levels of capability within each of these profiles. Annex B specifies the syntax and semantics of a byte stream format for delivery of the coded video as an ordered stream of bytes or bits. Annex C specifies the Hypothetical Reference Decoder and its use to check bitstream and decoder conformance. Annex D specifies syntax and semantics for Supplemental Enhancement Information message payloads. Annex E specifies syntax and semantics of the Video Usability Information parameters of coded video sequences. Annex G specifies scalable video coding in three additional profiles (Scalable Baseline, Scalable High, and Scalable High Intra) which enable a coded video bitstream to be structured into layers, such that layered subsets of the bitstream can be independently decodable to provide video quality commensurate with t...

Get this standard Prices exclude GST
PDF ( Single user document)
$118.26 NZD
HardCopy
$152.17 NZD
Networkable PDF
Price varies
Preview only close
Prev {{ page }}/ {{ numPages }} Next
Preview only close
Prev {{ page }}/ {{ numPages }} Next
Pages: 566

Previous versions

Keep me up-to-date

Sign up to receive updates when there are changes to this standard

Related Information

Similar Standards

  • BS EN 61937-10:2011

    Digital audio. Interface for non-linear PCM encoded audio bitstreams applying IEC 60958, Non-linear PCM bitstreams according to the MPEG-4 Audio Lossless Coding (ALS) format

  • BS EN 61937-5:2006

    Digital audio. Interface for non-linear PCM encoded audio bitstreams applying IEC 60958, Non-linear PCM bitstreams according to the DTS (digital theatre systems) format(s)

  • BS EN 61937-6:2006+A1:2014

    Digital audio. Interface for non-linear PCM encoded audio bitstreams applying IEC 60958, Non-linear PCM bitstreams according to the MPEG-2 AAC and MPEG-4 AAC formats

  • BS EN 61937-8:2007

    Digital audio. Interface for non-linear PCM encoded audio bitstreams applying IEC 60958, Non-linear PCM bitstreams according to the Windows Media Audio (WMA) Professional format

Preview only close
Prev {{ page }}/ {{ numPages }} Next
Preview only close
Prev {{ page }}/ {{ numPages }} Next
Pages: 566

Previous versions

ISO/IEC 14496-10:2008

Get this standard Prices exclude GST
PDF ( Single user document)
$118.26 NZD
HardCopy
$152.17 NZD
Networkable PDF
Price varies

Request to add this standard to your subscription

ISO/IEC 14496-10:2008

Price varies
Online library subscription

Your organisation’s Account Administrator must approve a request to add a standard to your subscription.

You may add a comment to the administrator below.

Cancel