Embedded    IP Portfolio    Video Components    SVC

Scalable Video Coding (SVC)

Overview

ITU-T H.264, or AVC video coding, is fast replacing the earlier video standards in the end-to-end chain of video content delivery, personal entertainment, video surveillance, and visual communications. Scalable Video Coding (SVC) is an amendment to H.264 that is aimed at creating an efficient scalable representation of video that can be consumed according to the capabilities of diverse networks and end-terminals.SVC offers temporal (frame-rate), spatial (resolution) and quality (SNR) scalabilities that are built on an AVC-compliant base layer. This ensures backward compatibility and re-use of technologies developed for AVC.


Applications for SVC include

  • Mobile Broadcast - SVC has been provisioned for new mobile broadcasting standards such as ATSC-M / H. This standard employs the spatial scalability aspect.
  • Video surveillance - applications that require multiple resolutions and frame-rates across storage, LAN, and WAN accesses benefit from the spatial and temporal scalability aspects of SVC.
  • Multi-point video conferencing - can leverage SVC across wired / wireless networks with diverse end-terminal capabilities. In addition to resolution and frame-rate scalabilities, quality scalability is also needed to adapt the stream to the available bandwidth that will be time varying. SVC offers the possibility of eliminating additional end-to-end latency currently incurred due to complex transcoding on an MCU.
  • Video streaming and video-on-demand - applications over internet can eliminate the need to encode at multiple resolution, frame-rates, and bit-rates.
  • Home servers can have a single representation from which different versions can be derived without requiring transcoding 

Closed systems and unequal error protection based approaches can benefit from SVC readily. The benefits of SVC will increase as different network elements, such as routers, become SVC aware, sending only the necessary portion of the bitstream over to the next router based on available bandwidth assessed or based on the capabilities of the end-terminals.


Ittiam SVC


Ittiam has developed a rich portfolio of intellectual property (IP) around the H.264 standard over the past many years. This IP includes highly differentiated algorithms in the areas of motion estimation, mode selection, rate control, transcoding, low latency solutions, and error resilience / error propagation control. These IPs are embodied in Ittiam's Reference C model which forms the basis of implementations of H.264 on various platforms.

These IPs for H.264 and other video coding standards, along with additional platform-specific IPs such as multi-threaded designs, multi-core architectures, and optimizations have resulted in efficient codecs realized on multiple programmable platforms (e.g., TI processors such as OMAP3, DM6467) and cores (such as ARM Cortex A8 / A9). These codecs have been licensed to several customers in the areas of entertainment devices (personal, in-flight, in-car), video confere ncing equipments, surveillance netcams / DVRs, and transcoding devices.

Leveraging this extensive video compression and platform implementation expertise, Ittiam has developed a reference C-model for the SVC encoder and decoder. These C models are suitable for directing to implementations on a diverse set of platforms. The SVC encoder and decoder C models can also be used as a starting point for further IP development and differentiation by licensees.

SVC Decoder C Model

SVC Encoder C Model

Features

  • Fully conformant to the Scalable Baseline Profile of ITU-T H.264 Annex G
  • Additionally, supports all tools of the Scalable High Profile (except for interlaced support)
  • Resolution supported up to Level 5
  • ANSI-C implementation (for maximum portability)
  • Modular design
  • Abstracted memory access (to support both DMA-based and cache-based platform ports)
  • Fully leverages the single loop decoding feature of SVC
  • Efficient data structures to reduce external memory bandwidth
  • Optimized control code
  • Target dependency_id, quality_id, and temporal_id can be configured at the API level
  • Supports both Annex-B bytestream format and non-Annex-B NAL unit format as inputs
  • Supports 4:2:0 Planar, 4:2:2(CbYCrY), and 4:2:2 semi-planar(Y plane + Cb-Cr byte interleaved plane) output formats
  • Verified with conformance streams from Baseline, Main, High, and Scalable baseline profiles
  • Bit matches with the latest version of JSVM decoder
  • Extensively validated with internally generated streams
  • Robust to erasures and bit errors in the stream, extensively verified with error streams
  • Supports efficient error concealment
  • APIs suited for embedded platforms (Re-entrant, memory resources requested up-front, support trick-mode / seek, reset, and flush commands); easy to encapsulate as OpenMAX IL APIs with a resource manager
  • Configurable levels of trace file output for debugging and data analysis

Benefits

  • Lower memory footprint, lower cycle count, and reduced external memory bandwidth when ported to a programmable platform
  • Reduced time to port to an embedded platform
  • Easy to partition across multiple cores
  • Exhaustive validation improves the reliability of the solution
  • Flexible and extensible API

Applications

  • Mobile TV decoder
  • Conferencing end-point
  • Gateway (transcode from SVC to other legacy formats)
  • Surveillance DVR or monitoring station
  • Home media terminals

Features

  • Generates bitstreams fully conformant with the Scalable Baseline Profile
  • Support up to Level 4.0
  • Supports configurable number of spatial, temporal, and quality layers
  • Quality scalability also includes support for scan segmenting for medium grain scalability
  • Support for key pictures to control drift
  • Advanced motion estimation with support for motion partitions down to 8x8
  • Support for both dyadic and non-dyadic (1.5x) scaling ratios
  • Supports inter-layer prediction: base_mode, mv_pred, residual_pred, and intra_pred
  • Complexity reduction in enhancement layers through re-use of information from the reference layer
  • Intelligent pruning of coding modes
  • Configurability to trade-off complexity against quality
  • Subjective quality based bit allocation
  • Extension of rate control to scalable encoding to bound the bit-rate of the highest layer
  • Efficient implementation
  • Efficient data structures to minimize external memory bandwidth
  • Modular design
  • Abstraction of memory accesses (to support both DMA and cache based implementations)
  • Includes high quality decimation filters matched with the normative upsampling filters
  • Supports 4:2:0 Planar, 4:2:2(CbYCrY), and 4:2:2 semi-planar(Y plane + Cb-Cr byte interleaved plane) input formats
  • Supports both Annex-B bytestream and non Annex-B NAL unit output formats
  • Algorithms tuned for conferencing application
    • Low latency scalability
    • Slice based encoding for error resilience and packet utilization
    • Exploit region of interest to decrease bit-rate and reduce complexity at the same time
  • Quality benchmarked against JSVM encoder: better than JSVM in the conference mode; within 0.5dB of JSVM encoder at much reduced complexity
  • Validated against JSVM decoder for bit-exact reconstruction
  • APIs suited for embedded platforms (Re-entrant, memory resources requested up-front, reset, and flush commands); easy to encapsulate as OpenMAX IL APIs with a resource manager

Benefits

  • Validated set of encoder algorithms that result in a high quality
  • Unique set of features that are highly relevant for a conferencing grade scalable encoder
  • Complexity can be trade off against quality quickly for a given platform
  • Lower memory footprint, lower cycle count, and reduced external memory bandwidth when ported to a programmable platform
  • Reduced time to port to an embedded platform
  • Easy to partition across multiple cores
  • Flexible and extensible API

Applications

  • Conferencing end-point
  • Gateway (transcode from legacy formats to SVC)
  • Head-end encoder or transcoder
  • IP network camera or Surveillance DVR
  • Home media server

 

 

 
Identity
Embedded
Silicon IP
News
Documentation
Contact Us
© 2010 Ittiam. All Rights Reserved.
Best viewed in 1024x768 or above, IE7, IE8, Google Chrome or Firefox.
Flash Player 8 or above required