MPEG Endorses Video Coding For Machines Movement

MPEG is responding to growing demand for efficient video transmission among machines by re-establishing a dedicated group to investigate use cases, requirements, test conditions, evaluation methodologies, and potential coding technologies.

Called the Video Coding for Machines (VCM) Ad-hoc Group (AhG), the initial focus will be mainly compression efficiency, taking account of the fact that ability to recognize objects quickly and accurately is the goal, rather than enjoyment of the experience. The aim is therefore to seek compression performance greater than that achieved by current or forthcoming codecs for transmission of content to humans, such as Versatile Video Coding (VVC).

This comes when Cisco among others have been predicting that machine-to-machine applications will generate the fastest growth in internet video traffic over the next few years. This means that efficient compression of video data for machine use will be important for competitiveness and also for ensuring there is sufficient capacity for all applications and services, including those streaming to humans.

While the aim with conventional video coding is to compress and then reconstruct whole frames with a view to achieving the most enjoyable perception possible at the target resolution, for machines it is to preserve just critical information. But machines will vary in their requirements and so the focus of research now is to apply AI techniques to adapt compression for specific use cases, with the advantage being that success is somewhat easier to define via testing, or at any rate more direct to establish in the machine case. If the machine can perform its allotted tasks accurately enough, then video will be deemed to have been reconstructed satisfactorily. The objective would be to achieve the lowest bit rate at which performance or safety targets are met, presumably leaving some headroom.

The idea of a new codec called VCM was proposed earlier in August 2019 by China Telecom in conjunction with Gyrfalcon Technology, a developer of AI accelerators. The need had just been recognized after over 40 years of video compression history led by MPEG. The stated aim was to develop vision chips for a variety of sectors in the burgeoning Internet of Things (IoT) arena.

You might also like...

HDR & WCG For Broadcast - The Book

‘HDR & WCG For Broadcast – The Book’ is a multi-article exploration of the science and practical applications of all aspects of High Dynamic Range and Wide Color Gamut within broadcast production.

Monitoring & Compliance In Broadcast: Part 1 - Cloud, Multi-Site & Remote Systems

‘Monitoring & Compliance In Broadcast’ explores how exemplary content production and delivery standards are maintained and legal obligations are met. The series includes four Themed Content Collections, each of which tackles a different area of the media supply chain. Part 1 con…

The Ultimate Compression Technology?

Our resident provocateur Dave Shapton speculates on the nature of compression and its potential future evolutionary path.

Microphones: Part 7 - Microphones For Stereophony

Once the basic requirements for reproducing sound were in place, the most significant next step was to reproduce to some extent the spatial attributes of sound. Stereophony, using two channels, was the first successful system.

Broadcast Standards - The Book

Broadcast Standards – The Book is a unique reference resource for broadcast engineers, operators and system designers. Never before has such a huge body of broadcast industry specific information been collated from international standards bodies and distilled into a single source o…