MPEG Endorses Video Coding For Machines Movement

MPEG is responding to growing demand for efficient video transmission among machines by re-establishing a dedicated group to investigate use cases, requirements, test conditions, evaluation methodologies, and potential coding technologies.

Called the Video Coding for Machines (VCM) Ad-hoc Group (AhG), the initial focus will be mainly compression efficiency, taking account of the fact that ability to recognize objects quickly and accurately is the goal, rather than enjoyment of the experience. The aim is therefore to seek compression performance greater than that achieved by current or forthcoming codecs for transmission of content to humans, such as Versatile Video Coding (VVC).

This comes when Cisco among others have been predicting that machine-to-machine applications will generate the fastest growth in internet video traffic over the next few years. This means that efficient compression of video data for machine use will be important for competitiveness and also for ensuring there is sufficient capacity for all applications and services, including those streaming to humans.

While the aim with conventional video coding is to compress and then reconstruct whole frames with a view to achieving the most enjoyable perception possible at the target resolution, for machines it is to preserve just critical information. But machines will vary in their requirements and so the focus of research now is to apply AI techniques to adapt compression for specific use cases, with the advantage being that success is somewhat easier to define via testing, or at any rate more direct to establish in the machine case. If the machine can perform its allotted tasks accurately enough, then video will be deemed to have been reconstructed satisfactorily. The objective would be to achieve the lowest bit rate at which performance or safety targets are met, presumably leaving some headroom.

The idea of a new codec called VCM was proposed earlier in August 2019 by China Telecom in conjunction with Gyrfalcon Technology, a developer of AI accelerators. The need had just been recognized after over 40 years of video compression history led by MPEG. The stated aim was to develop vision chips for a variety of sectors in the burgeoning Internet of Things (IoT) arena.

You might also like...

Innovating The Interactive Sports Fan Experience - Monumental Sports Network Are Early Adopters

As we continue our dive into the new frontier of Interactive Rights we explore the first steps taken by an early adopter. Monumental Sports Network in Washington are far from implementing a complete portfolio of interactive enhancements to their broadcasts…

Monitoring & Compliance In Broadcast: Monitoring Cloud Infrastructure

If we take cloud infrastructures to their extreme, that is, their physical locality is unknown to us, then monitoring them becomes a whole new ball game, especially as dispersed teams use them for production.

Neutral TV Operating Systems

TV OSs have become pivotal to both smart TVs and streaming services as consumers continue to cut the cord. There is growing interest not just among TV makers but also major streaming and advertising platforms in neutral TV OSs independent…

Sweden’s Accelerating Journey From DTT To OTT

2025 may be a watershed year for the broadcast delivery switchover from DTT (Digital Terrestrial Television) to IP (Internet Protocol). We know that an increasing number of viewers of broadcaster content are turning to their favorite streaming apps to watch the…

Microphones: Part 9 - The Science Of Stereo Capture & Reproduction

Here we look at the science of using a matched pair of microphones positioned as a coincident pair to capture stereo sound images.