Name: Learning-based Image/Video Compression
Brand: Future Tech Pavilion, FUTEX
SKU: P0039700007110

:::

recommend

Deep-learning-based Intelligent Driving Assistance System

Lightweight End-to-End Deep Learning Model for Music Source Separation

Deep-learning-based object recognition, behavior,360-degree video SLAM technology for autonomous driving

Home care and intelligent life system based on vision sensor

Trace

Technical Name	Learning-based Image/Video Compression
Project Operator	National Yang Ming Chiao Tung University
Project Host	彭文孝
Summary	Learning-based imagevideo compression is emerging as a promising technology in recent years. This submission includes three major outcomes of our recent study. First, ANFIC presents an end-to-end learned lossy image codec. It features Augmented Normalizing Flows as its backbone, achieving the state-of-the-art performance among the existing learned codecs. Second, our CANF-VC extends ANFIC to video compression based on conditional coding, a new video coding paradigm. It represents the first end-to-end video codec that achieves comparable performance to HEVC. Third, we showcase how reinforcement learning can be leveraged to train an agent that learns to control the encoder parameters without making any change to the codec.
Scientific Breakthrough	ANFIC presents the first attempt to leverage variational autoencoder (VAE)-based compression in a flow-based framework. It advances by stackingextending hierarchically multiple VAE＇s. CANF-VC extends ANFIC to video compression. With recent research on conditional coding showing the sub-optimality of the traditional hybrid-based coding, CANF-VC represents a new attempt that leverages the conditional ANF to learn a video codec for conditional video coding. Lastly, in our reinforcement learning based encoder control framework, we improve upon DDPG algorithm for maximizing reconstruction quality under the rate requirement. All three methods show superior compression efficiency compared to other learned methodstraditional codecs.
Industrial Applicability	Image/video compression is a commercially proven technology. ISO/IEC & ITU-T have standardized several image/video coding technologies, which now find wide applications in social media, cloud gaming, video conferencing, etc. The rising of deep learning is opening up new opportunities for disruptive innovations to revolutionize this seemingly mature sector. Our ANFICCANF-VC represent the state-of-the-art learning-based imagevideo coding technologies in this fast-growing area. In addition, our reinforcement learning-based coding framework is able to improve on existing codecs without making any change to them. It represents the forefront technology in AI-assisted compression.
Keyword	Image compression video compression rate control end-to-end learned compression system AI-assisted compression system reinforcement learning augmented normalizing flow conditional coding codec deep learning

Contact
Wen-Hsiao Peng

Email
wpeng@cs.nycu.edu.tw

Matchmaking

other people also saw