# Awesome 🎉Deep Learning Based Video Compression

[](https://github.com/sindresorhus/awesome)

# Contents (After June 2024)
- [Generative compression](#Generative)
- [Architecture](#ar)
- [VCM & Feature Compression](#VCM)
- [Rate Control & Variable rate](#RateControl)
- [Implicit neural representation](#implicit)
- [Low Complexity & Speed](#lowcomplexity)
- [Motion & Prediction](#motion)
- [Benchmark & Dataset & Survey](#bmk)

# Group by time (Before June 2024)
- [2024](#2024)
- [2023](#2023)
- [2022](#2022)
- [2021](#2021)
- [2020](#2020)
- [2019](#2019)
- [2018](#2018)
- [2017](#2017)

------

### Generative compression

| Title | Pub. & Date |
|:-----|:-----|
| [Ultra-Low Bitrate Face Video Compression Based on Conversions from 3D Keypoints to 2D Motion Map](http://arxiv.org/abs/2210.03335v1) | TIP 2024 |
| [Diffusion-based Perceptual Neural Video Compression with Temporal Diffusion Information Reuse](http://arxiv.org/abs/2501.13528v1) | arXiv 2025 |
| [Generative Human Video Compression with Multi-granularity Temporal Trajectory Factorization](https://arxiv.org/abs/2410.10171) | arXiv 2024 |
| [Beyond GFVC: A Progressive Face Video Compression Framework with Adaptive Visual Tokens](https://arxiv.org/pdf/2410.08485) | arXiv 2024 |
| [Multi-Reference Generative Face Video Compression with Contrastive Learning](https://arxiv.org/pdf/2409.01029) | arXiv 2024 |
| [When Video Coding Meets Multimodal Large Language Models: A Unified Paradigm for Video Coding](https://arxiv.org/pdf/2408.08093) | arXiv 2024 |
| [CodingHomo: Bootstrapping Deep Homography with Video Coding](https://ieeexplore.ieee.org/document/10570492/authors#authors) | TCSVT 2024 |
| [I2VC: A Unified Framework for Intra- & Inter-frame Video Compression](https://arxiv.org/pdf/2405.14336) | arXiv 2024 |
| [PredToken: Predicting Unknown Tokens and Beyond with Coarse-to-Fine Iterative Decoding](https://openaccess.thecvf.com/content/CVPR2024/papers/) | arXiv 2024 |
| [SMC++: Masked Learning of Unsupervised Video Semantic Compression](https://arxiv.org/pdf/2406.04765/) | arXiv 2024 |

### Architecture

| Title | Pub. & Date |
|:-----|:-----|
| [An image to tailor: I-Frame Domain Adaptation in Neural Video Compression](https://openreview.net/pdf?id=6AU7JglYSV) | NeurIPSW 2024 |
| [Adaptive Surveillance Video Compression With Background Hyperprior](http://arxiv.org/abs/2001.06590v3) | SPL 2024 |
| [Hybrid Scalable Video Coding with Neural Compression and Enhancement for Streaming Media](http://arxiv.org/abs/2107.05548v2) | ACM MM 2024 |
| [End-to-end Deep Video Compression Based on Hierarchical Temporal Context Learning](http://arxiv.org/abs/2204.11723v1) | TMM 2025 |
| [Motion Free B-frame Coding for Neural Video Compression](http://arxiv.org/abs/2309.13835v2) | arXiv 2024 |
| [GSVC: Efficient Video Representation and Compression Through 2D Gaussian Splatting](https://arxiv.org/abs/2501.12060) | arXiv 2025 |
| [ECVC: Exploiting Non-Local Correlations in Multiple Frames for Contextual Video Compression](https://arxiv.org/pdf/2410.09706) | arXiv 2024 |
| [Joint Source-Channel Optimization for UAV Video Coding and Transmission](https://arxiv.org/pdf/2408.06667) | arXiv 2024 |
| [Free-VSC: Free Semantics from Visual Foundation Models for Unsupervised Video Semantic Compression](https://arxiv.org/pdf/2409.11718) | ECCV 2024 |
| [VQ-DeepVSC: A Dual-Stage Vector Quantization Framework for Video Semantic Communication](https://arxiv.org/pdf/2409.03393) | arXiv 2024 |
| [Spatio-temporal convolutional neural network for enhanced inter prediction in video coding](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10648618) | TIP 2024 |
| [NVC-1B: A Large Neural Video Coding Model](https://arxiv.org/pdf/2407.19402) | arXiv 2024 |
| [Bi-Directional Deep Contextual Video Compression](https://arxiv.org/pdf/2408.08604) | arXiv 2024 |

### Video Coding for Machines & Feature compression

| Title | Pub. & Date |
|:-----|:-----|
| [Parameter-efficient instance-adaptive neural video compression](https://arxiv.org/abs/2405.08530) | ACCV 2024 |
| [DT-JRD: Deep Transformer based Just Recognizable Difference Prediction Model for Video Coding for Machines](None) | arXiv 2024 |
| [RL-RC-DoT: A Block-level RL agent for Task-Aware Video Compression](None) | arXiv 2025 |
| [Learned Multimodal Compression for Autonomous Driving](https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=10743791&casa_token=z5NqhvgXTigAAAAA:nOQNaJ2qxq5Cpr8a6ViW4up32c_fLiiLw5tHbDXsoYSWWL-xN3Xz4mdyvIR87gbjqHhoxbA) | MMSP 2024 |
| [DMVC: Multi-Camera Video Compression Network aimed at Improving Deep Learning Accuracy](https://arxiv.org/pdf/2410.18400) | arXiv 2024 |
| [Picture Partitioning Design of Neural Network-Based Intra Coding For Video Coding For Machines](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10647747) | ICIP 2024 |
| [ROI-DVC: A Region-of-Interest Based Deep Video Coding Framework](https://arxiv.org/pdf/2203.01978) | ICIP 2024 |
| [Free-VSC: Free Semantics from Visual Foundation Models for Unsupervised Video Semantic Compression](https://arxiv.org/pdf/2409.11718) | arXiv 2024 |
| [On Annotation-free Optimization of Video Coding for Machines](https://arxiv.org/pdf/2406.07938) | arXiv 2024 |
| [Competitive Learning for Achieving Content-specific Filters in Video Coding for Machines](https://arxiv.org/pdf/2406.12367) | arXiv 2024 |
| [Deep Video Compression with Conditional Feature Coding](https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=10566367&casa_token=YrKZPj4xYCAAAAAA:w528F9C1IXPH1oYU0Tlcuviqv6MKIKcmoqduJayXaOE8mjgiNPnR8R54M86AH-SKI1B0ilQ) | PCS 2024 |

### Rate Control & Variable rate

| Title | Pub. & Date |
|:-----|:-----|
| [Perception Loss Function Adaptive to Rate for Learned Video Compression](https://openreview.net/forum?id=XQN2sBSjuQ&referrer=%5Bthe%20profile%20of%20Buu%20Phan%5D(%2Fprofile%3Fid%3D~Buu_Phan3)) | NeurIPS 2024 |
| [Content-Adaptive Rate Control Method for User-Generated Content Videos](http://arxiv.org/abs/2412.18834v1) | TCSVT 2024 |
| [Adaptive Rate Control for Deep Video Compression with Rate-Distortion Prediction](http://arxiv.org/abs/2412.18834v1) | arXiv 2024 |
| [Content-adaptive Variable Resolution Framework for Intra Coding](https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=10558245&casa_token=lukOUFOmaRQAAAAA:a33FUL_WUKxotgsitmwzYzHZNdeadpHVKFWmOWBqsu_2zQDjFPa8rNJEHrhaCdeqXxJ0cmk) | ISCAS 2024 |
| [Deep Video Codec Control for Vision Models](https://openaccess.thecvf.com/content/CVPR2024W/AI4Streaming/papers/Reich_Deep_Video_Codec_Control_for_Vision_Models_CVPRW_2024_paper.pdf) | CVPRW 2024 |

### Implicit Neural Representation

| Title | Pub. & Date |
|:-----|:-----|
| [SNeRV: Scalable Neural Representations for Video Coding](https://openreview.net/pdf?id=ZqN4bnXSSY) | NeurIPSW 2024 |
| [HFS-HNeRV: High-Frequency Spectrum Hybrid Neural Representation for Videos](None) | ACM MM 2024 |
| [High-Frequency Enhanced Hybrid Neural Representation for Video Compression](http://arxiv.org/abs/2410.01654v2) | arXiv 2024 |
| [NVRC: Neural Video Representation Compression](https://arxiv.org/pdf/2409.07414) | arXiv 2024 |
| [PNVC: Towards Practical INR-based Video Compression](https://arxiv.org/pdf/2409.00953) | arXiv 2024 |
| [High-Frequency Enhanced Hybrid Neural Representation for Video Compression](https://arxiv.org/pdf/2411.06685) | arXiv 2024 |
| [Fast Encoding and Decoding for Implicit Video Representation](https://link.springer.com/chapter/10.1007/978-3-031-72933-1_23) | ECCV 2024 |
| [QS-NeRV: Real-Time Quality-Scalable Decoding with Neural Representation for Videos](https://openreview.net/pdf?id=vJbyT9bYgf) | ACM MM 2024 |
| [Temporal Enhanced Hybrid Neural Representation for Video Compression](https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=10566352&casa_token=Wj65ixeg3vgAAAAA:xqiVqUfQr-OJI5dKukLPbTE3wgpw9BpJYrywM999ul_5BKUBwI-Cwne-YQHiDU5bxbGcEt8) | PCS 2024 |
| [Combining Frame and GOP Embeddings for Neural Video Representation](https://openaccess.thecvf.com/content/CVPR2024/papers/Saethre_Combining_Frame_and_GOP_Embeddings_for_Neural_Video_Representation_CVPR_2024_paper.pdf) | CVPR 2024 |

### Low Complexity & Speed

| Title | Pub. & Date |
|:-----|:-----|
| [Real-Time Semantic Video Communication of General Scenes](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10647591) | ICIP 2024 |
| [Accelerating Learned Video Compression via Low-Resolution Representation Learning](https://arxiv.org/pdf/2407.16418) | arXiv 2024 |
| [Standard compliant video coding using low complexity, switchable neural wrappers](https://arxiv.org/pdf/2407.07395) | arXiv 2024 |

### Motion & Prediction

| Title | Pub. & Date |
|:-----|:-----|
| [CodingHomo: Bootstrapping Deep Homography With Video Coding](https://ieeexplore.ieee.org/document/10570492) | TCSVT 2024 |
| [Deep Video Compression with Scaled Hierarchical Bi-directional Motion Model](https://dl.acm.org/doi/pdf/10.1145/3664647.3685524?casa_token=YML8Fy3tKDwAAAAA:3rMIk_MV86yMzc_U6FV7cl3mXydMFhWiQTFl5qetd2czGsGPHvXlhlmxXxNyLshHPn_Ui0dVk0U) | ACM MM 2024 |
| [Multi-Scale Motion Alignment and Frame Reconstruction for Efficient Deep Video Compression](https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=10636206) | SPL 2024 |
| [High-Efficiency Neural Video Compression via Hierarchical Predictive Learning](https://arxiv.org/pdf/2410.02598) | arXiv 2024 |
| [Spatial Neighbor Information Assisted Motion Compensated Temporal Filter for Video Coding](https://ieeexplore.ieee.org/document/10566466) | PCS 2024 |

### Benchmark & Dataset & Survey

| Title | Pub. & Date |
|:-----|:-----|
| [Human-Machine Collaborative Image and Video Compression: A Survey](https://www.vmsci.com/en/articles/5959385__HumanMachine_Collaborative_Image_and_Video_Compression_A_Survey) | ATSIP 2024 |
| [USTC-TD: A Test Dataset and Benchmark for Image and Video Coding in 2020s](https://arxiv.org/pdf/2409.08481) | arXiv 2024 |

## ✔2024 [ «🎯Back To Top» ](#)

- (CVPRW 2024) **Deep Video Codec Control for Vision Models** Reich C, Debnath B, Patel D, et al.
[paper](https://openaccess.thecvf.com/content/CVPR2024W/AI4Streaming/papers/Reich_Deep_Video_Codec_Control_for_Vision_Models_CVPRW_2024_paper.pdf) 142 | 143 | 144 | - (ToMM 2024) **Learned Video Compression with Adaptive Temporal Prior and Decoded Motion-aided Quality Enhancement** Yang, Jiayu and Yang, Chunhui and Xiong, Fei and Zhai, Yongqi and Wang, Ronggang[paper](https://dl.acm.org/doi/pdf/10.1145/3661824) 145 | 146 | - (Trans Broadcasting 2024) **Depth Video Inter Coding Based on Deep Frame Generationl**Li, Ge and Lei, Jianjun and Pan, Zhaoqing and Peng, Bo and Ling, Nam[paper](https://ieeexplore.ieee.org/abstract/document/10485621) 147 | 148 | - (ICASSP 2024) **Rate-Quality Based Rate Control Model for Neural Video Compression**Liao, Shuhong and Jia, Chuanmin and Fan, Hongfei and Yan, Jingwen and Ma, Siwei[paper](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10447777) 149 | - (ICASSP 2024) **Learned Video Compression with Spatial-Temporal Optimization** Wang, Yiming and Huang, Qian and Tang, Bin and Liu, Wenting and Shan, Wenchao and Xu, Qian[paper](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10446198) 150 | - (ICASSP 2024) **Region-Adaptive Video Sharpening Via Rate-Perception Optimization** Pang, Yingxue and Zhao, Shijie and Guo, Mengxi and Li, Junlin and Zhang, Li [paper](https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=10446929&casa_token=maZ7QZiRtLIAAAAA:fY0Ate5C0-QcGVKcfgigdZor4FuBS-RY2l5XWgEP_EIKoNU9VkDKqyJ-3vUmPvDdtV3NBOwOCQ) 151 | - (ICASSP 2024) **Leveraging Redundancy in Feature for Efficient Learned Image Compression** Qin, Peng and Bao, Youneng and Meng, Fanyang and Tan, Wen and Li, Chao and Wang, Genhong and Liang, Yongsheng [paper](https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=10447424&casa_token=LvyB8nyNgq4AAAAA:P8Bl52gCJRLfhbJZeqx77XpBpjW59ptF5lBbU3jRFQnr8MuhqrIuwsS7mtd63Hcz7iDayZK2Kw) 152 | - (ICASSP 2024) **A Tri-Dynamic Preprocessing Framework for UGC Video Compression** Zhao, Fei and Guo, Mengxi and 
Zhao, Shijie and Li, Junlin and Zhang, Li and Xie, Xiaodong [paper](https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=10445837&casa_token=1Gg25Sz6eJkAAAAA:rQRn-jysJ7-nDKslGAHIzSUJCeHKN-90xegmQsv5o-HGRqkAiVuEE9nhWV-qzlOICuKL17vCHQ) 153 | - (ICASSP 2024) **Improving Learned Video Compression by Exploring Spatial Redundancy** Yang, Jiayu and Yang, Chunhui and Zhai, Yongqi and Wang, Qi and Pan, Xinghao and Wang, Ronggang [paper](https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=10448496&casa_token=ladLsmyfXJMAAAAA:wPVZPmeZ260DtCgmKXy9smwYmZR4x6yhKpMnRHgkTVuZZoMqhO-cbvktMJvFdnWXW5vxak1BXA) 154 | - (ICASSP 2024) **Learned Video Compression with Spatial-Temporal Optimization** Wang, Yiming and Huang, Qian and Tang, Bin and Liu, Wenting and Shan, Wenchao and Xu, Qian [paper](https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=10446198&casa_token=yXWdEXN4tkMAAAAA:o-lFjdjZZ2mSXCTXzEuhN7z6w37arL3vMauGcMJwRftbBqu_XnunaByqAcmf3VXpqsXePQJowQ) 155 | 156 | 157 | - (WCACV 2024) **MobileNVC: Real-time 1080p Neural Video Compression on a Mobile Device** van Rozendaal, Ties and others [paper](https://arxiv.org/pdf/2310.01258.pdf) 158 | 159 | - (TPAMI 2024) **VNVC: A Versatile Neural Video Coding Framework for Efficient Human-Machine Vision** Sheng, Xihua and Li, Li and Liu, Dong and Li, Houqiang [paper](https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=10411051&casa_token=SATXNmT8_RUAAAAA:lIcbJH933NSAAoZP_IGqEPJy6dkK3J9soKCjCWGLZa-qRS8m5oJR4Tcy9XADF2ssEzSfsdvM) 160 | - (TPAMI 2024) **A Coding Framework and Benchmark towards Low-Bitrate Video Understanding** Tian, Yuan and Lu, Guo and Yan, Yichao and Zhai, Guangtao and Chen, Li and Gao, Zhiyong [paper](https://ieeexplore.ieee.org/iel7/34/4359286/10440520.pdf) 161 | 162 | 163 | - (TIP 2024) **Cross-Component Prediction Boosted With Local and Non-Local Information in Video Coding** Zhang, Kai and Deng, Zhipin and Zhang, Li [paper](https://ieeexplore.ieee.org/document/10413275) 164 | 165 | - (TCSVT 2024) **Exploiting 
Bidirectional Quality Impulse for Reference Picture Resampled Gaming Video Coding** Fang, Xiaohan and Chen, Peilin and Wang, Meng and Xie, Xi and Wang, Shiqi and Wang, Shanshe and Ma, Siwei [paper](https://ieeexplore.ieee.org/document/10477392/) 166 | 167 | - (TCSVT 2024) **Spatial Decomposition and Temporal Fusion based Inter Prediction for Learned Video Compression** Becking, Daniel and M{\"u}ller, Karsten and Haase, Paul and Kirchhoffer, Heiner and Tech, Gerhard and Samek, Wojciech and Schwarz, Heiko and Marpe, Detlev and Wiegand, Thomas [paper](https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=10412190) 168 | 169 | 170 | - (Arxiv 2024) **Efficient Learned Wavelet Image and Video Coding**Meyer, Anna and Prativadibhayankaram, Srivatsa and Kaup, Andre[paper](https://arxiv.org/pdf/2405.12631) 171 | - (Arxiv 2024) **Group-aware Parameter-efficient Updating for Content-Adaptive Neural Video Compression** Chen, Zhenghao and Zhou, Luping and Hu, Zhihao and Xu, Dong[paper](https://arxiv.org/pdf/2405.04274) 172 | - (Arxiv 2024) **Parameter-Efficient Instance-Adaptive Neural Video Compression** Yang, Hyunmo and Oh, Seungjun and Park, Eunbyung[paper](https://arxiv.org/pdf/2405.08530) 173 | 174 | - (Arxiv 2024) **Task-Aware Encoder Control for Deep Video Compression**Ge, Xingtong and Luo, Jixiang and Zhang, Xinjie and Xu, Tongda and Lu, Guo and He, Dailan and Geng, Jing and Wang, Yan and Zhang, Jun and Qin, Hongwei[paper](https://arxiv.org/pdf/2404.04848.pdf) 175 | - (Arxiv 2024) **Image and Video Compression using Generative Sparse Representation with Fidelity Controls**Jiang, Wei and Wang, Wei[paper](https://arxiv.org/pdf/2404.06076.pdf) 176 | - (Arxiv 2024) **A Perspective on Deep Vision Performance with Standard Image and Video Codecs**Reich, Christoph and Hahn, Oliver and Cremers, Daniel and Roth, Stefan and Debnath, Biplob[paper](https://arxiv.org/pdf/2404.12330) 177 | 178 | - (Arxiv 2024) **Group-aware Parameter-efficient Updating for Content-Adaptive Neural Video 
Compression** Chen, Zhenghao and Zhou, Luping and Hu, Zhihao and Xu, Dong[paper](https://arxiv.org/pdf/2405.04274) 179 | - (Arxiv 2024) **CGVC-T: Contextual Generative Video Compression with Transformers** Du, Pengli and Liu, Ying and Ling, Nam[paper](https://ieeexplore.ieee.org/abstract/document/10496072?casa_token=W5EwodaNwdMAAAAA:Ki0F2KsNW7G3tB-_Qp3r92XBObDsMoHn5gQszGarpIfsrs57pHMu9Cx7rrl3nyh-Mu_YbLdDWg) 180 | - (Arxiv 2024) **Low-Latency Neural Stereo Streaming** Hou, Qiqi and Farhadzadeh, Farzad and Said, Amir and Sautiere, Guillaume and Le, Hoang[paper](https://arxiv.org/pdf/2403.17879.pdf) 181 | 182 | - (Arxiv 2024) **Analysis of Neural Video Compression Networks for 360-Degree Video Coding** Regensky, Andy and Brand, Fabian and Kaup, Andr{\'e}[paper](https://arxiv.org/pdf/2402.10257.pdf) 183 | 184 | 185 | - (Arxiv 2024) **Extreme Video Compression with Pre-trained Diffusion Models** Li, Bohan and Liu, Yiming and Niu, Xueyan and Bai, Bo and Deng, Lei and G{\"u}nd{\"u}z, Deniz [paper](https://arxiv.org/pdf/2402.08934v1.pdf) 186 | 187 | - (Arxiv 2024) **Boosting Neural Representations for Videos with a Conditional Decoder** Zhang, Xinjie and Yang, Ren and He, Dailan and Ge, Xingtong and Xu, Tongda and Wang, Yan and Qin, Hongwei and Zhang, Jun [paper](https://arxiv.org/pdf/2402.18152v1.pdf) 188 | 189 | - (Arxiv 2024) **Optimal Quality and Efficiency in Adaptive Live Streaming with JND-Aware Low latency Encoding** Menon, Vignesh V and Zhu, Jingwen and Rajendran, Prajit T and Afzal, Samira and Schoeffmann, Klaus and Callet, Patrick Le and Timmerer, Christian [paper](https://arxiv.org/pdf/2401.15343.pdf) 190 | 191 | - (Arxiv 2024) **VQ-NeRV: A Vector Quantized Neural Representation for Videos** Xu, Yunjie and Feng, Xiang and Qin, Feiwei and Ge, Ruiquan and Peng, Yong and Wang, Changmiao[paper](https://arxiv.org/pdf/2403.12401v1.pdf) 192 | - (Arxiv 2024) **Low-Rate, Low-Distortion Compression with Wasserstein Distortion** Qiu, Yang and Wagner, Aaron B 
[paper](https://arxiv.org/pdf/2401.16858.pdf) 193 | - (Arxiv 2024) **LVC-LGMC: Joint Local and Global Motion Compensation for Learned Video Compression** Jiang, Wei and Li, Junru and Zhang, Kai and Zhang, Li[paper](https://arxiv.org/pdf/2402.00680.pdf) 194 | - (Arxiv 2024) **Immersive Video Compression using Implicit Neural Representations** Kwan, Ho Man and Zhang, Fan and Gower, Andrew and Bull, David[paper](https://arxiv.org/pdf/2402.01596.pdf) 195 | - (Arxiv 2024) **Cool-chic video: Learned video coding with 800 parameters** Leguay, Thomas and Ladune, Th{\'e}o and Philippe, Pierrick and D{\'e}forges, Olivier[paper](https://arxiv.org/pdf/2402.03179.pdf) 196 | - (Arxiv 2024) **A Neural-network Enhanced Video Coding Framework beyond ECM** Zhao, Yanchen and He, Wenxuan and Jia, Chuanmin and Wang, Qizhe and Li, Junru and Li, Yue and Lin, Chaoyi and Zhang, Kai and Zhang, Li and Ma, Siwei [paper](https://arxiv.org/pdf/2402.08397.pdf) 197 | - (Arxiv 2024) **Motion-Adaptive Inference for Flexible Learned B-Frame Compression** Akin Yilmaz, M and Ugur Ulas, O and Bilican, Ahmet and Murat Tekalp, A [paper](https://arxiv.org/pdf/2402.08550.pdf) 198 | - (Arxiv 2024) **Analysis of Neural Video Compression Networks for 360-Degree Video Coding** Regensky, Andy and Brand, Fabian and Kaup, Andr{\'e} [paper](https://arxiv.org/pdf/2402.10257.pdf) 199 | 200 | 201 | 202 | 203 | - (VICP 2024) **High-Fidelity Free-View Talking Head Synthesis for Low-Bandwidth Video Conference** Zhang, Zhiyu and Tang, Anni and Zhu, Chen and Lu, Guo and Xie, Rong and Song, Li [paper](https://arxiv.org/pdf/2401.16858.pdf) 204 | 205 | - (MMM 2024) **Hierarchical Bi-directional Temporal Context Mining for Improved Video Compression** Lin, Zijian and Luo, Jianping [paper](https://link.springer.com/chapter/10.1007/978-3-031-53305-1_31) 206 | 207 | 208 | 209 | --- 210 | 211 | 212 | ## ✔2023 [ «🎯Back To Top» ](#) 213 | 214 | - (NeurIPS 2023) **HiNeRV: Video Compression with Hierarchical Encoding based Neural 
Representation** Kwan, Ho Man and Gao, Ge and Zhang, Fan and Gower, Andrew and Bull, David [paper](https://arxiv.org/pdf/2306.09818.pdf) [code](https://hmkx.github.io/hinerv/) 215 | 216 | 217 | - (TPAMI 2023) **Compressed-SDR to HDR Video Reconstruction** Wang, Hu and Ye, Mao and Zhu, Xiatian and Li, Shuai and Li, Xue and Zhu, Ce [paper](https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=10373884&casa_token=0YwYzQr-mtIAAAAA:fRzSWMZHeYm4f4lBNIZFowtNF9ZZxh5Lm7B36KGipe_6I1WgOWB50iqbNvOWdEP4tSU5DH8JIw) 218 | - (TIP 2023) **Sur-driven video coding rate control for jointly optimizing perceptual quality and buffer control** Yang, Zetao and Gao, Wei and Li, Ge and Yan, Yiqiang [paper](https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=10266980&casa_token=AgyJU8S3bVUAAAAA:y7CRQnfjtPHN4HBuayf_xeY5QlAMdhhNlCO6CnNUqhMuwMuqyzhuBG52CxQAEphl8_0nfzXx) 219 | - (Trans BROADCASTING 2023) **Virtual-Competitors-Based Rate Control for 360-Degree Video Coding** Lin, Jielian and Lin, Hongbin and Xu, Yiwen and Kang, Yuanxun and Zhao, Tiesong [paper](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10341540) 220 | 221 | - (Neurocomputing 2023) **Multiple Hypotheses Based Motion Compensation for Learned Video Compression** Lin, Rongqun and Wang, Meng and Zhang, Pingping and Wang, Shiqi and Kwong, Sam [paper](https://www.sciencedirect.com/science/article/pii/S0925231223005192?casa_token=4V7fGASs-pYAAAAA:8Lk-HCwftOicqBzj2F6i3YVJCOd9MUnokVmDANZRU1D7mwIPauX_pAAcCaMqiVWKCNzkwFSp) 222 | 223 | 224 | - (ACMMM 2023) **High Visual-Fidelity Learned Video Compression** Li, Meng and Shi, Yibo and Wang, Jing and Huang, Yunqi [paper](https://arxiv.org/pdf/2310.04679.pdf) 225 | - (ACMMM 2023) **DeepSVC: Deep Scalable Video Coding for Both Machine and Human Vision** Li, Meng and Shi, Yibo and Wang, Jing and Huang, Yunqi [paper](https://dl.acm.org/doi/10.1145/3581783.3612500) 226 | - (ACMMM 2023) **Neural Video Compression with Spatio-Temporal Cross-Covariance Transformers** Chen, Zhenghao and 
Relic, Lucas and Azevedo, Roberto and Zhang, Yang and Gross, Markus and Xu, Dong and Zhou, Luping and Schroers, Christopher [paper](https://studios.disneyresearch.com/app/uploads/2023/09/Neural-Video-Compression-with-Spatio-Temporal-Cross-Covariance-Transformers-Paper.pdf) 227 | - (ACMMM 2023) **Peering into The Sketch: Ultra-Low Bitrate Face Compression for Joint Human and Machine Perception** Mao, Yudong and Chen, Peilin and Wang, Shurun and Wang, Shiqi and Wu, Dapeng [paper](https://dl.acm.org/doi/pdf/10.1145/3581783.3613799?casa_token=CE_6kUxeIREAAAAA:xmZFIQJFRKkIZE1VBMyq3npr-gzcJQ4cyJAHDNPivRjQZJ4jcpy5MfJO9WkRIwpFwwBR_11yH7gZkg) 228 | 229 | 230 | 231 | 232 | - (TMM 2023) **End-to-End Distortion Modeling for Error-Resilient Screen Content Video Coding** Tang, Tong and Yin, Zhiyang and Li, Jie and Wang, Honggang and Wu, Dapeng and Wang, Ruyan [paper](https://ieeexplore.ieee.org/abstract/document/10285532) 233 | 234 | - (TMM 2023) **Learning to Predict Object-Wise Just Recognizable Distortion for Image and Video Compression** Zhang, Yun and Lin, Haoqin and Sun, Jing and Zhu, Linwei and Kwong, Sam [paper](https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=10349945&casa_token=zhkj56wMmBQAAAAA:393OXq2npAAaHU5t6XW2D2o-By7m80ucHcQPUPG9tGAf2D78ibRSD-dnKhFrhilz2CNNc78K8g) 235 | 236 | - (TMM 2023) **Enhanced Context Mining and Filtering for Learned Video Compression** Guo, Haifeng and Kwong, Sam and Ye, Dongjie and Wang, Shiqi [paper](https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=10254316&casa_token=D3_YJB3zg1YAAAAA:8BqE8OyoHJrRtmMj4tshdGApp4sKsOG11-rREhnW-0WhLDWJZ43MFiFsbsKdiNMVr6lX8zz-) 237 | - (TMM 2023) **Content-adaptive Rate-Distortion Modeling for Frame-level Rate Control in Versatile Video Coding** Liao, Junqi and Li, Li and Liu, Dong and Li, Houqiang [paper](https://ieeexplore.ieee.org/iel7/6046/4456689/10413636.pdf?casa_token=TdPkR5FGB4oAAAAA:Nu-0r0eo2oZwYNHrheWWHhY0XciQvsuO9a9lX-DDhpYLGgowapfbLMALVrkFzq7Omro025xV) 238 | 239 | 240 | 241 | - (TOMM 2023) 
**Principal Component Approximation Network for Image Compression** Zhang, Shupei and Zhao, Chenqiu and Basu, Anup [paper](https://dl.acm.org/doi/abs/10.1145/3637490) 242 | 243 | - (ICCV 2023) **Non-Semantics Suppressed Mask Learning for Unsupervised Video Semantic Compression** Abdulmotaleb El{-}Saddik and Tao Mei and Rita Cucchiara and Marco Bertini and Diana Patricia Tobon Vallejo and Pradeep K. Atrey and M. Shamim Hossain [paper](https://scholar.googleusercontent.com/scholar.bib?q=info:uv3Zxyq_gOkJ:scholar.google.com/&output=citation&scisdr=ClHXww84EOfu32LPvEI:AFWwaeYAAAAAZcHJpEI-LTSd7LD2FMwPd1sdahA&scisig=AFWwaeYAAAAAZcHJpOVN6Jgku4ecVTjwiEMA-e4&scisf=4&ct=citation&cd=-1&hl=zh-CN) 244 | 245 | - (ICIP 2023) **FGC-VC: Flow-Guided Context Video Compression** Wang, Yiming and Huang, Qian and Tang, Bin and Sun, Huashan and Guo, Xiaotong [paper](https://ieeexplore.ieee.org/abstract/document/10222501) 246 | - (ICIP 2023) **Block-Based Motion Estimation for Deep-Learned Video Coding** S. Pientka, M. Schäfer, J. Pfaff, H. Schwarz, D. Marpe and T. 
Wiegand [paper](https://ieeexplore.ieee.org/document/10222411) 247 | - (ICIP 2023) **Learned Image Compression with Large Capacity and Low Redundancy of Latent Representation** Meng, Xiandong and Zhu, Shuyuan and Ma, Siwei and Zeng, Bing [paper](https://ieeexplore.ieee.org/abstract/document/10222381) 248 | - (ICIP 2023) **Multi-scale deformable alignment and content-adaptive inference for flexible-rate bi-directional video compression** Y{\i}lmaz, M Ak{\i}n and Ulas, O Ugur and Tekalp, A Murat [paper](https://arxiv.org/pdf/2306.16544.pdf) 249 | - (ICIP 2023) **Machine-Attention-based Video Coding for Machines** Lee, Yegi and Kim, Shin and Yoon, Kyoungro and Lim, Hanshin and Kwak, Sangwoon and Choo, Hyon-Gon [paper](https://ieeexplore.ieee.org/abstract/document/10222037) 250 | - (ICIP 2023) **Predictive Coding for Animation-Based Video Compression** Konuko, Goluck and Lathuili{\`e}re, St{\'e}phane and Valenzise, Giuseppe [paper](https://arxiv.org/pdf/2307.04187.pdf) 251 | - (ICIP 2023) **Blurry Video Compression: A Trade-Off Between Visual Enhancement and Data Compression** Argaw, Dawit Mureja and Kim, Junsik and Kweon, In So [paper](https://openaccess.thecvf.com/content/WACV2024/papers/Argaw_Blurry_Video_Compression_A_Trade-Off_Between_Visual_Enhancement_and_Data_WACV_2024_paper.pdf) 252 | - (TCSVT 2023) **End-to-end learnable multi-scale feature compression for vcm** Kim, Yeongwoong and Jeong, Hyewon and Yu, Janghyun and Kim, Younhee and Lee, Jooyoung and Jeong, Se Yoon and Kim, Hui Yong [paper](https://arxiv.org/pdf/2306.16670.pdf) 253 | - (TCSVT 2023) **Camera Pose-Based Background Modeling for Video Coding in Moving Cameras** Fang, Zheng and Zheng, Mingkui and Chen, Pingping and Chen, Zhifeng and Wu, Dapeng Oliver [paper](https://ieeexplore.ieee.org/abstract/document/10261273) 254 | - (TCSVT 2023) **Sparse-to-Dense: High Efficiency Rate Control for End-to-end Scale-Adaptive Video Coding** Chen, Jiancong and Wang, Meng and Zhang, Pingping and Wang, Shurun and 
Wang, Shiqi [paper](https://ieeexplore.ieee.org/abstract/document/10246313) 255 | - (TCSVT 2023) **MPAI-EEV: Standardization Efforts of Artificial Intelligence based End-to-End Video Coding** Jia, Chuanmin and Ye, Feng and Dong, Fanke and Lin, Kai and Chiariglione, Leonardo and Ma, Siwei and Sun, Huifang and Gao, Wen [paper](https://arxiv.org/pdf/2309.07589.pdf) 256 | - (TCSVT 2023) **DBVC: An End-to-End 3-D Deep Biomedical Video Coding Framework** Xue, Dongmei and Ma, Haichuan and Li, Li and Liu, Dong and Xiong, Zhiwei and Li, Houqiang [paper](https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=10210614) 257 | 258 | - (CVPR 2023) **Towards Scalable Neural Representation for Diverse Videos** He, Bo and Yang, Xitong and Wang, Hanyu and Wu, Zuxuan and Chen, Hao and Huang, Shuaiyi and Ren, Yixuan and Lim, Ser-Nam and Shrivastava, Abhinav [paper](https://arxiv.org/abs/2303.14124) 259 | - (CVPR 2023) **DNeRV: Modeling Inherent Dynamics via Difference Neural Representation for Videos** Zhao, Qi and Asif, M Salman and Ma, Zhan [paper](https://openaccess.thecvf.com/content/CVPR2023/papers/Zhao_DNeRV_Modeling_Inherent_Dynamics_via_Difference_Neural_Representation_for_Videos_CVPR_2023_paper.pdf) 260 | - (CVPR 2023) **HNeRV: A Hybrid Neural Representation for Videos** Chen, Hao and Gwilliam, Matt and Lim, Ser-Nam and Shrivastava, Abhinav [paper](https://arxiv.org/pdf/2304.02633.pdf) 261 | - (CVPR 2023) **Motion Information Propagation for Neural Video Compression** Qi, Linfeng and Li, Jiahao and Li, Bin and Li, Houqiang and Lu, Yan [paper](https://openaccess.thecvf.com/content/CVPR2023/papers/Qi_Motion_Information_Propagation_for_Neural_Video_Compression_CVPR_2023_paper.pdf) 262 | 263 | - (ICASSP 2023) **LCCM-VC: LEARNED CONDITIONAL CODING MODES FOR VIDEO CODING** Hadi Hadizadeh and Ivan V. 
Bajic [paper](https://arxiv.org/pdf/2210.15883.pdf) 264 | 265 | 266 | 267 | - (Arxiv 2023) **Implicit-explicit Integrated Representations for Multi-view Video Compression** Zhu, Chen and Lu, Guo and He, Bing and Xie, Rong and Song, Li[paper](https://arxiv.org/pdf/2311.13846.pdf) 268 | - (Arxiv 2023) **Accelerating Learnt Video Codecs with Gradient Decay and Layer-wise Distillation** Peng, Tianhao and Gao, Ge and Sun, Heming and Zhang, Fan and Bull, David[paper](https://arxiv.org/pdf/2312.02605.pdf) 269 | - (Arxiv 2023) **Hierarchical Autoencoder-based Lossy Compression for Large-scale High-resolution Scientific Data** Le, Hieu and Santos, Hernan and Tao, Jian[paper](https://arxiv.org/pdf/2307.04216.pdf) 270 | - (Arxiv 2023) **Offline and Online Optical Flow Enhancement for Deep Video Compression** Tang, Chuanbo and Sheng, Xihua and Li, Zhuoyuan and Zhang, Haotian and Li, Li and Liu, Dong[paper](https://arxiv.org/pdf/2307.05092.pdf) 271 | - (Arxiv 2023) **CANF-VC++: Enhancing Conditional Augmented Normalizing Flows for Video Compression with Advanced Techniques** Chen, Peng-Yu and Peng, Wen-Hsiao [paper](https://arxiv.org/pdf/2309.05382.pdf) 272 | - (Arxiv 2023) **Implicit-explicit Integrated Representations for Multi-view Video Compression** Zhu, Chen and Lu, Guo and He, Bing and Xie, Rong and Song, Li [paper](https://arxiv.org/pdf/2311.17350.pdf) 273 | - (Arxiv 2023) **C3: High-performance and low-complexity neural compression from a single image or video** Kim, Hyunjik and Bauer, Matthias and Theis, Lucas and Schwarz, Jonathan Richard and Dupont, Emilien [paper](https://arxiv.org/pdf/2312.02753.pdf) 274 | 275 | - (Arxiv 2023) **Interactive Face Video Coding: A Generative 276 | Compression Framework** Chen, Bolin and Wang, Zhao and Li, Binzhe and Wang, Shurun and Wang, Shiqi and Ye, Yan [paper](https://arxiv.org/pdf/2302.09919.pdf) 277 | - (Arxiv 2023) **MaskCRT: Masked Conditional Residual Transformer for Learned Video Compression** Chen, Yi-Hsin and Xie, 
Hong-Sheng and Chen, Cheng-Wei and Gao, Zong-Lin and Peng, Wen-Hsiao and Benjak, Martin and Ostermann, J{\"o}rn[paper](https://arxiv.org/pdf/2312.15829.pdf) 278 | - (Arxiv 2023) **Interactive Face Video Coding: A Generative 279 | Compression Framework** Chen, Bolin and Wang, Zhao and Li, Binzhe and Wang, Shurun and Wang, Shiqi and Ye, Yan [paper](https://arxiv.org/pdf/2302.09919.pdf) 280 | - (Arxiv 2023) **Butterfly: Multiple Reference Frames Feature Propagation Mechanism for Neural Video Compression** Wang, Feng and Ruan, Haihang and Xiong, Fei and Yang, Jiayu and Li, Litian and Wang, Ronggang [paper](https://arxiv.org/pdf/2303.02959.pdf) 281 | - (Arxiv 2023) **IBVC: Interpolation-driven B-frame Video Compression** Liu, Meiqin and Xu, Chenming and Yao, Chao and Lin, Weisi and Zhao, Yao [paper](https://arxiv.org/pdf/2309.13835.pdf) 282 | - (Arxiv 2023) **Multiscale Motion-Aware and Spatial-Temporal-Channel Contextual Coding Network for Learned Video Compression** Wang, Yiming and Huang, Qian and Tang, Bin and Sun, Huashan and Li, Xing [paper](https://arxiv.org/pdf/2310.12733.pdf) 283 | - (Arxiv 2023) **Effortless Cross-Platform Video Codec: A Codebook-Based Method** Tian, Kuan and Guan, Yonghang and Xiang, Jinxi and Zhang, Jun and Han, Xiao and Yang, Wei [paper](https://arxiv.org/pdf/2310.10292.pdf) 284 | 285 | - (Arxiv 2023) **Generative Face Video Coding Techniques and Standardization Efforts: A Review** Chen, Bolin and Chen, Jie and Wang, Shiqi and Ye, Yan [paper](https://arxiv.org/pdf/2311.02649.pdf) 286 | - (Arxiv 2023) **Bitstream Organization for Parallel Entropy Coding on Neural Network-based Video Codecs** Said, Amir and Le, Hoang and Farhadzadeh, Farzad [paper](https://arxiv.org/pdf/2312.00921.pdf) 287 | - (Arxiv 2023) **Hyperspectral Image Compression Using Sampling and Implicit Neural Representations** Rezasoltani, Shima and Qureshi, Faisal Z [paper](https://arxiv.org/pdf/2312.01558.pdf) 288 | - (Arxiv 2023) **Deep Hierarchical Video Compression** Lu, 
Ming and Duan, Zhihao and Zhu, Fengqing and Ma, Zhan [paper](https://arxiv.org/pdf/2312.07126.pdf) 289 | - (Arxiv 2023) **VCD: A Video Conferencing Dataset for Video Compression** Naderi, Babak and Cutler, Ross and Khongbantabam, Nabakumar Singh and Hosseinkashi, Yasaman [paper](https://arxiv.org/pdf/2309.07376.pdf) 290 | 291 | 292 | --- 293 | 294 | ## ✔2022 [ «🎯Back To Top» ](#) 295 | 296 | --- 297 | 298 | - (Arxiv 2022) **VCT: A Video Compression Transformer** Mentzer, Fabian and Toderici, George and Minnen, David and Hwang, Sung-Jin and Caelles, Sergi and Lucic, Mario and Agustsson, Eirikur [paper](https://arxiv.org/pdf/2206.07307.pdf) 299 | 300 | 301 | - (ECCV 2022) **Neural Video Compression Using GANs for Detail Synthesis and Propagation** Mentzer, Fabian and Agustsson, Eirikur and Ballé, Johannes and Minnen, David and Johnston, Nick and Toderici, George [paper](https://link.springer.com/content/pdf/10.1007/978-3-031-19809-0_32.pdf) 302 | - (ECCV 2022) **CANF-VC: Conditional augmented normalizing flows for video compression** Ho, Yung-Han and Chang, Chih-Peng and Chen, Peng-Yu and Gnutti, Alessandro and Peng, Wen-Hsiao [paper](https://link.springer.com/content/pdf/10.1007/978-3-031-19787-1_12.pdf) 303 | - (ECCV 2022) **AlphaVC: High-Performance and Efficient Learned Video Compression** Shi, Yibo and Ge, Yunying and Wang, Jing and Mao, Jue [paper](https://link.springer.com/content/pdf/10.1007/978-3-031-19800-7_36.pdf) 304 | - (ECCV 2022) **E-NeRV: Expedite neural video representation with disentangled spatial-temporal context** Li, Zizhang and Wang, Mengmeng and Pi, Huaijin and Xu, Kechun and Mei, Jianbiao and Liu, Yong [paper](https://arxiv.org/pdf/2207.08132.pdf) 305 | - (ACM MM 2022) **Hybrid Spatial-Temporal Entropy Modelling for Neural Video Compression** Li, Jiahao and Li, Bin and Lu, Yan. 
[paper](https://arxiv.org/pdf/2207.05894.pdf) 306 | 307 | - (TMM 2022) **Temporal Context Mining for Learned Video Compression** Sheng, Xihua and Li, Jiahao and Li, Bin and Li, Li and Liu, Dong and Lu, Yan [paper](https://arxiv.org/pdf/2111.13850.pdf) 308 | 309 | - (TCSVT 2022) **HMFVC: A Human-Machine Friendly Video Compression Scheme** Huang, Zhimeng and Jia, Chuanmin and Wang, Shanshe and Ma, Siwei [paper](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9894405) 310 | 311 | - (arXiv preprint 2022) **Content-Adaptive Motion Rate Adaption for Learned Video Compression** Lin, Chih-Hsuan and Chen, Yi-Hsin and Peng, Wen-Hsiao [[paper](http://mapl.nctu.edu.tw/content/pages/research/PCS_2022.pdf)] 312 | 313 | - (CVPRW 2022) **Learned Low Bitrate Video Compression with Space-Time Super-Resolution** Yang, Jiayu and Yang, Chunhui and Xiong, Fei and Wang, Feng and Wang, Ronggang [[paper](https://openaccess.thecvf.com/content/CVPR2022W/CLIC/papers/Yang_Learned_Low_Bitrate_Video_Compression_With_Space-Time_Super-Resolution_CVPRW_2022_paper.pdf)] 314 | 315 | - (CVPR 2022) **Coarse-to-fine Deep Video Coding with Hyperprior-guided Mode Prediction** Zhihao Hu, Guo Lu, Jinyang Guo, Shan Liu, Wei Jiang, Dong Xu [[paper](https://openaccess.thecvf.com/content/CVPR2022/papers/Hu_Coarse-To-Fine_Deep_Video_Coding_With_Hyperprior-Guided_Mode_Prediction_CVPR_2022_paper.pdf)] 316 | - (CVPR 2022) **Learning Based Multi-Modality Image and Video Compression**, Lu, Guo and Zhong, Tianxiong and Geng, Jing and Hu, Qiang and Xu, Dong 
[[paper](https://openaccess.thecvf.com/content/CVPR2022/papers/Lu_Learning_Based_Multi-Modality_Image_and_Video_Compression_CVPR_2022_paper.pdf)] 317 | - (CVPR 2022) **LSVC: A Learning-based Stereo Video Compression Framework**, Chen, Zhenghao and Lu, Guo and Hu, Zhihao and Liu, Shan and Jiang, Wei and Xu, Dong [[paper]](https://openaccess.thecvf.com/content/CVPR2022/papers/Chen_LSVC_A_Learning-Based_Stereo_Video_Compression_Framework_CVPR_2022_paper.pdf) 318 | 319 | - (TPAMI 2022) **Multi-modality deep restoration of extremely compressed face videos**, Zhang, Xi and Wu, Xiaolin. [[paper]](https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=9730053) 320 | 321 | - (arXiv preprint 2022) **A Coding Framework and Benchmark towards Compressed Video Understanding**, Yuan Tian, Guo Lu, Yichao Yan, Guangtao Zhai, Li Chen, Zhiyong Gao. [[paper]](https://arxiv.org/pdf/2202.02813.pdf) 322 | 323 | - (Under review ICLR 2022) **Learning Perceptual Compression of Facial Video**, Shukor, Mustafa and Xu, YAO and Damodaran, Bharath Bhushan and Hellier, Pierre. [[paper]](https://openreview.net/pdf?id=4ZEJ_Z18NH) 324 | 325 | - (Under review ICLR 2022) **Uncertainty-Aware Deep Video Compression with Ensembles**, Ma, Wufei and Li, Jiahao and Li, Bin and Lu, Yan. [[paper]](https://openreview.net/pdf?id=vkZtFD0zga8) 326 | 327 | - (Signal Processing: Image Communication 2022) **Learning to compress videos without computing motion**, Chen, Meixu and Goodall, Todd and Patney, Anjul and Bovik, Alan C. [[paper]](https://reader.elsevier.com/reader/sd/pii/S0923596522000029?token=0DD9114AD904612721941553941BA62D7D7F1FCC292AF6C26D121372C2E69C81B4ACDCBD040F51AA44EEF35A1038DE80&originRegion=us-east-1&originCreation=20220419084824) 328 | 329 | - (arXiv preprint 2022) **Multi-View Video Coding with GAN Latent Learning**, Lan, Chengdong and Luo, Cheng and Yan, Hao and Zhao, Tiesong and Kwong, Sam. 
[[paper]](https://arxiv.org/pdf/2205.03599.pdf) 330 | 331 | - (ICASSP 2022) **Rate Control for Learned Video Compression**, Li, Yanghao and Chen, Xinyao and Li, Jisheng and Wen, Jiangtao and Han, Yuxing and Liu, Shan and Xu, Xiaozhong. [[paper]](https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=9746080) 332 | 333 | - (TCSVT 2022) **Edge-Based Video Compression Texture Synthesis using Generative Adversarial Network**, Zhu, Chen and Xu, Jun and Feng, Donghui and Xie, Rong and Song, Li. [[paper]](https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=9762281) 334 | 335 | 336 | 337 | --- 338 | 339 | ## ✔2021 [ «🎯Back To Top» ](#) 340 | 341 | --- 342 | - (NeurIPS 2021) **NeRV: Neural representations for videos** Chen, Hao and He, Bo and Wang, Hanyu and Ren, Yixuan and Lim, Ser Nam and Shrivastava, Abhinav [[paper]](https://proceedings.neurips.cc/paper_files/paper/2021/file/b44182379bf9fae976e6ae5996e13cd8-Paper.pdf) 343 | 344 | - (ICLR 2021) **Hierarchical autoregressive modeling for neural video compression**, Yang, Ruihan and Yang, Yibo and Marino, Joseph and Mandt, Stephan. [[paper]](https://arxiv.org/pdf/2010.10258.pdf) 345 | 346 | - (TPAMI 2021) **An end-to-end learning framework for video compression**, Lu, Guo and Zhang, Xiaoyun and Ouyang, Wanli and Chen, Li and Gao, Zhiyong and Xu, Dong. [[paper]](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9072487) 347 | 348 | - (TIP 2021) **End-to-End Rate-Distortion Optimized Learned Hierarchical Bi-Directional Video Compression**, Yılmaz, M Akın and Tekalp, A Murat. [[paper]](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9667275) 349 | 350 | - (ICCV 21) **Online-trained Upsampler for Deep Low Complexity Video Compression**, Klopp, Jan P and Liu, Keng-Chi and Chien, Shao-Yi and Chen, Liang-Gee. 
[[paper]](https://openaccess.thecvf.com/content/ICCV2021/papers/Klopp_Online-Trained_Upsampler_for_Deep_Low_Complexity_Video_Compression_ICCV_2021_paper.pdf) 351 | 352 | - (NIPS 21) **Deep Contextual Video Compression**, Li, Jiahao and Li, Bin and Lu, Yan. [[paper]](https://proceedings.neurips.cc/paper/2021/file/96b250a90d3cf0868c83f8c965142d2a-Paper.pdf) 353 | 354 | - (ICCV 21) **ELF-VC: Efficient Learned Flexible-Rate Video Coding**, Rippel, Oren and Anderson, Alexander G and Tatwawadi, Kedar and Nair, Sanjay and Lytle, Craig and Bourdev, Lubomir. [[paper]](https://openaccess.thecvf.com/content/ICCV2021/papers/Rippel_ELF-VC_Efficient_Learned_Flexible-Rate_Video_Coding_ICCV_2021_paper.pdf) 355 | 356 | - (CVPR 21) **FVC: A New Framework towards Deep Video Compression in Feature Space**, Hu, Zhihao and Lu, Guo and Xu, Dong. [[paper]](https://openaccess.thecvf.com/content/CVPR2021/papers/Hu_FVC_A_New_Framework_Towards_Deep_Video_Compression_in_Feature_CVPR_2021_paper.pdf) 357 | 358 | - (CVPR 21) **Deep Perceptual Preprocessing for Video Coding**, Aaron Chadha, Yiannis Andreopoulos. [[paper]](https://openaccess.thecvf.com/content/CVPR2021/papers/Chadha_Deep_Perceptual_Preprocessing_for_Video_Coding_CVPR_2021_paper.pdf) 359 | 360 | - (CVPR 21) **Deep learning in latent space for video prediction and compression**, Liu, Bowen and Chen, Yu and Liu, Shiyu and Kim, Hun-Seok. [[paper]](https://openaccess.thecvf.com/content/CVPR2021/papers/Liu_Deep_Learning_in_Latent_Space_for_Video_Prediction_and_Compression_CVPR_2021_paper.pdf) 361 | 362 | - (ICIP 21) **Variable-Rate Video Compression**, Lin, Jianping and Liu, Dong and Liang, Jie and Li, Houqiang and Wu, Feng. 
[[paper]](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9506269) VR 363 | 364 | - (VCIP 21) **DVC-P: Deep Video Compression with Perceptual Optimizations**, Zhang, Saiping and Mrak, Marta and Herranz, Luis and Blanch, Marc Górriz and Wan, Shuai and Yang, Fuzheng. [[paper]](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9675350) 365 | 366 | - (MTICTI 2021) **Review and Evaluation of End-to-End Video Compression with Deep-Learning**, Yasin, Hajar Maseeh and Ameen, Siddeeq Yosef. [[paper]](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9664790) 367 | 368 | - (arXiv preprint 2021) **Deep Video Coding with Dual-Path Generative Adversarial Network**, Zhao, Tiesong and Feng, Weize and Zeng, Hongji and Niu, Yuzhen and Liu, Jiaying. [[paper]](https://arxiv.org/pdf/2111.14474.pdf) 369 | 370 | - (arXiv preprint 2021) **Versatile Learned Video Compression**, Feng, Runsen and Guo, Zongyu and Zhang, Zhizheng and Chen, Zhibo. [[paper]](https://arxiv.org/pdf/2111.03386.pdf) 371 | 372 | - (arXiv preprint 2021) **Generalized Difference Coder: A Novel Conditional Autoencoder Structure for Video Compression**, Brand, Fabian and Seiler, Jürgen and Kaup, André. [[paper]](https://arxiv.org/pdf/2112.08011.pdf) 373 | 374 | 375 | - (arXiv preprint 2021) **Implicit Neural Video Compression**, Zhang, Yunfan and van Rozendaal, Ties and Brehmer, Johann and Nagel, Markus and Cohen, Taco. [[paper]](https://arxiv.org/pdf/2112.11312.pdf) 376 | 377 | - (arXiv preprint 2021) **Self-Supervised Learning of Perceptually Optimized Block Motion Estimates for Video Compression**, Paul, Somdyuti and Norkin, Andrey and Bovik, Alan C. [[paper]](https://arxiv.org/pdf/2110.01805.pdf) MV 378 | 379 | - (arXiv preprint 2021) **Learning Cross-Scale Prediction for Efficient Neural Video Compression**, Guo, Zongyu and Feng, Runsen and Zhang, Zhizheng and Jin, Xin and Chen, Zhibo. 
[[paper]](https://arxiv.org/pdf/2112.13309.pdf) MV 380 | 381 | - (arXiv preprint 2021) **Neural Video Compression using GANs for Detail Synthesis and Propagation**, Mentzer, Fabian and Agustsson, Eirikur and Ballé, Johannes and Minnen, David and Johnston, Nick and Toderici, George. [[paper]](https://arxiv.org/pdf/2107.12038.pdf) 382 | 383 | - (arXiv preprint 2021) **Neural weight step video compression**, Czerkawski, Mikolaj and Cardona, Javier and Atkinson, Robert and Michie, Craig and Andonovic, Ivan and Clemente, Carmine and Tachtatzis, Christos. [[paper]](https://arxiv.org/pdf/2112.01504.pdf) 384 | 385 | - (arXiv preprint 2021) **Perceptual Learned Video Compression with Recurrent Conditional GAN**, Yang, Ren and Van Gool, Luc and Timofte, Radu. [[paper]](https://arxiv.org/pdf/2109.03082.pdf) 386 | 387 | 388 | --- 389 | 390 | ## ✔2020 [ «🎯Back To Top» ](#) 391 | 392 | --- 393 | 394 | - (AAAI 20) **Learned video compression via joint spatial-temporal correlation exploration**, Liu, Haojie and Shen, Han and Huang, Lichao and Lu, Ming and Chen, Tong and Ma, Zhan. [[paper]](https://ojs.aaai.org/index.php/AAAI/article/view/6825/6679) 395 | 396 | 397 | - (CVPR 20) **Learning for video compression with hierarchical quality and recurrent enhancement**, Yang, Ren and Mentzer, Fabian and Gool, Luc Van and Timofte, Radu. [[paper]](https://openaccess.thecvf.com/content_CVPR_2020/papers/Yang_Learning_for_Video_Compression_With_Hierarchical_Quality_and_Recurrent_Enhancement_CVPR_2020_paper.pdf) 398 | 399 | - (CVPR 20) **M-LVC: Multiple frames prediction for learned video compression**, Lin, Jianping and Liu, Dong and Li, Houqiang and Wu, Feng. [[paper]](https://openaccess.thecvf.com/content_CVPR_2020/papers/Lin_M-LVC_Multiple_Frames_Prediction_for_Learned_Video_Compression_CVPR_2020_paper.pdf) 400 | 401 | - (CVPRW 20) **Learned video compression with feature-level residuals**, Feng R, Wu Y, Guo Z, et al. 
[[paper]](https://openaccess.thecvf.com/content_CVPRW_2020/papers/w7/Feng_Learned_Video_Compression_With_Feature-Level_Residuals_CVPRW_2020_paper.pdf) 402 | 403 | - (ACCV 20) **Feedback recurrent autoencoder for video compression**, Golinski, Adam and Pourreza, Reza and Yang, Yang and Sautiere, Guillaume and Cohen, Taco S. [[paper]](https://openaccess.thecvf.com/content/ACCV2020/papers/Golinski_Feedback_Recurrent_Autoencoder_for_Video_Compression_ACCV_2020_paper.pdf) 404 | 405 | - (CSUR 20) **Deep learning-based video coding: A review and a case study**, Liu, Dong and Li, Yue and Lin, Jianping and Li, Houqiang and Wu, Feng. [[paper]](https://dl.acm.org/doi/pdf/10.1145/3368405) 406 | 407 | --- 408 | 409 | ## ✔2019 [ «🎯Back To Top» ](#) 410 | 411 | --- 412 | 413 | - (CVPR 19) **DVC: An end-to-end deep video compression framework**, Lu, Guo and Ouyang, Wanli and Xu, Dong and Zhang, Xiaoyun and Cai, Chunlei and Gao, Zhiyong. [[paper]](https://openaccess.thecvf.com/content_CVPR_2019/papers/Lu_DVC_An_End-To-End_Deep_Video_Compression_Framework_CVPR_2019_paper.pdf) 414 | 415 | - (ICCV 19) **Learned video compression**, Rippel, Oren and Nair, Sanjay and Lew, Carissa and Branson, Steve and Anderson, Alexander G and Bourdev, Lubomir. [[paper]](https://openaccess.thecvf.com/content_ICCV_2019/papers/Rippel_Learned_Video_Compression_ICCV_2019_paper.pdf) 416 | 417 | - (NIPS 19) **Deep generative video compression**, Lombardo, Salvator and Han, Jun and Schroers, Christopher and Mandt, Stephan. [[paper]](https://proceedings.neurips.cc/paper/2019/file/f1ea154c843f7cf3677db7ce922a2d17-Paper.pdf) 418 | 419 | 420 | 421 | 422 | 423 | - (TCSVT 19) **Image and video compression with neural networks: A review**, Ma, Siwei and Zhang, Xinfeng and Jia, Chuanmin and Zhao, Zhenghui and Wang, Shiqi and Wang, Shanshe. 
[[paper]](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8693636) 424 | 425 | 426 | 427 | --- 428 | 429 | ## ✔2018 [ «🎯Back To Top» ](#) 430 | 431 | --- 432 | 433 | - (ECCV 18) **Video compression through image interpolation**, Wu, Chao-Yuan and Singhal, Nayan and Krahenbuhl, Philipp. [[paper]](https://openaccess.thecvf.com/content_ECCV_2018/papers/Chao-Yuan_Wu_Video_Compression_through_ECCV_2018_paper.pdf) 434 | 435 | 436 | 437 | --- 438 | 439 | ## ✔2017 [ «🎯Back To Top» ](#) 440 | 441 | --- 442 | 443 | - (VCIP 17) **Video compression based on spatio-temporal resolution adaptation**, Afonso, Mariana and Zhang, Fan and Bull, David R. [[paper]](https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=8517114) 444 | 445 | --- 446 | --------------------------------------------------------------------------------