Yueyu Hu

About MeHi there! This is Yueyu. Starting June 2025, I’ll be joining Apple as a Machine Learning Engineer. I recently completed my Ph.D. in Electrical Engineering at New York University, where I worked with Prof. Yao Wang at NYU Video Lab. I received a master's degree in computer science at Peking University, advised by
                  Prof. Jiaying Liu
                  at STRUCT. My research spans 3D vision, machine learning, and image / video / point cloud compression. I’m passionate about building systems that bring visual understanding and efficient communication closer together.

                  Beyond research, I am also interested in
                  photography,
                  trains and railways, meteorology, science-fiction and computer network development.
 I love New York City, where I spent my Ph.D. student life. Visiting NYC? Check out my NYC Experience for some recommendations!

                  WHIM is where I share my debugging experiences, observations, or otherwise. Check it out!
Friends
                  
Useful LinksNYC Experience:
                  See my recommendation for visiting NYC.
Windy:
                    A website to provide visualized numerical weather forecast results by, e.g. GFS, ECMWF.
DLVC:
                      An open-sourced deep-learning driven video codec.
FVC-NIC:
                      Open-Source learned image compression framework.
BDWM:
                      The most popular & official Bulletin Board System (BBS) at Peking University.

                Recent Publications
                
                    Yueyu Hu, Chenhao Zhang, Onur G. Guleryuz, Debargha Mukherjee, and Yao Wang.
                    "Standard compliant video coding using low complexity, switchable neural wrappers",
                    IEEE ICIP, 2024. 

                    [arXiv]
                  
                    Yueyu Hu, Onur G. Guleryuz, Philip A. Chou, Danhang Tang, Jonathan Taylor, Rus Maxham and Yao Wang.
                    "One-Click Upgrade from 2D to 3D: Sandwiched RGB-D Video Compression for Stereoscopic Teleconferencing",
                    to appear at CVPR Workshop (AIS: Vision, Graphics and AI for Streaming), 2024. 

                    [Workshop]
                    [arXiv]
                    [PDF]
                    [supp]
                  
                    Yueyu Hu, Ran Gong, Qi Sun, and Yao Wang.
                    "Low Latency Point Cloud Rendering with Learned Splatting",
                    to appear at CVPR Workshop (AIS: Vision, Graphics and AI for Streaming), 2024. 

                    [Workshop]
                    [PDF]
                    [supp] 
                    [Code]
                  
                    Yueyu Hu, Ran Gong, Yao Wang.
                    "Bits-to-Photon: End-to-End Learned Scalable Point Cloud Compression for Direct Rendering", arXiv:2406.05915, 2024. 

                    [arXiv]
                  
                    Yueyu Hu, Chenhao Zhang, Onur G. Guleryuz, Debargha Mukherjee, and Yao Wang.
                    "Standard Compatible Efficient Video Coding with Jointly Optimized Neural Wrappers",
                    Data Compression Conference (DCC), 2024. 

                    [Code]
                    [Poster]
                  
                    Wenhan Yang, Haofeng Huang, Yueyu Hu, Ling-Yu Duan, and Jiaying Liu.
                    "Video Coding for Machines: Compact Visual Representation Compression for Intelligent Collaborative Analytics",
                    Accepted by IEEE Transactions on Pattern Analysis and Machine Intelligence (IEEE TPAMI), 2024. 

                    Yueyu Hu and Yao Wang.
                    "Learning Neural Volumetric Field for Point Cloud Geometry Compression",
                    2022 Picture Coding Symposium (PCS), 2022. 

                    [Code]
                    [arXiv]
                  
                    Yueyu Hu, Wenhan Yang, Zhan Ma, and Jiaying Liu.
                    "Learning End-to-End Lossy Image Compression: A Benchmark",
                    IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021.

                    [Project]
                    [arXiv]
                    [IEEE]
                    [Code]
                  
                    Yueyu Hu, Wenhan Yang, and Jiaying Liu.
                    "Coarse-to-Fine Hyper-Prior Modeling for Learned Image Compression",
                    Proc. AAAI Conference on Artificial Intelligence (AAAI), 2020.

                    [PDF]
                    [Poster]
                    [Project]
                    [TF Code]
                    [PyTorch Code]
                    [Spotlight]
                  
                    Yueyu Hu, Wenhan Yang, Mading Li, and Jiaying Liu.
                    "Progressive Spatial Recurrent Neural Network for Intra Prediction",
                    IEEE Transactions on Multimedia (TMM), 2019.

                    [PDF]
                    [Code]
                  
                    Yueyu Hu, Shuai Yang, Wenhan Yang, Ling-Yu Duan, and Jiaying Liu.
                    "Towards Coding for Human and Machine Vision: A Scalable Image Coding Approach",
		    Proc. IEEE International Conference on Multimedia & Expo (ICME), 2020. (Best Paper Award)

                    [PDF]
                    [Project]
		    [Slides]
                  
                    Shuai Yang, Yueyu Hu, Wenhan Yang, Ling-Yu Duan, and Jiaying Liu.
                    "Towards Coding for Human and Machine Vision: Scalable Face Image Coding",
                    IEEE Transactions on Multimedia (TMM), 2020.

                    [PDF]
                  
                      [Full list of publications]
                  
Projects
                  
                            Towards Coding for Human and Machine Vision: A Scalable Image Coding Approach
                          
                            In this paper, we come up with a novel image coding framework by leveraging both the compressive and the generative models, to support machine vision and human perception tasks jointly. By introducing advanced generative models, we train a flexible network to reconstruct images from compact feature representations and the reference pixels. Experimental results demonstrate the superiority of our framework in both human visual quality and facial landmark detection, which provide useful evidence on the emerging standardization efforts on MPEG VCM (Video Coding for Machine).
                          
                            [PDF]
                            [Supplementary]
                            [Project]
                          
                            Coarse-to-Fine Hyper-Prior Modeling for Learned Image Compression
                          
                           End-to-End learned image compression framework with a coarse-to-fine hyper-prior model,
                            featuring Signal Preserving Hyper Transform and Information Aggregation Reconstruction sub-network.
                            The proposed model achieve superior performance to existing hyper-prior models with context models.
                          
                            [PDF]
                            [Poster]
                            [Project]
                            [Code]
                            [Spotlight]
                          
                            Learning End-to-End Lossy Image Compression: A Benchmark
                          
                            A survey and benchmark of existing end-to-end learned image compression methods.
                          
                            [arXiv]
                            [Project]

                  
Yueyu Hu, Ph.D. 
 胡越予
 yyhu AT nyu·edu

About MeHi there! This is Yueyu. Starting June 2025, I’ll be joining Apple as a Machine Learning Engineer. I recently completed my Ph.D. in Electrical Engineering at New York University, where I worked with Prof. Yao Wang at NYU Video Lab. I received a master's degree in computer science at Peking University, advised by
                  Prof. Jiaying Liu
                  at STRUCT. My research spans 3D vision, machine learning, and image / video / point cloud compression. I’m passionate about building systems that bring visual understanding and efficient communication closer together.

                  Beyond research, I am also interested in
                  photography,
                  trains and railways, meteorology, science-fiction and computer network development.
 I love New York City, where I spent my Ph.D. student life. Visiting NYC? Check out my NYC Experience for some recommendations!

                    WHIM is where I share my debugging experiences, observations, or otherwise. Check it out!

                Selected Publications
                
                    Yueyu Hu, Chenhao Zhang, Onur G. Guleryuz, Debargha Mukherjee, and Yao Wang.
                    "Standard compliant video coding using low complexity, switchable neural wrappers",
                    IEEE ICIP, 2024. 

                    [arXiv]
                  
                    Yueyu Hu, Onur G. Guleryuz, Philip A. Chou, Danhang Tang, Jonathan Taylor, Rus Maxham and Yao Wang.
                    "One-Click Upgrade from 2D to 3D: Sandwiched RGB-D Video Compression for Stereoscopic Teleconferencing",
                    to appear at CVPR Workshop (AIS: Vision, Graphics and AI for Streaming), 2024. 

                    [Workshop]
                    [arXiv]
                    [PDF]
                    [supp]
                  
                    Yueyu Hu, Ran Gong, Qi Sun, and Yao Wang.
                    "Low Latency Point Cloud Rendering with Learned Splatting",
                    to appear at CVPR Workshop (AIS: Vision, Graphics and AI for Streaming), 2024. 

                    [Workshop]
                    [PDF]
                    [supp]
                    [Code]
                  
                    Yueyu Hu, Ran Gong, Yao Wang.
                    "Bits-to-Photon: End-to-End Learned Scalable Point Cloud Compression for Direct Rendering", arXiv:2406.05915, 2024. 

                    [arXiv]
                  
                    Yueyu Hu, Chenhao Zhang, Onur G. Guleryuz, Debargha Mukherjee, and Yao Wang.
                    "Standard Compatible Efficient Video Coding with Jointly Optimized Neural Wrappers",
                    Data Compression Conference (DCC), 2024. 

                    [Code]
                    [Poster]
                  
                    Wenhan Yang, Haofeng Huang, Yueyu Hu, Ling-Yu Duan, and Jiaying Liu.
                    "Video Coding for Machines: Compact Visual Representation Compression for Intelligent Collaborative Analytics",
                    Accepted by IEEE Transactions on Pattern Analysis and Machine Intelligence (IEEE TPAMI), 2024. 

                    Yueyu Hu and Yao Wang.
                    "Learning Neural Volumetric Field for Point Cloud Geometry Compression",
                    Accepted by 2022 Picture Coding Symposium (PCS), 2022.

                    [Code]
                    [arXiv]
                  
                    Yueyu Hu, Wenhan Yang, Zhan Ma, and Jiaying Liu.
                    "Learning End-to-End Lossy Image Compression: A Benchmark",
                    Accepted by IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021.

                    [Project]
                    [arXiv]
                    [IEEE]
                    [Code]
                  
                    Yueyu Hu, Wenhan Yang, and Jiaying Liu.
                    "Coarse-to-Fine Hyper-Prior Modeling for Learned Image Compression",
                    Proc. AAAI Conference on Artificial Intelligence (AAAI), 2020.

                    [PDF]
                    [Poster]
                    [Project]
                    [Code]
                    [Spotlight]
        
                    Yueyu Hu, Wenhan Yang, Mading Li, and Jiaying Liu.
                    "Progressive Spatial Recurrent Neural Network for Intra Prediction",
                    IEEE Transactions on Multimedia (TMM), 2019.

                    [PDF]
		    [Code]
                  
                    Yueyu Hu, Shuai Yang, Wenhan Yang, Ling-Yu Duan, and Jiaying Liu.
                    "Towards Coding for Human and Machine Vision: A Scalable Image Coding Approach",
		    Proc. IEEE International Conference on Multimedia & Expo (ICME), 2020. (Best Paper Award)

                    [PDF]
                    [Project]
		    [Slides]
                  
                    Shuai Yang, Yueyu Hu, Wenhan Yang, Ling-Yu Duan, and Jiaying Liu.
                    "Towards Coding for Human and Machine Vision: Scalable Face Image Coding",
                    IEEE Transactions on Multimedia (TMM), 2020.

                    [PDF]
                  
                      [Full list of publications]
                  
Projects
                  
                            Towards Coding for Human and Machine Vision: A Scalable Image Coding Approach
                          
                            In this paper, we come up with a novel image coding framework by leveraging both the compressive and the generative models, to support machine vision and human perception tasks jointly. By introducing advanced generative models, we train a flexible network to reconstruct images from compact feature representations and the reference pixels. Experimental results demonstrate the superiority of our framework in both human visual quality and facial landmark detection, which provide useful evidence on the emerging standardization efforts on MPEG VCM (Video Coding for Machine).
                          
                            [PDF]
                            [Supplementary]
                            [Project]
                          
                            Coarse-to-Fine Hyper-Prior Modeling for Learned Image Compression
                          
                            End-to-End learned image compression framework with a coarse-to-fine hyper-prior model,
                            featuring Signal Preserving Hyper Transform and Information Aggregation Reconstruction sub-network.
                            The proposed model achieve superior performance to existing hyper-prior models with context models.
                          
                            [PDF]
                            [Poster]
                            [Project]
                            [TF Code]
                            [PyTorch Code]
                            [Spotlight]
                          
                            Learning End-to-End Lossy Image Compression: A Benchmark
                          
                            A survey and benchmark of existing end-to-end learned image compression methods.
                          
                            [arXiv]
                            [Project]
                          
Friends
                
Useful LinksNYC Experience:
                  See my recommendation for visiting NYC.
Windy:
                  A website to provide visualized numerical weather forecast results by, e.g. GFS, ECMWF.
DLVC:
                  An open-sourced deep-learning driven video codec.
FVC-NIC:
                  Open-Source learned image compression framework.
BDWM:
                  The most popular & official Bulletin Board System (BBS) at Peking University.

	Towards Coding for Human and Machine Vision: A Scalable Image Coding Approach In this paper, we come up with a novel image coding framework by leveraging both the compressive and the generative models, to support machine vision and human perception tasks jointly. By introducing advanced generative models, we train a flexible network to reconstruct images from compact feature representations and the reference pixels. Experimental results demonstrate the superiority of our framework in both human visual quality and facial landmark detection, which provide useful evidence on the emerging standardization efforts on MPEG VCM (Video Coding for Machine). [PDF] [Supplementary] [Project]
	Coarse-to-Fine Hyper-Prior Modeling for Learned Image Compression End-to-End learned image compression framework with a coarse-to-fine hyper-prior model, featuring Signal Preserving Hyper Transform and Information Aggregation Reconstruction sub-network. The proposed model achieve superior performance to existing hyper-prior models with context models. [PDF] [Poster] [Project] [Code] [Spotlight]
	Learning End-to-End Lossy Image Compression: A Benchmark A survey and benchmark of existing end-to-end learned image compression methods. [arXiv] [Project]