Yueyu HU, 胡越予

Ph.D. Candidate, New York University

yyhu AT nyu·edu



About Me

Hi there! This is Yueyu. I am a 4th-year PhD student (expected to graduate in May 2025) working with Prof. Yao Wang at NYU Video Lab. I received a master's degree in computer science at Peking University, advised by Prof. Jiaying Liu at STRUCT. My research interests include computer vision, machine learning and image/video compression. I am also interested in photography, trains and railways, meteorology, science-fiction and computer network development.

I am open to job opportunities starting in Summer 2025.

WHIM is where I share my debugging experiences, observations, or otherwise. Check it out!

Friends


Useful Links

NYC Experience: See my recommendation for visiting NYC.

Windy: A website to provide visualized numerical weather forecast results by, e.g. GFS, ECMWF.

DLVC: An open-sourced deep-learning driven video codec.

FVC-NIC: Open-Source learned image compression framework.

BDWM: The most popular & official Bulletin Board System (BBS) at Peking University.

Recent Publications
  1. Yueyu Hu, Chenhao Zhang, Onur G. Guleryuz, Debargha Mukherjee, and Yao Wang. "Standard compliant video coding using low complexity, switchable neural wrappers", IEEE ICIP, 2024.
    [arXiv]
  2. Yueyu Hu, Onur G. Guleryuz, Philip A. Chou, Danhang Tang, Jonathan Taylor, Rus Maxham and Yao Wang. "One-Click Upgrade from 2D to 3D: Sandwiched RGB-D Video Compression for Stereoscopic Teleconferencing", to appear at CVPR Workshop (AIS: Vision, Graphics and AI for Streaming), 2024.
    [Workshop] [arXiv] [PDF] [supp]
  3. Yueyu Hu, Ran Gong, Qi Sun, and Yao Wang. "Low Latency Point Cloud Rendering with Learned Splatting", to appear at CVPR Workshop (AIS: Vision, Graphics and AI for Streaming), 2024.
    [Workshop] [PDF] [supp] [Code]
  4. Yueyu Hu, Ran Gong, Yao Wang. "Bits-to-Photon: End-to-End Learned Scalable Point Cloud Compression for Direct Rendering", arXiv:2406.05915, 2024.
    [arXiv]
  5. Yueyu Hu, Chenhao Zhang, Onur G. Guleryuz, Debargha Mukherjee, and Yao Wang. "Standard Compatible Efficient Video Coding with Jointly Optimized Neural Wrappers", Data Compression Conference (DCC), 2024.
    [Code] [Poster]
  6. Wenhan Yang, Haofeng Huang, Yueyu Hu, Ling-Yu Duan, and Jiaying Liu. "Video Coding for Machines: Compact Visual Representation Compression for Intelligent Collaborative Analytics", Accepted by IEEE Transactions on Pattern Analysis and Machine Intelligence (IEEE TPAMI), 2024.
  7. Yueyu Hu and Yao Wang. "Learning Neural Volumetric Field for Point Cloud Geometry Compression", 2022 Picture Coding Symposium (PCS), 2022.
    [Code] [arXiv]
  8. Yueyu Hu, Wenhan Yang, Zhan Ma, and Jiaying Liu. "Learning End-to-End Lossy Image Compression: A Benchmark", IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021.
    [Project] [arXiv] [IEEE] [Code]
  9. Yueyu Hu, Wenhan Yang, and Jiaying Liu. "Coarse-to-Fine Hyper-Prior Modeling for Learned Image Compression", Proc. AAAI Conference on Artificial Intelligence (AAAI), 2020.
    [PDF] [Poster] [Project] [TF Code] [PyTorch Code] [Spotlight]
  10. Yueyu Hu, Wenhan Yang, Mading Li, and Jiaying Liu. "Progressive Spatial Recurrent Neural Network for Intra Prediction", IEEE Transactions on Multimedia (TMM), 2019.
    [PDF] [Code]
  11. Yueyu Hu, Shuai Yang, Wenhan Yang, Ling-Yu Duan, and Jiaying Liu. "Towards Coding for Human and Machine Vision: A Scalable Image Coding Approach", Proc. IEEE International Conference on Multimedia & Expo (ICME), 2020. (Best Paper Award)
    [PDF] [Project] [Slides]
  12. Shuai Yang, Yueyu Hu, Wenhan Yang, Ling-Yu Duan, and Jiaying Liu. "Towards Coding for Human and Machine Vision: Scalable Face Image Coding", IEEE Transactions on Multimedia (TMM), 2020.
    [PDF]
  13. [Full list of publications]

Projects

Towards Coding for Human and Machine Vision: A Scalable Image Coding Approach

In this paper, we come up with a novel image coding framework by leveraging both the compressive and the generative models, to support machine vision and human perception tasks jointly. By introducing advanced generative models, we train a flexible network to reconstruct images from compact feature representations and the reference pixels. Experimental results demonstrate the superiority of our framework in both human visual quality and facial landmark detection, which provide useful evidence on the emerging standardization efforts on MPEG VCM (Video Coding for Machine).

[PDF] [Supplementary] [Project]

Coarse-to-Fine Hyper-Prior Modeling for Learned Image Compression

End-to-End learned image compression framework with a coarse-to-fine hyper-prior model, featuring Signal Preserving Hyper Transform and Information Aggregation Reconstruction sub-network. The proposed model achieve superior performance to existing hyper-prior models with context models.

[PDF] [Poster] [Project] [Code] [Spotlight]

Learning End-to-End Lossy Image Compression: A Benchmark

A survey and benchmark of existing end-to-end learned image compression methods.

[arXiv] [Project]

Yueyu HU, 胡越予

Ph.D. Candidate, New York University

yyhu AT nyu·edu



About Me

Hi there! This is Yueyu. I am a 4th-year PhD student (expected to graduate in May 2025) working with Prof. Yao Wang at NYU Video Lab. I received a master's degree in computer science at Peking University, advised by Prof. Jiaying Liu at STRUCT. My research interests include computer vision, machine learning and image/video compression. I am also interested in photography, trains and railways, meteorology, science-fiction and computer network development.

I am open to job opportunities starting in Summer 2025.

WHIM is where I share my debugging experiences, observations, or otherwise. Check it out!

Selected Publications
  1. Yueyu Hu, Chenhao Zhang, Onur G. Guleryuz, Debargha Mukherjee, and Yao Wang. "Standard compliant video coding using low complexity, switchable neural wrappers", IEEE ICIP, 2024.
    [arXiv]
  2. Yueyu Hu, Onur G. Guleryuz, Philip A. Chou, Danhang Tang, Jonathan Taylor, Rus Maxham and Yao Wang. "One-Click Upgrade from 2D to 3D: Sandwiched RGB-D Video Compression for Stereoscopic Teleconferencing", to appear at CVPR Workshop (AIS: Vision, Graphics and AI for Streaming), 2024.
    [Workshop] [arXiv] [PDF] [supp]
  3. Yueyu Hu, Ran Gong, Qi Sun, and Yao Wang. "Low Latency Point Cloud Rendering with Learned Splatting", to appear at CVPR Workshop (AIS: Vision, Graphics and AI for Streaming), 2024.
    [Workshop] [PDF] [supp] [Code]
  4. Yueyu Hu, Ran Gong, Yao Wang. "Bits-to-Photon: End-to-End Learned Scalable Point Cloud Compression for Direct Rendering", arXiv:2406.05915, 2024.
    [arXiv]
  5. Yueyu Hu, Chenhao Zhang, Onur G. Guleryuz, Debargha Mukherjee, and Yao Wang. "Standard Compatible Efficient Video Coding with Jointly Optimized Neural Wrappers", Data Compression Conference (DCC), 2024.
    [Code] [Poster]
  6. Wenhan Yang, Haofeng Huang, Yueyu Hu, Ling-Yu Duan, and Jiaying Liu. "Video Coding for Machines: Compact Visual Representation Compression for Intelligent Collaborative Analytics", Accepted by IEEE Transactions on Pattern Analysis and Machine Intelligence (IEEE TPAMI), 2024.
  7. Yueyu Hu and Yao Wang. "Learning Neural Volumetric Field for Point Cloud Geometry Compression", Accepted by 2022 Picture Coding Symposium (PCS), 2022.
    [Code] [arXiv]
  8. Yueyu Hu, Wenhan Yang, Zhan Ma, and Jiaying Liu. "Learning End-to-End Lossy Image Compression: A Benchmark", Accepted by IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021.
    [Project] [arXiv] [IEEE] [Code]
  9. Yueyu Hu, Wenhan Yang, and Jiaying Liu. "Coarse-to-Fine Hyper-Prior Modeling for Learned Image Compression", Proc. AAAI Conference on Artificial Intelligence (AAAI), 2020.
    [PDF] [Poster] [Project] [Code] [Spotlight]
  10. Yueyu Hu, Wenhan Yang, Mading Li, and Jiaying Liu. "Progressive Spatial Recurrent Neural Network for Intra Prediction", IEEE Transactions on Multimedia (TMM), 2019.
    [PDF] [Code]
  11. Yueyu Hu, Shuai Yang, Wenhan Yang, Ling-Yu Duan, and Jiaying Liu. "Towards Coding for Human and Machine Vision: A Scalable Image Coding Approach", Proc. IEEE International Conference on Multimedia & Expo (ICME), 2020. (Best Paper Award)
    [PDF] [Project] [Slides]
  12. Shuai Yang, Yueyu Hu, Wenhan Yang, Ling-Yu Duan, and Jiaying Liu. "Towards Coding for Human and Machine Vision: Scalable Face Image Coding", IEEE Transactions on Multimedia (TMM), 2020.
    [PDF]
  13. [Full list of publications]

Projects

Towards Coding for Human and Machine Vision: A Scalable Image Coding Approach

In this paper, we come up with a novel image coding framework by leveraging both the compressive and the generative models, to support machine vision and human perception tasks jointly. By introducing advanced generative models, we train a flexible network to reconstruct images from compact feature representations and the reference pixels. Experimental results demonstrate the superiority of our framework in both human visual quality and facial landmark detection, which provide useful evidence on the emerging standardization efforts on MPEG VCM (Video Coding for Machine).

[PDF] [Supplementary] [Project]

Coarse-to-Fine Hyper-Prior Modeling for Learned Image Compression

End-to-End learned image compression framework with a coarse-to-fine hyper-prior model, featuring Signal Preserving Hyper Transform and Information Aggregation Reconstruction sub-network. The proposed model achieve superior performance to existing hyper-prior models with context models.

[PDF] [Poster] [Project] [TF Code] [PyTorch Code] [Spotlight]

Learning End-to-End Lossy Image Compression: A Benchmark

A survey and benchmark of existing end-to-end learned image compression methods.

[arXiv] [Project]

Friends


Useful Links

NYC Experience: See my recommendation for visiting NYC.

Windy: A website to provide visualized numerical weather forecast results by, e.g. GFS, ECMWF.

DLVC: An open-sourced deep-learning driven video codec.

FVC-NIC: Open-Source learned image compression framework.

BDWM: The most popular & official Bulletin Board System (BBS) at Peking University.