I am currently a PhD student at the University of Virginia, advised by Dr. Sheng Li. Prior to that, I received my bachelor's degree in Electronic Engineering from Southeast University. I am interested in multimodal large language models and vision-language models. I co-organized the Workshop on Generative AI for Photography at WACV 2026 and the Workshop on Large Foundation Models for Educational Assessment at NeurIPS 2024.
I have also spent time at Amazon Search Science & AI and Adobe Research as a Research Scientist Intern.
A full list of publications is available on Google Scholar.
The First Workshop on Large Foundation Models for Educational Assessment
At Neural Information Processing Systems (NeurIPS) 2024
In Proceedings of Machine Learning Research (PMLR), Volume 264, 2024
Website / Preface
The First Workshop on Generative AI for Photography
At Winter Conference on Applications of Computer Vision (WACV) 2026
Website
A Survey of Trustworthy Representation Learning across Domains
ACM Transactions on Knowledge Discovery from Data (TKDD) 2024
Paper
Video-LMM Post-Training: A Deep Dive into Video Reasoning with Large Multimodal Models
In preparation for IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) 2025
The Clever Hans Mirage: A Comprehensive Survey on Spurious Correlations in Machine Learning
Submitted to Transactions on Machine Learning Research (TMLR) 2025
Learning to Understand Multi-image Aesthetics
In preparation for Computer Vision and Pattern Recognition (CVPR) 2026
Generalizing (CLIP) to Unseen Domains with Text-guided Augmentation
European Conference on Computer Vision (ECCV) 2024
Paper
Tag-grounded Visual Instruction Tuning with Retrieval Augmentation
Main Conference of Empirical Methods in Natural Language Processing (EMNLP) 2024
Paper
Revealing an Overlooked Challenge in Class-Incremental Graph Learning
Transactions on Machine Learning Research (TMLR) 2024
Paper
No Free Lunch: Retrieval-Augmented Generation Undermines Fairness in LLMs, Even for Vigilant Users
Findings of Empirical Methods in Natural Language Processing (EMNLP) 2025
Paper
VRMDiff: Text-Guided Video Referring Matting Generation of Diffusion
In preparation for International Conference on Learning Representations (ICLR) 2026
Paper
Words to Graphics: Leveraging LLMs for Scientific Scalable Vector Graphics Generation
Preprint 2025
Reviewer / Program Committee Member
Neural Information Processing Systems (NeurIPS)
International Conference on Learning Representations (ICLR)
International Conference on Machine Learning (ICML)
Computer Vision and Pattern Recognition (CVPR)
Transactions on Machine Learning Research (TMLR)
Organizer / Program Chair
WACV 2026 Workshop on Generative AI for Photography
NeurIPS 2024 Workshop on Large Foundation Models for Educational Assessment
Recipient
Notable Reviewer, ICLR 2025
DS 4002: Data Science Project Course (Spring 2023)
CS 5012: Foundations of Computer Science (Fall 2022)
I am interested in photography. Some of my photos are available on Instagram and my website. I used to play the Auto Battler game and ranked #52 nationwide, among the top 100 players.
The Americans - Jack Kerouac
Manifesto of Surrealism - André Breton
More is Different - P.W. Anderson