I am currently a PhD student at the University of Virginia, advised by Dr. Sheng Li. I work on Multimodal Intelligence.
Specifically, I am interested in:
- Visual understanding and reasoning with MLLMs (post-training, test-time scaling)
- Vision–Language alignment (VLMs, MLLMs)
- Multimodal LLM agents
I am also interested in applications of Generative AI and co-organized workshops on GenAI for
Photography and
Education at
WACV'26 and
NeurIPS'24.
News: I'm currently seeking full-time industry opportunities. Feel free to reach out if you're interested.
Experience
Research Scientist Intern, Adobe Research
San Jose, CA, May 2025 - Nov 2025
Research Scientist Intern, Adobe Research
San Jose, CA, May 2024 - Nov 2024
Applied Scientist Intern, Amazon Search
Palo Alto, CA, May 2023 - Aug 2023
Recipient
NeurIPS Scholar Award, 2025
ICLR Notable Reviewer, 2025
Organized Workshops
The First Workshop on Generative AI for Photography
WACV 2026
The First Workshop on Large Foundation Models for Educational Assessment
NeurIPS 2024
Service
Teaching
DS 4002: Data Science Project Course (Spring 2023)
CS 5012: Foundations of Computer Science (Fall 2022)
Misc
I am interested in photography. Some of them are available at
Instagram. I used to play the Auto Battler game and was among top 100 (ranked #52) best players nationwide. Here is my
homepage.
Links: