Zehong Ma(马泽红)’s Homepage
I am now a third-year Ph.D. student in VMC group under the supervision of Professor Shiliang Zhang at Peking University, Beijing, China.
My research interests are computer vision and multimodal representation learning, including open-vocabulary recognition, multimodal large language model, and image/video generation.
Educations
- Aug. 2018-Jul. 2022 B.E. in School of Software, Northwestern Polytechnical University.
- Aug. 2022-now. Ph.D. in the School of Computer Science, Peking University, China.
Publications
- Efficient Multi-modal Long Context Learning for Training-free Adaptation (ICML25) [pdf]
- Multi-modal Reference Learning for Fine-grained Text-to-Image Retrieval (TMM25) [pdf]
- OVMR: Open-Vocabulary Recognition with Multi-Modal References (CVPR24) [pdf]
- Text-Guided Visual Feature Refinement for Text-Based Person Search (ICMR21 Oral) [pdf]
Honors and Awards
- 2021. China National Scholarship
- 2020. Outstanding Student Model of Northwestern Polytechnical University (The highest honor for undergraduates, with only 20 students in the university)
- 2020. China National Scholarship
- 2019. China National Scholarship
- 2020. National Champion of China Robotics Competition Basketball Robot Project(Autonomy&Challenge)
- 2020. National First Prize of DJI Robomaster Competition
- 2019. National Third Runner-up of China Robotics Competition Advanced Vision Project(3D Measurement)