Zehong Ma(马泽红)’s Homepage

I am now a third-year Ph.D. student in VMC group under the supervision of Professor Shiliang Zhang at Peking University, Beijing, China.

My research interests are computer vision and multimodal representation learning, including open-vocabulary recognition, multimodal large language model, and image/video generation.

Educations

  • Aug. 2018-Jul. 2022 B.E. in School of Software, Northwestern Polytechnical University.
  • Aug. 2022-now. Ph.D. in the School of Computer Science, Peking University, China.

Publications

  • Efficient Multi-modal Long Context Learning for Training-free Adaptation (ICML25) [pdf]
  • Multi-modal Reference Learning for Fine-grained Text-to-Image Retrieval (TMM25) [pdf]
  • OVMR: Open-Vocabulary Recognition with Multi-Modal References (CVPR24) [pdf]
  • Text-Guided Visual Feature Refinement for Text-Based Person Search (ICMR21 Oral) [pdf]

Honors and Awards

  • 2021. China National Scholarship
  • 2020. Outstanding Student Model of Northwestern Polytechnical University (The highest honor for undergraduates, with only 20 students in the university)
  • 2020. China National Scholarship
  • 2019. China National Scholarship
  • 2020. National Champion of China Robotics Competition Basketball Robot Project(Autonomy&Challenge)
  • 2020. National First Prize of DJI Robomaster Competition
  • 2019. National Third Runner-up of China Robotics Competition Advanced Vision Project(3D Measurement)

Projects