Portrait
Xinliang Zhang
Ph.D. Student
Institute of Medical Technology, Peking University Health Science Center
About Me
I am a Ph.D. student at the Institute of Medical Technology, Peking University Health Science Center. My research focuses on Multimodal Large Language Models for efficient reasoning and comprehension. My interests include Multimodal Large Language Models, Weakly Supervised Image Segmentation, and Medical Image Analysis. I am supervised by Prof. Yanye Lu.
Education
  • Peking University
    Peking University
    Institute of Medical Technology, Peking University Health Science Center
    Ph.D. Student
    Sep. 2025 - Present
  • Tianjin University
    Tianjin University
    College of Intelligence and Computing
    M.S Student
    Sep. 2021 - Jul. 2024
  • Ocean University of China
    Ocean University of China
    School of Computer Science and Technology
    B.S. Student
    Sep. 2017 - Jul. 2021
Experience
  • Peking University
    Peking University
    Research Intern
    May. 2024 - Sep. 2025
Honors & Awards
  • Outstanding cadres of Tianjin University
    2024
  • Outstanding graduates of Shandong Province
    2021
  • China Underwater Robot Professional Contest - Second Prize
    2019
News
2026
我们的集群新增了BeeGFS文件系统,IO问题终于得以解决! BeeGFS搭建教程 压力测试
Jan 06
2025
Our paper is accepted by TNNLs(中科院一区Top)
Jun 14
Our paper is accepted by AAAI'25(CCF-A)
Mar 24
"恭喜我们的网站复活了" MILab
Mar 21
Our paper is accepted by ESWA(中科院一区Top)
Feb 01
2024
Our paper is accepted by AAAI'24(CCF-A)
Mar 24
Selected Publications (view all )
AdaTok: Adaptive Token Compression with Object-Aware Representations for Efficient Multimodal LLMs
AdaTok: Adaptive Token Compression with Object-Aware Representations for Efficient Multimodal LLMs

Xinliang Zhang, Lei Zhu, Hangzhou He, Shuang Zeng, Ourui Fu, Jiakui Hu, Zhengjian Yao, Yanye Lu

Submitted to CVPR'26 2025

In this study, we propose a object-level visual representation compression strategy for multimodal large language models.

AdaTok: Adaptive Token Compression with Object-Aware Representations for Efficient Multimodal LLMs

Xinliang Zhang, Lei Zhu, Hangzhou He, Shuang Zeng, Ourui Fu, Jiakui Hu, Zhengjian Yao, Yanye Lu

Submitted to CVPR'26 2025

In this study, we propose a object-level visual representation compression strategy for multimodal large language models.

Branches Mutual Promotion for End-to-End Weakly Supervised Semantic Segmentation
Branches Mutual Promotion for End-to-End Weakly Supervised Semantic Segmentation

Kaiwen Li, Hangzhou He, Shuang Zeng, Xinliang Zhang, Yuanwei Li, Lei Zhu, Yanye Lu

IEEE Transactions on Medical Imaging(TMI) 2025 中科院一区Top

In this paper, we introduce point annotations to fundus vessel segmentation and propose a novel method, called Points-based Vessel segmentation Network (PVN), to enhance the segmentation accuracy.

Branches Mutual Promotion for End-to-End Weakly Supervised Semantic Segmentation

Kaiwen Li, Hangzhou He, Shuang Zeng, Xinliang Zhang, Yuanwei Li, Lei Zhu, Yanye Lu

IEEE Transactions on Medical Imaging(TMI) 2025 中科院一区Top

In this paper, we introduce point annotations to fundus vessel segmentation and propose a novel method, called Points-based Vessel segmentation Network (PVN), to enhance the segmentation accuracy.

Branches Mutual Promotion for End-to-End Weakly Supervised Semantic Segmentation
Branches Mutual Promotion for End-to-End Weakly Supervised Semantic Segmentation

Lei Zhu, Xinliang Zhang, Hangzhou He, Qian Chen, Sha Li, Shuang Zeng, Yibao Zhang, Qiushi Ren, Yanye Lu

IEEE Transactions on Neural Networks and Learning Systems(TNNLS) 2025 中科院一区Top

Previous methods online-trained classification branch to provide pseudo annotations for supervising the segmentation branch, which makes the classification branch dominate the whole concurrent training process. We propose a bidirectional supervision mechanism to achieve mutual promotion for End2End weakly supervised semantic segmentation field.

Branches Mutual Promotion for End-to-End Weakly Supervised Semantic Segmentation

Lei Zhu, Xinliang Zhang, Hangzhou He, Qian Chen, Sha Li, Shuang Zeng, Yibao Zhang, Qiushi Ren, Yanye Lu

IEEE Transactions on Neural Networks and Learning Systems(TNNLS) 2025 中科院一区Top

Previous methods online-trained classification branch to provide pseudo annotations for supervising the segmentation branch, which makes the classification branch dominate the whole concurrent training process. We propose a bidirectional supervision mechanism to achieve mutual promotion for End2End weakly supervised semantic segmentation field.

V2C-CBM: Building Concept Bottlenecks with Vision-to-Concept Tokenizer
V2C-CBM: Building Concept Bottlenecks with Vision-to-Concept Tokenizer

Hangzhou He, Lei Zhu, Xinliang Zhang, Shuang Zeng, Qian Chen, Yanye Lu

Association for the Advancement of Artificial Intelligence (AAAI) 2025 Imageomics Oral

we adopt common words as base concept vocabulary and leverage auxiliary unlabeled images to construct a Vision-to-Concept (V2C) tokenizer that can explicitly quantize images into their most relevant visual concepts, thus creating a vision-oriented concept bottleneck tightly coupled with the multimodal model.

V2C-CBM: Building Concept Bottlenecks with Vision-to-Concept Tokenizer

Hangzhou He, Lei Zhu, Xinliang Zhang, Shuang Zeng, Qian Chen, Yanye Lu

Association for the Advancement of Artificial Intelligence (AAAI) 2025 Imageomics Oral

we adopt common words as base concept vocabulary and leverage auxiliary unlabeled images to construct a Vision-to-Concept (V2C) tokenizer that can explicitly quantize images into their most relevant visual concepts, thus creating a vision-oriented concept bottleneck tightly coupled with the multimodal model.

Generative learning-based lightweight MRI brain tumor segmentation with missing modalities
Generative learning-based lightweight MRI brain tumor segmentation with missing modalities

Xinliang Zhang, Qian Chen, Hangzhou He, Lei Zhu, Zhaoheng Xie, Yanye Lu

Expert Systems with Applications 2025 中科院一区Top

In this study, we propose a class-driven scribble promotion network, which utilizes both scribble annotations and pseudo-labels informed by image-level classes and global semantics for supervision.

Generative learning-based lightweight MRI brain tumor segmentation with missing modalities

Xinliang Zhang, Qian Chen, Hangzhou He, Lei Zhu, Zhaoheng Xie, Yanye Lu

Expert Systems with Applications 2025 中科院一区Top

In this study, we propose a class-driven scribble promotion network, which utilizes both scribble annotations and pseudo-labels informed by image-level classes and global semantics for supervision.

Scribble Hides Class: Promoting Scribble-Based Weakly-Supervised Semantic Segmentation with Its Class Label
Scribble Hides Class: Promoting Scribble-Based Weakly-Supervised Semantic Segmentation with Its Class Label

Xinliang Zhang, Lei Zhu, Hangzhou He, Lujia Jin, Yanye Lu

Association for the Advancement of Artificial Intelligence (AAAI) 2024 CCF-A

In this study, we propose a class-driven scribble promotion network, which utilizes both scribble annotations and pseudo-labels informed by image-level classes and global semantics for supervision.

Scribble Hides Class: Promoting Scribble-Based Weakly-Supervised Semantic Segmentation with Its Class Label

Xinliang Zhang, Lei Zhu, Hangzhou He, Lujia Jin, Yanye Lu

Association for the Advancement of Artificial Intelligence (AAAI) 2024 CCF-A

In this study, we propose a class-driven scribble promotion network, which utilizes both scribble annotations and pseudo-labels informed by image-level classes and global semantics for supervision.

All publications