I am a Ph.D. student at Department of Electronic Engineering, The Chinese University of Hong Kong, advised by Prof. Hongliang Ren. I collaborate closely with Dr. Long Bai at Alibaba DAMO Academy, Prof. Huxin Gao at our MEDICAL MECHATRONICS Lab and Prof. Jinlin Wu at Hong Kong Institute of Science & Innovation, Chinese Academy of Sciences. Previously, I received the M. Sc. degree in Department of Electronic Engineering from The Chinese University of Hong Kong in 2023, advised by Prof. Hongliang Ren. I also received my Bachelor degree in Automation at College of Mechatronics and Control Engineering, Shenzhen University, advised by Prof. Xiaopin Zhong.

My research interests include computer vision and its applications in medical image analysis and surgical robotic perception. I recently work on large vision-language models and downstream surgical robot control.

🔥 News

📝 Selected Publications [Find full list at Google Scholar]

† indicates equal contribution; * represents the corresponding authors.

  • EndoChat: Grounded Multimodal Large Language Model for Endoscopic Surgery
    Guankun Wang†, Long Bai†, Junyi Wang†, Kun Yuan†, Zhen Li, Tianxu Jiang, Xiting He, Jinlin Wu, Zhen Chen, Zhen Lei, Hongbin Liu, Jiazheng Wang, Fan Zhang, Nicolas Padoy, Nassir Navab, Hongliang Ren*
    Medical Image Analysis (MedIA), 2025.

  • Rethinking data imbalance in class incremental surgical instrument segmentation
    Shifang Zhao, Long Bai, Kun Yuan, Feng Li, Jieming Yu, Wenzhen Dong, Guankun Wang, Mobarak Islam Hoque, Nicolas Padoy, Nassir Navab, Hongliang Ren*
    Medical Image Analysis (MedIA), 2025.

  • CoPESD: A Multi-Level Surgical Motion Dataset for Training Large Vision-Language Models to Co-Pilot Endoscopic Submucosal Dissection
    Guankun Wang†, Han Xiao†, Huxin Gao, Renrui Zhang, Long Bai, Xiaoxiao Yang, Zhen Li, Hongsheng Li, Hongliang Ren
    Proceedings of the 33rd ACM International Conference on Multimedia (ACM Multimedia 2025)

  • EndoVLA: Dual-Phase Vision-Language-Action Model for Autonomous Tracking in Endoscopy
    Chi Kit Ng†, Long Bai†, Guankun Wang†, Yupeng Wang, Huxin Gao, Kun Yuan, Chenhan Jin, Tieyong Zeng, Hongliang Ren*
    The Conference on Robot Learning (CoRL) 2025

  • SurgTPGS: Semantic 3D Surgical Scene Understanding with Text Promptable Gaussian Splatting
    Yiming Huang†, Long Bai†, Beilei Cui†, Kun Yuan, Guankun Wang, Mobarakol Islam, Nicolas Padoy, Nassir Navab, Hongliang Ren*
    International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI 2025)

  • Multimodal Graph Representation Learning for Robust Surgical Workflow Recognition with Adversarial Feature Disentanglement
    Long Bai†, Boyi Ma†, Ruohan Wang, Guankun Wang, Beilei Cui, Zhongliang Jiang, Mobarakol Islam, Zhe Min, Jiewen Lai, Nassir Navab, Hongliang Ren*
    Information Fusion (IF), 2025.

  • EndoARSS: Adapting Spatially-Aware Foundation Model for Efficient Activity Recognition and Semantic Segmentation in Endoscopic Surgery
    Guankun Wang†, Rui Tang†, Mengya Xu†, Long Bai, Huxin Gao, Hongliang Ren*
    Advanced Intelligent Systems (AIS), 2025.

  • STAR: Empowering Semi-Supervised Medical Image Segmentation with SAM-based Teacher-Student Architecture and Contrastive Consistency Regularization
    Qiwei Liang, Rulin Zhou, Yijing Zhou, Guankun Wang, Peng Peng, Xiaopin Zhong*
    Expert Systems with Applications (ESAP), 2025.

  • Surgical-VQLA++: Adversarial contrastive learning for calibrated robust visual question-localized answering in robotic surgery
    Long Bai†, Guankun Wang†, Mobarakol Islam†, Lalithkumar Seenivasan, An Wang, Hongliang Ren*
    Information Fusion (IF), 2024.

  • ETSM: Automating Dissection Trajectory Suggestion and Confidence Map-Based Safety Margin Prediction for Robot-assisted Endoscopic Submucosal Dissection
    Mengya Xu†, Wenjin Mo†, Guankun Wang†, Huxin Gao, An Wang, Long Bai, Chaoyang Lyu, Xiaoxiao Yang, Zhen Li, Hongliang Ren*
    IEEE International Conference on Robotics and Automation (ICRA 2025). (CCF-B)

  • PDZSeg: Adapting the Foundation Model for Dissection Zone Segmentation with Visual Prompts in Robot-assisted Endoscopic Submucosal Dissection
    Mengya Xu, Wenjin Mo, Guankun Wang, Huxin Gao, An Wang, Zhen Li, Xiaoxiao Yang, Hongliang Ren*
    International Conference on Information Processing in Computer-Assisted Interventions (IPCAI 2025).

  • TSUBF-Net: Trans-spatial UNet-like network with Bi-direction fusion for segmentation of adenoid hypertrophy in CT
    Rulin Zhou, Yingjie Feng, Guankun Wang, Xiaopin Zhong, Zongze Wu, Qiang Wu* & Xi Zhang
    Neural Computing and Applications (NCAP), 2025.

  • Towards Open-Set Surgical Activity Recognition in Robot-assisted Surgery
    Long Bai†, Guankun Wang†, Jie Wang, Xiaoxiao Yang, Huxin Gao, Xin Liang, An Wang, Mobarakol Islam, Hongliang Ren*
    IEEE International Conference on Robotics and Automation (ICRA 2024). (CCF-B)

  • Uncertainty-aware Out-of-distribution Detection in Capsule Endoscopy Diagnosis
    Qiaozhi Tan†, Long Bai†, Guankun Wang†, Mobarakol Islam, Hongliang Ren*
    IEEE International Symposium on Biomedical Imaging (ISBI 2024).

  • Rethinking Exemplars for Continual Semantic Segmentation in Endoscopy Scenes: Entropy-based Mini-Batch Pseudo-Replay
    Guankun Wang†, Long Bai†, Yanan Wu, Tong Chen, Hongliang Ren*
    Computers in Biology and Medicine (CBM), 2023.

🎖 Selected Awards

  • 2025.10 First Place Award at ACM MM 2025 Workshop
  • 2025.10 Best Poster Award at IROS 2025 Workshop
  • 2025.10 MICCAI Young Scientist Award (5/3677)
  • 2025.06 The IHU Strasbourg and NDI Bench to Bedside Award: Honorable Mention
  • 2025.05 MRC Symposium Best Application Award
  • 2024.05 Best Poster Award of ICRA 2024 C4SR+ Workshop
  • 2023.11 HAMEN FAN SCHOLARSHIP for M.Sc.in ELECTRONIC ENGINEERING
  • 2023.11 Dean’s List 2022-2023
  • 2023.07 Best Poster Award at ICRA 2023 Workshop
  • 2022.10 Department Admission Scholarship
  • 2022.07 Outstanding Graduate Student

đź“– Educations

  • 2024.08 - 2028.07, Ph.D., Electronic Engineering, The Chinese University of Hong Kong, Hong Kong SAR, China
  • 2022.09 - 2023.07, M.Sc., Electronic Engineering, The Chinese University of Hong Kong, Hong Kong SAR, China
  • 2018.09 - 2022.07, B.Eng., Automation, Shenzhen University, Shenzhen, China

đź’» Work Experience

  • 2023.09 - 2024.07, [Junior Research Assistant] Department of Electronic Engineering, The Chinese University of Hong Kong, Hong Kong. Supervised by Prof. Hongliang Ren.
  • 2021.07 - 2021.12, [Application Engineer Intern] Research and Development Department, DAS Intelligence, Shenzhen, China.

đź’¬ Professional Services

  • Regular Journal and Conference Reviewer: Information Fusion, IEEE Transactions on Circuits and Systems for Video Technology, Scientific Reports, IJCARS, International Journal of Machine Learning and Cybernetics, Image and Vision Computing, ICRA, MICCAI, ISBI, IPCAI
  • Conference Session Chair: ACM MM 2025, ICBIR 2025, ICIA 2025