Research
His research focuses on learning a generalizable representation for visual intelligence, covering multiple application domains such as autonomous driving, medical imaging and aerial imaging. Some representative publications are listed below.
|
|
Universal Fine-Grained Visual Categorization by Concept Guided Learning
Qi Bi, Beichen Zhou, Wei Ji, Gui-Song Xia
IEEE Transactions on Image Processing (T-IP), 2025
code
dataset
Advancing fine-grained visual categorization (FGVC) from object-centric to scene-centric and adverse viewpoints; proposing a fine-grained land-cover dataset (FGLCD); feasible to multiple fine-grained categorization and detection tasks.
|
|
Samba: Severity-aware Recurrent Modeling for Cross-domain Medical Image Grading
Qi Bi, Jingjun Yi, Hao Zheng, Wei Ji, Haolan Zhan, Yawen Huang, Yuexiang Li, Yefeng Zheng
Annual Conference on Neural Information Processing Systems (NeurIPS), 2024
code
A Severity-aware Recurrent Modeling method, dubbed as Samba, is proposed for general disease grading within- and cross-domain medical images on three modalities.
|
|
Learning Frequency-Adapted Vision Foundation Model for Domain Generalized Semantic Segmentation
Qi Bi, Jingjun Yi, Hao Zheng, Haolan Zhan, Yawen Huang, Wei Ji, Yuexiang Li, Yefeng Zheng
Annual Conference on Neural Information Processing Systems (NeurIPS), 2024
code
A frequency-aware parameter-efficient fine-tuning method, dubbed as FADA, is proposed for domain generalized semantic segmentation. It is feasible to various vision foundation model.
|
|
Learning Content-enhanced Mask Transformer for Domain Generalized Urban-scene Segmentation
Qi Bi, Shaodi You, Theo Gevers
Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), 2024
code
Learning domain generalized scene segmentation by content-enhanced mask attention mechanism.
|
|
Learning Generalized Segmentation for Foggy-Scenes by Bi-directional Wavelet Guidance
Qi Bi, Shaodi You, Theo Gevers
Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), 2024
code
Learning scene segmentation that can be generalized to arbitrary unseen foggy target domains from only a clear source domain; the first work for this task.
|
|
Learning Generalized Medical Image Segmentation from Decoupled Feature Queries
Qi Bi, Jingjun Yi, Hao Zheng, Wei Ji, Yawen Huang, Yuexiang Li, Yefeng Zheng
Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), 2024
code
Learning domain generalized medical image segmentation by querying from decoupled features; the first work to leverage Vision Transformer for domain generalized medical image segmentation.
|
|
Interactive Learning of Intrinsic and Extrinsic Properties for All-day Semantic Segmentation
Qi Bi, Shaodi You, Theo Gevers
IEEE Transactions on Image Processing (T-IP), 2023
code
dataset
Learning robust scene semantic segmentation under all-day scenarios; proposing the first all-day semantic segmentation dataset All-day CityScapes.
|
|
Segment anything is not always perfect: An investigation of sam on different real-world applications
Wei Ji, Jingjing Li, Qi Bi, Wenbo Li, Li Cheng
CVPR 1st workshop on Vision-based InduStrial InspectiON, 2023
Best paper award
code
Benchmarking Segment Anything (SAM) on multiple real-world scenarios.
|
|
Promoting Saliency From Depth: Deep Unsupervised RGB-D Saliency Detection
Wei Ji, Jingjing Li, Qi Bi, Chuan Guo, Jie Liu, Li Cheng
International Conference on Learning Representations (ICLR), 2022
code
Learning deep unsupervised RGB-D saliency detection, by engaging depth information to improve pseudo-labels in the training process.
|
|
All Grains, One Scheme (AGOS): Learning Multi-grain Instance Representation for Aerial Scene Classification
Qi Bi, Beichen Zhou, Kun Qin, Qinghao Ye, Gui-Song Xia
IEEE Transactions on Geoscience and Remote Sensing (T-GRS), 2022
code
Extending deep multiple instance learning into a multi-grain framework while maintaining the same semantic scheme, dubbed as AGOS; learning discriminative aerial scene representation by AGOS.
|
|
Label-efficient Hybrid-supervised Learning for Medical Image Segmentation
Junwen Pan*, Qi Bi*, Yanzhan Yang, Pengfei Zhu, Cheng Bian
* : equal contribution
Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), 2022
Learning weakly semi-supervised medical image segmentation by the proposed dynamic instance indicator and dynamic co-regularization framework.
|
|
Local semantic enhanced convnet for aerial scene recognition
Qi Bi, Kun Qin, Han Zhang, Gui-Song Xia
IEEE Transactions on Image Processing (T-IP), 2021
code
Learning aerial scene representation by modeling context-aware class peak response.
|
|
Joint semantic mining for weakly supervised RGB-D salient object detection
Jingjing Li, Wei Ji, Qi Bi, Cheng Yan, Miao Zhang, Yongri Piao, Huchuan Lu
Advances in Neural Information Processing Systems (NeurIPS), 2021
code
dataset
Learning weakly-supervised RGB-D salient object detection (SOD) from the image, depth map and image caption; proposing a dataset for caption based SOD dubbed as CapS.
|
|
Local-global dual perception based deep multiple instance learning for retinal disease classification
Qi Bi, Shuang Yu, Wei Ji, Cheng Bian, Lijun Gong, Hanruo Liu, Kai Ma, Yefeng Zheng
Medical Image Computing and Computer Assisted Intervention (MICCAI), 2021
MICCAI2021 travel awards
MICCAI2021 young scientist award candidate
Learning retinal diseases from fundus images by local-global representation.
|
|
Learning calibrated medical image segmentation via multi-rater agreement modeling
Wei Ji, Shuang Yu, Junde Wu, Kai Ma, Cheng Bian, Qi Bi, Jingjing Li, Hanruo Liu, Li Cheng, Yefeng Zheng
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021
Best paper candidate
code
Learning medical image segmentation from multiple annotations by multi-rater modeling.
|
|
A multiple-instance densely-connected ConvNet for aerial scene classification
Qi Bi, Kun Qin, Zhili Li, Han Zhang, Kai Xu, Gui-Song Xia
IEEE Transactions on Image Processing (T-IP), 2020
code
Modeling discriminative aerial scene representation by deep multiple instance learning.
|
Supervision
Noud Corten, Improved Road Crack Severity Measurement Using Deep Convolutional Networks by Storing Spatial Information, November 2021-August 2022 (completed).
Carlo Airaghi, Multi-Stage Multiscale Training Architecture for Semantic Segmentation of Remote Sensing Images, April 2021- December 2021 (completed).
Silvan Murre, Layout2Land: Semi-Supervised Learning of a Layout and Style Reconfigurable GAN, March 2021-June 2021 (completed).
|
Teaching
2025 Computer Vision 2 (UvA, Lecturer)
2024 Vision & Autonomous Robotics (UvA, Lecturer)
2024 Computer Vision 1 (UvA, Lecturer)
2024 Computer Vision 2 (UvA, Teaching Assistant)
2023 Computer Vision 1 (UvA, Teaching Assistant)
2023 Computer Vision 2 (UvA, Teaching Assistant)
2022 Computer Vision 1 (UvA, Teaching Assistant)
2021 Computer Vision 1 (UvA, Teaching Assistant)
2020 Computer Vision 1 (UvA, Teaching Assistant)
|
|