Publications
[Google Scholar]†: Equal contribution; *: Corresponding author
2026
-
VL-HTR: Learning Human-Target Representation From Vision-Language Model. TCYB, 2026.
-
TumorChain: Interleaved Multimodal Chain-of-Thought Reasoning for Traceable Clinical Tumor Analysis. ICLR, 2026.
2025
-
TransGOP-R: Transformer-based Real-World Gaze Object Prediction. TMM, 2025.
-
Frequency-Aware B-Line and Pleural Line Analysis in Lung Ultrasound Videos. JHBI, 2025.
-
Prompting Vision-Language Model for Nuclei Instance Segmentation and Classification. TMI, 2025.
2024
-
Pixel Distillation: Cost-flexible Distillation across Image Sizes and Heterogeneous Networks. TPAMI, 2024.
-
Progressive Adapting and Pruning: Domain-Incremental Learning for Saliency Prediction. ACM TOMM, 2024.
-
Contextual Dependency Vision Transformer for spectrogram-based multivariate time series analysis. Neurocomputing, 2024.
-
Boosting Medical Image-based Cancer Detection via Text-guided Supervision from Reports. arXiv preprint, 2024.
-
Position-based anchor optimization for point supervised dense nuclei detection. Neural Networks, 2024.
2023 & Earlier
-
Semantic-aware Knowledge Distillation with Parameter-free Feature Uniformization. Visual Intelligence, 2023.
-
Generalized Weakly Supervised Object Localization. TNNLS, 2022.
-
Strengthen learning tolerance for weakly supervised object localization. CVPR, 2021.
-
Eliminating indefiniteness of clinical spectrum for better screening COVID-19. JBHI, 2021.
-
Learning object detectors with semi-annotated weak labels. TCSVT, 2019.
-
Poseflow: A deep motion representation for understanding human behaviors in videos. CVPR, 2018.