News
1 July 2025
One paper is accepted by ACM MM 2025
28 June 2025
Two papers are accepted by ICCV 2025
1 April 2025
Two papers are accepted by NAACL 2025
10 February 2025
One paper is accepted by TOMM
7 March 2025
One paper is accepted by TOMM
4 February 2025
One paper is accepted by ICML 2025
16 January 2025
One paper is accepted by TCSVT
1 January 2025
One paper is accepted by ICASSP 2025
24 December 2024
One paper is accepted by TMM
15 November 2024
One paper is accepted by TOMM
25 October 2024
One paper is accepted by NeurIPS 2024 (Spotlight)
15 October 2024
One paper is accepted by MMM 2024
14 October 2024
One paper is accepted by PRCV 2024
15 September 2024
One paper is accepted by BMCV 2024
15 July 2024
One paper is accepted by ACM MM 2024
7 July 2024
One paper is accepted by TIP
30 May 2024
One paper is accepted by ICMR 2024
15 February 2024
One paper is accepted by ECCV 2024
15 January 2024
One paper is accepted by AAAI 2024
15 September 2023
One paper is accepted by BMVC 2023
15 July 2023
One paper is accepted by ACM MM 2023
15 June 2023
One paper is accepted by CVPR 2023
15 January 2023
One paper is accepted by TOMM 2023
15 November 2022
One paper is accepted by TIP 2022
10 October 2022
Three papers is accepted by ACM MM 2022
22 April 2022
One paper is accepted by TCSVT 2022
25 February 2022
One paper is accepted by ISEEIE 2022
18 October 2021
One paper is accepted by Journal of Physics
10 October 2021
One paper is accepted by PRICAI 2021
3 July 2020
One paper is accepted by ECCV 2020
10 May 2019
One paper is accepted by IJCAI 2019
10 September 2019
One paper is accepted by TOMM
10 October 2018
One paper is accepted by ACM MM 2018
![]() |
Shuo Wang
Lab for Data Science
Email: shuowangcv@ustc.edu.cn |
Hello, I’m Shuo Wang! I am currently an Associate Research Fellow, School of Information Science and Technology,
University of Science and Technology of China (USTC), China. His research interests mainly include machine learning and multimedia data analysis, such as large-scale multimedia indexing and retrieval, multimedia data embedding, and video understanding.
Selected Publications
![]() |
Accelerating Diffusion Transformer via Error-Optimized Cache
Junxiang Qiu, Shuo Wang*, Jinda Lu, Lin Liu, Houcheng Jiang, Yanbin Hao ACM MM, 2025 |
![]() |
Accelerating Diffusion Transformer via Gradient-Optimized Cache
Junxiang Qiu, Lin Liu, Shuo Wang*, Jinda Lu, Kezhou Chen, Yanbin Hao ICCV, 2025 |
![]() |
Dynamic Multimodal Prototype Learning in Vision-Language Models
Xingyu Zhu, Shuo Wang*, Beier Zhu, Miaoge Li, Yunfan Li, Junfeng Fang, Zhicai Wang, Dongsheng Wang, Hanwang Zhang ICCV, 2025 |
![]() |
Symmetric Hallucination with Knowledge Transfer for Few-shot Learning
Shuo Wang, Xinyu Zhang, Meng Wang, Xiangnan He IEEE Transactions on Multimedia, 2024 |
![]() |
Enhancing Zero-Shot Vision Models by Label-Free Prompt Distribution Learning and Bias Correcting
Xingyu Zhu, Beier Zhu, Yi Tan, Shuo Wang*, Yanbin Hao, Hanwang Zhang NeurIPS (Spotlight), 2024 |
![]() |
Selective Vision-Language Subspace Projection for Few-shot CLIP
Xingyu Zhu, Beier Zhu, Yi Tan, Shuo Wang*, Yanbin Hao, Hanwang Zhang ACM MM, 2024 |
![]() |
Feature Mixture on Pre-Trained Model for Few-Shot Learning
Shuo Wang, Jinda Lu, Haiyang Xu, Yanbin Hao, Xiangnan He IEEE Transactions on Image Processing, 2024 |
![]() |
Boosting Few-Shot Learning via Attentive Feature Regularization
Xingyu Zhu, Shuo Wang*, Jinda Lu, Yanbin Hao, Haifeng Liu, Xiangnan He AAAI, 2024 |
![]() |
Semantic-based Selection, Synthesis, and Supervision for Few-shot Learning
Jinda Lu, Shuo Wang*, Xinyu Zhang, Yanbin Hao, Xiangnan He* ACM MM, 2023 |
![]() |
Spatio-Temporal Collaborative Module for Efficient Action Recognition
Yanbin Hao, Shuo Wang*, Yi Tan, Xiangnan He, Zhenguang Liu, Meng Wang IEEE Transactions on Image Processing, 2022 |
![]() |
Multi-directional Knowledge Transfer for Few-shot Learning
Shuo Wang, Xinyu Zhang, Yanbin Hao, Chengbing Wang, Xiangnan He ACM MM, 2022 |
![]() |
Attention in Attention: Modeling Context Correlation for Efficient Video Classification
Yanbin Hao, Shuo Wang*, Pei Cao, Xinjian Gao, Tong Xu, Jinmeng Wu, Xiangnan He IEEE Transactions on Circuits and Systems for Video Technology, 2022 |
![]() |
Large-scale Few-shot Learning via Multi-modal Knowledge Discovery
Shuo Wang, Jun Yue, Jianzhuang Liu, Qi Tian, Meng Wang ECCV, 2020 |
![]() |
Connectionist Temporal Fusion for Sign Language Translation
Shuo Wang, Dan Guo, Wengang Zhou, Zhengjun Zha, Meng Wang ACM MM, 2018 |
![]() |
Method and Apparatus for Training Classifier
Shuo Wang, Jun Yue, Jianzhuang Liu, Qi Tian US Patent App. 17/892,908, 2023 |
Other Publications (Five years)
![]() |
Interventional Feature Generation for Few-shot Learning
Shuo Wang, Jinda Lu, Huixia Ben, Yanbin Hao, Xingyu Gao, Meng Wang ACM Transactions on Multimedia Computing, Communications and Applications 2025, 2025 |
![]() |
Multimodal Generation with Consistency Transferring
Junxiang Qiu, Jinda Lu, Shuo Wang Findings of the Association for Computational Linguistics: NAACL 2025, 2025 |
![]() |
Mixture of Multimodal Adapters for Sentiment Analysis
Kezhou Chen, Huixia Ben, Shuo Wang, Shengeng Tang, Yanbin Hao Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2025 |
![]() |
Linguistics-Vision Monotonic Consistent Network for Sign Language Production
Xu Wang, Shengeng Tang, Peipei Song, Shuo Wang, Dan Guo, Richang Hong International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025 |
![]() |
Gloss-Driven Conditional Diffusion Models for Sign Language Production
Shengeng Tang, Feng Xue, Jingjing Wu, Shuo Wang, Richang Hong ACM Transactions on Multimedia Computing, Communications and Applications, 2025 |
![]() |
DAMO: Data-and Model-aware Alignment of Multi-modal LLMs
Jinda Lu, Junkang Wu, Jinghan Li, Xiaojun Jia, Shuo Wang YiFan Zhang, Junfeng Fang, Xiang Wang, Xiangnan He ICML 2025, 2025 |
![]() |
Video Corpus Moment Retrieval with Query-specific Context Learning and Progressive Localization
Long Zhang, Peipei Song, Zhangling Duan, Shuo Wang, Xiaojun Chang, Xun Yang TCSVT 2025, 2025 |
![]() |
CVLP-NaVD: Contrastive Visual-Language Pre-training Models for Non-annotated Visual Description
Haoran Li, Yanbin Hao, Jiarui Yu, Bin Zhu, Shuo Wang, Tong Xu ACM Transactions on Multimedia Computing, Communications and Applications, 2024 |
![]() |
Pseudo Content Hallucination for Unpaired Image Captioning
Huixia Ben, Shuo Wang*, Meng Wang, Richang Hong Proceedings of the 2024 International Conference on Multimedia Retrieval, 2024 |
![]() |
Enhancing Recipe Retrieval with Foundation Models: A Data Augmentation Perspective
Fangzhou Song, Bin Zhu, Yanbin Hao, Shuo Wang European Conference on Computer Vision, 2024 |
![]() |
JPA: A Joint-Part Attention for Mitigating Overfocusing on 3D Human Pose Estimation
Dengqing Yang, Zhenhua Tang, Jinmeng Wu, Shuo Wang, Lechao Cheng, Yanbin Hao Chinese Conference on Pattern Recognition and Computer Vision (PRCV), 2024 |
![]() |
GLCM-Adapter: Global-Local Content Matching for Few-shot CLIP Adaptation
Shuo Wang, Enlong Xie, Jinda Lu, Jinghan Li, Yanbin Hao Proceedings of the 35th British Machine Vision Conference (BMVC), 2024 |
![]() |
Hierarchical Supervised Contrastive Learning for Multimodal Sentiment Analysis
Kezhou Chen, Shuo Wang*, Yanbin Hao International Conference on Multimedia Modeling, 2024 |
![]() |
How Can Contrastive Pre-training Benefit Audio-visual Segmentation? A Study from Supervised and Zero-shot Perspectives
Jiarui Yu, Haoran Li, Yanbin Hao, Jinmeng Wu, Tong Xu, Shuo Wang, Xiangnan He Proceedings of the 34th British Machine Vision Conference (BMVC), 2023 |
![]() |
Bi-directional Distribution Alignment for Transductive Zero-Shot Learning
Zhicai Wang, Yanbin Hao, Tingting Mu, Ouxiang Li, Shuo Wang, Xiangnan He Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023 |
![]() |
Boosting Hyperspectral Image Classification with Dual Hierarchical Learning
Shuo Wang, Huixia Ben, Yanbin Hao, Xiangnan He, Meng Wang ACM Transactions on Multimedia Computing, Communications and Applications, 2023 |
![]() |
Hierarchical Hourglass Convolutional Network for Efficient Video Classification
Yi Tan, Yanbin Hao, Hao Zhang, Shuo Wang, Xiangnan He Proceedings of the 30th ACM International Conference on Multimedia, 2022 |
![]() |
Parameterization of Cross-token Relations with Relative Positional Encoding for Vision MLP
Zhicai Wang, Yanbin Hao, Xingyu Gao, Hao Zhang, Shuo Wang, Tingting Mu, Xiangnan He ACM MM, 2022 |
![]() |
IPFC: An Attentive Face Completion Network with Identity Preserving
Xin Ni, Haiyong Xie, Yuyan Yang, Shuo Wang, Wenshan Wang, Yifeng Liu 2022 International Symposium on Electrical, Electronics and Information Engineering (ISEEIE), 2022 |
![]() |
Space-time Separate Modeling for Efficient Video Classification
Pei Cao, Shuo Wang*, Jinmeng Wu, Yanbin Hao Journal of Physics: Conference Series, 2021 |
![]() |
Thinking in Patch: Towards Generalizable Forgery Detection with Patch Transformation
Xueqi Zhang, Shuo Wang, Chenyu Liu, Min Zhang, Xiaohan Liu, Haiyong Xie PRICAI 2021: Trends in Artificial Intelligence: 18th Pacific Rim International Conference on Artificial Intelligence, PRICAI 2021, Hanoi, Vietnam, November 8–12, 2021, Proceedings, Part III 18, 2021 |
More Publications See Google Scholar
Grants
长三角科技创新共同体联合攻关项目 | 课题负责人兼课题参与人 | 2024.12-2027.11 |
国家自然科学基金青年科学基金项目(C类) | 项目负责人 | 2023.01-2024.12 |
安徽高校协同创新项目 | 联合牵头负责人兼课题负责人 | 2021.08-2023.08 |
JKW 国防科技创新项目 | 课题负责人 | 2020.12-2023.08 |
Professional Services
- CCF多媒体技术专业委员会执行委员
- 中国中文信息学会社会媒体处理专业委员会(SMP)委员
Education and Experiences
Postdoc Research Fellow, University of Science and Technology of China Advisor: Prof. Xiangnan He, Mar. 2021 - Mar. 2023 |
Hefei University of Technology (HFUT) Ph.D. Student of Signal and Information Processing Sep. 2015 - Jan. 2021, Hefei, Anhui, China Advisor: Prof. Meng Wang |
Hefei University of Technology (HFUT) Bachelor"s Degree in Electronics Engineering Sep. 2011 - Jun. 2015, Hefei, Anhui, China Advisor: Prof. Meng Wang |
Last update: 19 Aug. 2025. The webpage template borrows from Xiangnan He.