Shuo Wang 

Lab for Data Science
Department of Electronic Engineering and Information Science
University of Science and Technology of China

Email: shuowangcv@ustc.edu.cn


Hello, I’m Shuo Wang! I am currently an Associate Research Fellow at the School of Information Science and Technology, University of Science and Technology of China (USTC), China. My research interests mainly include machine learning and multimedia data analysis, such as large-scale multimedia indexing and retrieval, multimedia data embedding, and video understanding.

Selected Publications


Accelerating Diffusion Transformer via Error-Optimized Cache
Junxiang Qiu, Shuo Wang*, Jinda Lu, Lin Liu, Houcheng Jiang, Yanbin Hao
ACM MM, 2025  

Accelerating Diffusion Transformer via Gradient-Optimized Cache
Junxiang Qiu, Lin Liu, Shuo Wang*, Jinda Lu, Kezhou Chen, Yanbin Hao
ICCV, 2025  

Dynamic Multimodal Prototype Learning in Vision-Language Models
Xingyu Zhu, Shuo Wang*, Beier Zhu, Miaoge Li, Yunfan Li, Junfeng Fang, Zhicai Wang, Dongsheng Wang, Hanwang Zhang
ICCV, 2025  

Symmetric Hallucination with Knowledge Transfer for Few-shot Learning
Shuo Wang, Xinyu Zhang, Meng Wang, Xiangnan He
IEEE Transactions on Multimedia, 2024  

Enhancing Zero-Shot Vision Models by Label-Free Prompt Distribution Learning and Bias Correcting
Xingyu Zhu, Beier Zhu, Yi Tan, Shuo Wang*, Yanbin Hao, Hanwang Zhang
NeurIPS (Spotlight), 2024  

Selective Vision-Language Subspace Projection for Few-shot CLIP
Xingyu Zhu, Beier Zhu, Yi Tan, Shuo Wang*, Yanbin Hao, Hanwang Zhang
ACM MM, 2024  

Feature Mixture on Pre-Trained Model for Few-Shot Learning
Shuo Wang, Jinda Lu, Haiyang Xu, Yanbin Hao, Xiangnan He
IEEE Transactions on Image Processing, 2024  

Boosting Few-Shot Learning via Attentive Feature Regularization
Xingyu Zhu, Shuo Wang*, Jinda Lu, Yanbin Hao, Haifeng Liu, Xiangnan He
AAAI, 2024  

Semantic-based Selection, Synthesis, and Supervision for Few-shot Learning
Jinda Lu, Shuo Wang*, Xinyu Zhang, Yanbin Hao, Xiangnan He*
ACM MM, 2023  

Spatio-Temporal Collaborative Module for Efficient Action Recognition
Yanbin Hao, Shuo Wang*, Yi Tan, Xiangnan He, Zhenguang Liu, Meng Wang
IEEE Transactions on Image Processing, 2022  

Multi-directional Knowledge Transfer for Few-shot Learning
Shuo Wang, Xinyu Zhang, Yanbin Hao, Chengbing Wang, Xiangnan He
ACM MM, 2022  

Attention in Attention: Modeling Context Correlation for Efficient Video Classification
Yanbin Hao, Shuo Wang*, Pei Cao, Xinjian Gao, Tong Xu, Jinmeng Wu, Xiangnan He
IEEE Transactions on Circuits and Systems for Video Technology, 2022  

Large-scale Few-shot Learning via Multi-modal Knowledge Discovery
Shuo Wang, Jun Yue, Jianzhuang Liu, Qi Tian, Meng Wang
ECCV, 2020  

Connectionist Temporal Fusion for Sign Language Translation
Shuo Wang, Dan Guo, Wengang Zhou, Zhengjun Zha, Meng Wang
ACM MM, 2018  

Method and Apparatus for Training Classifier
Shuo Wang, Jun Yue, Jianzhuang Liu, Qi Tian
US Patent App. 17/892,908, 2023  

Other Publications (Past Five Years)


Interventional Feature Generation for Few-shot Learning
Shuo Wang, Jinda Lu, Huixia Ben, Yanbin Hao, Xingyu Gao, Meng Wang
ACM Transactions on Multimedia Computing, Communications and Applications, 2025  

Multimodal Generation with Consistency Transferring
Junxiang Qiu, Jinda Lu, Shuo Wang
Findings of the Association for Computational Linguistics: NAACL 2025, 2025  

Mixture of Multimodal Adapters for Sentiment Analysis
Kezhou Chen, Huixia Ben, Shuo Wang, Shengeng Tang, Yanbin Hao
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2025  

Linguistics-Vision Monotonic Consistent Network for Sign Language Production
Xu Wang, Shengeng Tang, Peipei Song, Shuo Wang, Dan Guo, Richang Hong
International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025  

Gloss-Driven Conditional Diffusion Models for Sign Language Production
Shengeng Tang, Feng Xue, Jingjing Wu, Shuo Wang, Richang Hong
ACM Transactions on Multimedia Computing, Communications and Applications, 2025  

DAMO: Data- and Model-aware Alignment of Multi-modal LLMs
Jinda Lu, Junkang Wu, Jinghan Li, Xiaojun Jia, Shuo Wang, YiFan Zhang, Junfeng Fang, Xiang Wang, Xiangnan He
ICML, 2025  

Video Corpus Moment Retrieval with Query-specific Context Learning and Progressive Localization
Long Zhang, Peipei Song, Zhangling Duan, Shuo Wang, Xiaojun Chang, Xun Yang
IEEE Transactions on Circuits and Systems for Video Technology, 2025  

CVLP-NaVD: Contrastive Visual-Language Pre-training Models for Non-annotated Visual Description
Haoran Li, Yanbin Hao, Jiarui Yu, Bin Zhu, Shuo Wang, Tong Xu
ACM Transactions on Multimedia Computing, Communications and Applications, 2024  

Pseudo Content Hallucination for Unpaired Image Captioning
Huixia Ben, Shuo Wang*, Meng Wang, Richang Hong
Proceedings of the 2024 International Conference on Multimedia Retrieval, 2024  

Enhancing Recipe Retrieval with Foundation Models: A Data Augmentation Perspective
Fangzhou Song, Bin Zhu, Yanbin Hao, Shuo Wang
European Conference on Computer Vision, 2024  

JPA: A Joint-Part Attention for Mitigating Overfocusing on 3D Human Pose Estimation
Dengqing Yang, Zhenhua Tang, Jinmeng Wu, Shuo Wang, Lechao Cheng, Yanbin Hao
Chinese Conference on Pattern Recognition and Computer Vision (PRCV), 2024  

GLCM-Adapter: Global-Local Content Matching for Few-shot CLIP Adaptation
Shuo Wang, Enlong Xie, Jinda Lu, Jinghan Li, Yanbin Hao
Proceedings of the 35th British Machine Vision Conference (BMVC), 2024  

Hierarchical Supervised Contrastive Learning for Multimodal Sentiment Analysis
Kezhou Chen, Shuo Wang*, Yanbin Hao
International Conference on Multimedia Modeling, 2024  

How Can Contrastive Pre-training Benefit Audio-visual Segmentation? A Study from Supervised and Zero-shot Perspectives
Jiarui Yu, Haoran Li, Yanbin Hao, Jinmeng Wu, Tong Xu, Shuo Wang, Xiangnan He
Proceedings of the 34th British Machine Vision Conference (BMVC), 2023  

Bi-directional Distribution Alignment for Transductive Zero-Shot Learning
Zhicai Wang, Yanbin Hao, Tingting Mu, Ouxiang Li, Shuo Wang, Xiangnan He
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023  

Boosting Hyperspectral Image Classification with Dual Hierarchical Learning
Shuo Wang, Huixia Ben, Yanbin Hao, Xiangnan He, Meng Wang
ACM Transactions on Multimedia Computing, Communications and Applications, 2023  

Hierarchical Hourglass Convolutional Network for Efficient Video Classification
Yi Tan, Yanbin Hao, Hao Zhang, Shuo Wang, Xiangnan He
Proceedings of the 30th ACM International Conference on Multimedia, 2022  

Parameterization of Cross-token Relations with Relative Positional Encoding for Vision MLP
Zhicai Wang, Yanbin Hao, Xingyu Gao, Hao Zhang, Shuo Wang, Tingting Mu, Xiangnan He
ACM MM, 2022  

IPFC: An Attentive Face Completion Network with Identity Preserving
Xin Ni, Haiyong Xie, Yuyan Yang, Shuo Wang, Wenshan Wang, Yifeng Liu
2022 International Symposium on Electrical, Electronics and Information Engineering (ISEEIE), 2022  

Space-time Separate Modeling for Efficient Video Classification
Pei Cao, Shuo Wang*, Jinmeng Wu, Yanbin Hao
Journal of Physics: Conference Series, 2021  

Thinking in Patch: Towards Generalizable Forgery Detection with Patch Transformation
Xueqi Zhang, Shuo Wang, Chenyu Liu, Min Zhang, Xiaohan Liu, Haiyong Xie
18th Pacific Rim International Conference on Artificial Intelligence (PRICAI), Hanoi, Vietnam, November 8–12, 2021, Proceedings, Part III, 2021  

For more publications, see Google Scholar.


Grants

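Yangtze River Delta Science and Technology Innovation Community Joint Research Project, Sub-project Leader and Sub-project Participant, Dec. 2024 - Nov. 2027
National Natural Science Foundation of China, Young Scientists Fund (Category C), Principal Investigator, Jan. 2023 - Dec. 2024
Anhui University Collaborative Innovation Project, Co-lead and Sub-project Leader, Aug. 2021 - Aug. 2023
JKW National Defense Science and Technology Innovation Project, Sub-project Leader, Dec. 2020 - Aug. 2023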

Professional Services

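  • Executive Member, CCF Technical Committee on Multimedia
  • Member, Technical Committee on Social Media Processing (SMP), Chinese Information Processing Society of China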

Education and Experiences

University of Science and Technology of China (USTC)
Postdoctoral Research Fellow      Mar. 2021 - Mar. 2023
Advisor: Prof. Xiangnan He
Hefei University of Technology (HFUT)
Ph.D. Student in Signal and Information Processing      Sep. 2015 - Jan. 2021, Hefei, Anhui, China
Advisor: Prof. Meng Wang
Hefei University of Technology (HFUT)
Bachelor's Degree in Electronics Engineering      Sep. 2011 - Jun. 2015, Hefei, Anhui, China
Advisor: Prof. Meng Wang

Last updated: 19 Aug. 2025. The webpage template is borrowed from Xiangnan He.