Portrait of Yuhao Shen
Photo

Yuhao Shen (沈宇豪)

Hey, I'm Yuhao Shen, a direct PhD student at the College of Control Science and Engineering, Zhejiang University, advised by Prof. Cong Wang. I also received my B.E. degree from Zhejiang University.

My research lies in MLSys, LLM Inference, AI Infrastructure, and Edge Computing. Over the past two years, I have been deeply engaged in the field of speculative sampling and decoding. I was previously a research intern at Qwen Application and received an internship offer from the Tencent Hunyuan Qingyun Project. Currently, I am researching RL rollout acceleration in the Tongyi Qwen Foundation Model Infra group. Outside academia, I enjoy playing basketball, video games, and photography.

Recent News

Education

Zhejiang University logo Zhejiang University, Hangzhou, China
Direct Ph.D. in Control Science and Engineering, 2024 - 2029
Advisor: Cong Wang
Zhejiang University logo Zhejiang University, Hangzhou, China
Bachelor of Engineering in Control Science and Engineering, 2020 - 2024
Advisor: Cong Wang

Publications

ECHO illustration

ECHO: Elastic Speculative Decoding with Sparse Gating for High-Concurrency Scenarios

Xinyi Hu*, Yuhao Shen*, Baolin Zhang, Hengxin Zhang, Jun Dai, Shuang Ge, Lei Chen, Yue Li, Mingcheng Wan (Equal Contribution)

International Conference on Machine Learning (ICML) Spotlight, 2026

Double illustration

Double: Breaking the Acceleration Limit via Double Retrieval Speculative Parallelism

Yuhao Shen, Tianyu Liu, Junyi Shen, Jinyang Wu, Quan Kong, Huan Li, Cong Wang

ACL 2026 Best Paper Candidate

Spark illustration

Spark: Strategic Policy-Aware Exploration via Dynamic Branching for Long-Horizon Agentic Learning

Jinyang Wu, S Yang, C Yang, Yuhao Shen, S Zhang, Z Wen, J Tao

ACL 2026 Main Conference

Atlas illustration

Atlas: Orchestrating Heterogeneous Models and Tools for Multi-Domain Complex Reasoning

Jinyang Wu, G Zhai, R Jin, J Yuan, Yuhao Shen, S Zhang, Z Wen, J Tao

ACL 2026 Findings

ParallelVLM illustration

ParallelVLM: Lossless Video-LLM Acceleration with Visual Alignment Aware Parallel Speculative Decoding

Quan Kong*, Yuhao Shen*, Yicheng Ji, Huan Li, Cong Wang (Equal Contribution)

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2026

Hetero 2 Pipe illustration

Hetero2 Pipe: Pipelining Multi-DNN Inference on Heterogeneous Mobile Processors under Co-Execution Slowdown

Yuhao Shen, Z Wang, T Wang, C Gu, Z Wen, Y Shu, Cong Wang

IEEE 45th International Conference on Distributed Computing Systems (ICDCS), 2025

SENGraph illustration

SENGraph: A self-learning evolutionary and node-aware graph network for soft sensing in industrial processes

F Yan, Cong Wang, Z Wang, Yuhao Shen, C Yang

IEEE Transactions on Neural Networks and Learning Systems, 2024

Vision-TTT illustration

Vision-TTT: Efficient and Expressive Visual Representation Learning with Test-Time Training

Quan Kong, Yanru Xiao, Yuhao Shen, Cong Wang

arXiv (2026)

SSL illustration

SSL: Sweet Spot Learning for Differentiated Guidance in Agentic Optimization

Jinyang Wu, C Yang, Yuhao Shen, F Xu, B Ni, C Liao, Y Liu, H Wang, S Nie, S Zhang, et al.

arXiv (2026)

TALON illustration

TALON: Confidence-Aware Speculative Decoding with Adaptive Token Trees

Tianyu Liu, Q Lv, Yuhao Shen, X Sun, X Sun

arXiv (2026)

Profession

Qwen Foundation Model logo Research Intern, Qwen Foundation Model
May. 2026 - Present
Topic: Speculative Decoding, RL Infra
Advisor: Yucheng Li, Huiqiang Jiang
Hangzhou, China
Qwen Application logo Research Intern, Qwen Application
Nov. 2025 - May. 2026
Topic: Speculative Decoding, AI Infra
Advisor: Ye Shuang, Jun Dai, Lei Chen
Hangzhou, China