Brief Bio
I am a 4th-year PhD candidate at the University of Central Florida, advised by Prof. Jun Wang.
I received my Master of Science degree from Georgia Tech and my Bachelor of Engineering degree from Wuhan University.
I have also worked as research intern at Microsoft Research
and Google AI Research.
I am one of the cohorts for ML and System Rising Stars 2025.
My research interests lie in the intersections of machine learning and computer systems (MLSys).
I build efficient and resilient machine learning systems for large-scale AI models.
Preprint
-
TIDE: Efficient and Lossless MoE Diffusion LLM Inference with I/O-aware Expert Offload
Zhiben Chen*, Youpeng Zhao*, Yang Sui, Jun Wang, Yuzhang Shang
Preprint
[Arxiv]
[Project]
[Code]
(*: equal contribution)
Selected Publications
-
GhostServe: A Lightweight Checkpointing System in the Shadow for Fault-Tolerant LLM Serving
Shakya Jayakody*, Youpeng Zhao*†, Chinmay Nehate, Jun Wang
MLSys 2026
[Arxiv]
[Code]
(*: equal contribution , †: project lead)
-
MeRino: Entropy-driven Design for Generative Language Models on IoT Devices
Youpeng Zhao, Ming Lin, Huadong Tang, Qiang Wu, Jun Wang
AAAI 2025
[Arxiv]
-
ALISA: Accelerating Large Language Model Inference via Sparsity-Aware KV Caching
Youpeng Zhao, Di Wu, Jun Wang
ISCA 2024
[Arxiv]
Links
Fun Facts
- I used to play some elecctronic keyboard and won a national prize in 2008 🎹
- I played a bit college Esports (R6 Siege) and was the founding member of GT R6 Team in 2020 ⚔️
- My Chinese name is 有朋, which means having friends from all over the world 🤓
- My English name is Kenneth, but people usually call me Ken or Kenny 😎