Youpeng Zhao
赵有朋
Brief Bio
I am a PhD candidate at the University of Central Florida, advised by Prof. Jun Wang.
My research interests are machine learning systems (MLSys) and large-scale foundation models.
I build efficient, scalable, and reliable systems for emerging AI applications like large language models (LLMs).
News
- 09/2025: Received a research grant from Lambda. Thanks Lambda!
- 08/2025: I joined Google as a student researcher.
- 06/2025: One paper accepted by ASAP 2025. Congrats Shakya!
- 05/2025: I joined Microsoft Research as a research intern.
- 03/2025: Selected as a cohort of MLSys Rising Stars 2025. Thanks MLCommons!
Selected Papers
- MeRino: Entropy-driven Design for Generative Language Models on IoT Devices.
Youpeng Zhao, Ming Lin, Huadong Tang, Qiang Wu, Jun Wang.
AAAI 2025. [Arxiv] [Poster]
- ALISE: Accelerating Large Language Model Serving with Speculative Scheduling.
Youpeng Zhao, Jun Wang.
ICCAD 2024. [Arxiv] [Slides]
- ALISA: Accelerating Large Language Model Inference via Sparsity-Aware KV Caching.
Youpeng Zhao, Di Wu, Jun Wang.
ISCA 2024. [Arxiv] [Poster] [Slides] [Video]
Professional Services
- Conference Reviewer: NeurIPS, ICLR, ICML, AISTATS, ACCV, WACV.
- Journal Reviewer: IEEE TC, ACM TOIT, IEEE TSUSC.
- Program Committee: AAAI, MLSys (AE), EuroSys (Shadow PC).
Fun Facts
- I used to play some elecctronic keyboard and won a national prize in 2008 🎹
- My first name is Youpeng (有朋), which in Chinese, means having friends from all over the world 🤓
- My English name is Kenneth, but people usually call me Ken or Kenny 😎