Youpeng Zhao
赵有朋
Brief Bio
I am a CS PhD candidate at University of Central Florida, advised by Jun Wang.
My research focuses on novel hardware-software co-design methodologies to build reliable, scalabale and susatinable systems for emerging AI applications, such as large language models (LLMs).
I am open to collaborations with both industry and academic researchers (preferrably within U.S.). Email if interested!
News
Selected Publications
- MeRino: Entropy-driven Design for Generative Language Models on IoT Devices.
Youpeng Zhao, Ming Lin, Huadong Tang, Qiang Wu, Jun Wang.
AAAI 2025. [Arxiv] [Poster]
- ALISE: Accelerating Large Language Model Serving with Speculative Scheduling.
Youpeng Zhao, Jun Wang.
ICCAD 2024. [Arxiv] [Slides]
- ALISA: Accelerating Large Language Model Inference via Sparsity-Aware KV Caching.
Youpeng Zhao, Di Wu, Jun Wang.
ISCA 2024. [Arxiv] [Poster] [Slides] [Video]
Professional Services
- Conference Reviewer: NeurIPS, ICML, ICLR, ACCV, WACV.
- Journal Reviewer: IEEE TC, ACM TOIT, IEEE TSUSC.
- Program Committee: AAAI, MLSys, EuroSys.
Fun Facts
- I used to play some elecctronic keyboard and won a national prize in 2008 🎹
- My first name is Youpeng (有朋), which in Chinese, means having friends from all over the world 🤓
- My English name is Kenneth, but people usually call me Ken or Kenny 😎