Youpeng Zhao
赵有朋
Brief Bio
I am a PhD candidate at University of Central Florida, under the advisory of Prof. Jun Wang.
My research interests lie in the intersection of machine learning and computer systems, e.g. machine learning systems (MLSys).
My goal is to leverage algorithm-system co-design to build efficient and reliable systems for emerging AI applications, such as large language models (LLMs).
- 08/2022 - Present: Graduate Research Assistant at University of Central Florida.
I work on designing efficient and sustainable machine learning systems.
- 05/2024 - 08/2024: Research Intern at d-Matrix.
I worked on system performance modeling and optimization for video generation models.
- 03/2021 - 08/2022: Research Staff Member at Samsung Researh China.
I worked on efficient deep learning algorithms for Samsung mobile devices.
Selected Publications
- MeRino: Entropy-driven Design for Generative Language Models on IoT Devices.
Youpeng Zhao, Ming Lin, Huadong Tang, Qiang Wu, Jun Wang.
AAAI 2025. [Arxiv] [Poster]
- ALISE: Accelerating Large Language Model Serving with Speculative Scheduling.
Youpeng Zhao, Jun Wang.
IEEE/ACM ICCAD 2024. [Arxiv] [Slides]
- ALISA: Accelerating Large Language Model Inference via Sparsity-Aware KV Caching.
Youpeng Zhao, Di Wu, Jun Wang.
IEEE/ACM ISCA 2024. [Arxiv] [Poster] [Slides] [Video]
Fun Facts
- I used to play some elecctronic keyboard and won a national prize in 2008 🎹
- My first name is Youpeng (有朋), which in Chinese, means having friends from all over the world 🤓
- My English name is Kenneth, but people usually call me Ken or Kenny 😎