youpeng.zhao [at] ucf [dot] edu
Brief Bio
I am a CS PhD candidate at the University of Central Florida, advised by Prof. Jun Wang.
My research interests are Foundation Models (FMs) and Machine Learning Systems (MLSys).
Before starting my PhD, I worked as a research staff member at Samsung Research China from 2021 to 2022. I obtained my M.Sc. in ECE from Georgia Tech in 2020
and B.Eng. in Automation from Wuhan University in 2018.
- 08/2022 - Present: Graduate Research Assistant at the University of Central Florida.
I work on designing efficient machine learning systems.
- 05/2024 - 08/2024: Intern at d-Matrix.
I worked on system performance modeling and optimization for diffusion transformer (DiT) models.
- 03/2021 - 08/2022: Research Staff Member at Samsung Research China (Beijing).
I worked on efficient deep learning algorithms for Samsung mobile devices.
Publications
- ALISE: Accelerating Large Language Model Serving with Speculative Scheduling.
Youpeng Zhao, Jun Wang.
IEEE/ACM ICCAD 2024. [Arxiv] [Slides]
- ALISA: Accelerating Large Language Model Inference via Sparsity-Aware KV Caching.
Youpeng Zhao, Di Wu, Jun Wang.
IEEE/ACM ISCA 2024. [Arxiv] [Poster] [Slides] [Video]
- MeRino: Entropy-driven Design for Generative Language Models on IoT Devices.
Youpeng Zhao, Ming Lin, Huadong Tang, Qiang Wu, Jun Wang.
Technical Report. [Arxiv]
- Parameter Efficient Vision Transformer with Linear Attention.
Youpeng Zhao, Huadong Tang, Yingying Jiang, Yong A, Qiang Wu, Jun Wang.
IEEE ICIP 2023. [Paper] [Arxiv] [Video]
Fun Facts
- I used to play the electronic keyboard and won a national prize in 2008 🎹
- My first name is Youpeng (有朋), which in Chinese means having friends from all over the world 🤓
- My English name is Kenneth, but people usually call me Ken or Kenny 😎