youpeng.zhao [at] ucf [dot] edu
Brief Bio
I am a CS PhD student at University of Central Florida, under the advisory of Prof. Jun Wang.
My research interests are Large Language Models (LLMs) and Machine Learning Systems (MLSys).
Before starting my PhD, I worked as a research staff member at Samsung Research China from 2021 to 2022. I obtained my M.Sc. in ECE from Georgia Tech in 2020
and B.Eng. in Automation from Wuhan University in 2018.
- 05/2024 - 08/2024: Intern at d-Matrix.
I work on system performance modeling for Foundation Model inference.
- 08/2022 - Present: Graduate Research Assistant at University of Central Florida.
I work on designing efficient LLM systems.
- 03/2021 - 08/2022: Research Staff Member at Samsung Researh China (Beijing).
I work on efficient deep learning for mobile devices.
Publications
- ALISA: Accelerating Large Language Model Inference via Sparsity-Aware KV Caching.
Youpeng Zhao, Di Wu, Jun Wang.
IEEE/ACM ISCA 2024. [Arxiv] [Poster]
- MeRino: Entropy-driven Design for Generative Language Models on IoT Devices.
Youpeng Zhao, Ming Lin, Huadong Tang, Qiang Wu, Jun Wang.
Technical Report. [Arxiv]
- Parameter Efficient Vision Transformer with Linear Attention.
Youpeng Zhao, Huadong Tang, Yingying Jiang, Yong A, Qiang Wu, Jun Wang.
IEEE ICIP 2023. [Paper] [Arxiv] [Video]
- Class-Aware Contextual Information for Semantic Segmentation.
Huadong Tang, Youpeng Zhao, Yingying Jiang, Zhuoxin Gan, Qiang Wu.
IEEE ICASSP 2023. [Paper]
Fun Facts
- I used to play some elecctronic keyboard won a national prize in 2008.
- My first name is Youpeng (有朋), which in Chinese, means having friends from all over the world :)
- My English name is Kenneth, but people usually call me Ken.