I am currently a Research Computing Scientist at a stealth startup, where I work on large language models (LLMs) and model compression techniques. Previously, I was a postdoctoral associate in the Department of Computer Science at the University of Pittsburgh, where I worked with Professor Junyu Liu, Professor Peyman Givi, and Professor Juan Jose Mendoza Arenas.
My cat, Xiang, is a Domestic Shorthair. He is very curious and playful: he enjoys playing with his toys and chasing laser pointers. He also loves to nap in the sun and cuddle with me. He looks very academic for a cat :)
I received my Ph.D. in Computer Science from Purdue University in August 2024, under the supervision of Professor Xuehai Qian. Prior to that, I earned my bachelor's degree from Tsinghua University in 2018, where I worked with Professor Tianling Ren and Professor Shouyi Yin.
Before transferring to Purdue in Fall 2022, I spent four years (2018 to 2022) in the Viterbi School of Engineering at the University of Southern California as a Ph.D. student.
My current research focuses on efficient algorithms and systems for large language models, with particular emphasis on model compression techniques that enable powerful AI models to be deployed on resource-constrained devices while maintaining performance.