Personal Homepage

Hello, I'm Zhenhuan Wei.

My current research focuses on AI Infra, training and inference optimization for large language models, with a particular emphasis on distributed systems.

I will soon join Alibaba T-Head, where I will work on framework adaptation and performance optimization for large language models on domestic PPU chips.

Previously, I interned at ByteDance in the Data-AML team, where I contributed to the research and development of a large model framework serving ByteDance's internal search and advertising businesses. Before that, I interned at Tencent IEG, focusing on inference optimization for a TTS large model.

Feel free to reach out: zhenhuan2002@163.com

Visit My Blog