I am a final-year master student in Tsinghua University, advised by Prof. Yangdong Deng. Before that, I received my bachelor degree from Wuhan University in June 2022. Currently, I am interested in efficient pre-training and inference for Large Language Model(LLM), data efficiency, etc. I am committed to build software and algorithms that really work in practice. Outside of research, I enjoy running, swimming and books on history and sociology. Email  /  Google Scholar  /  Github  /  Zhihu |
|
|
|
CQIL: Inference Latency Optimization with Concurrent Computation of Quasi-Independent Layers
Longwei Zou, Qingyang Wang Han Zhao Jiangang Kong Yi Yang Yangdong Deng ACL, 2024 github / arXiv Parallelize the structure of LLM to reduce the inference latency. |
|
A Multi-Level Framework for Accelerating Training Transformer Models
Longwei Zou, Han Zhang Yangdong Deng ICLR, 2024 github / arXiv Accelerate transformer training through a series of smaller models. |
Design and source code from Jon Barron's and Tianxiang Sun's websites. |