I go by `SiriusNEO` on most websites. And I usually use `Chaos`
as a nickname (a variant of my real name). (Github | Zhihu |
Scholar)
Research
My research interest lies in building practical, scalable and efficient systems for
machine learning (MLSys).
My work spans the full stack of ML systems, including algorithm-system co-design (Twilight),
serving system (ParrotServe)
, and deep learning compiler (Core maintainer @TileLang,
contributors @TVM).
My long-term research goal is to make powerful AI accessible to everyone.
Currently, I'm working on the following topics. If you share the same interest with me, please don't hesitate to
email me and we can schedule a discussion.
Deep Learning Compiler. I'm now interested in the following topics: Multi-backend design in DL
compilers, Host Codegen / Host DSL, efficient auto-tuning system and Agent-friendly compiler design.
Efficient Attention/System for LLM Agents, specially on memory(context) management problems in
single ReAct agent w/ sub-agents architecture.
Beside the research, I also enjoy crafting some educational materials for helping people learn DSLs, some of my
works include: Triton-Puzzles-Lite, TileLang-Puzzles. (total 1k+ stars)
Publications
(* decates equivalent contributions, † decates the corresponding author)
Twilight: Adaptive Attention Sparsity with Hierarchical Top-p Pruning
Chaofan Lin, Jiaming Tang, Shuo Yang, Hanshuo Wang, Tian Tang,
Boyu Tian, Ion Stoica, Song Han, Mingyu Gao†
The Thirty-Ninth Annual Conference on Neural Information Processing Systems (NeurIPS'25 Spotlight)
Accelerating Large-Scale Reasoning Model Inference with Sparse Self-Speculative
Decoding
Yilong Zhao*, Jiaming Tang*, Kan Zhu, Zihao Ye, Chi-Chih Chang, Chaofan
Lin, Jongseok Park, Guangxuan Xiao, Mohamed S. Abdelfattah, Mingyu Gao, Baris Kasikci,
Song Han, Ion Stoica
Ninth Annual Conference on Machine Learning and Systems (MLSys'26)
[2024/6] Graduated from SJTU with Zhiyuan Outstanding Student Scholarship! My bachelor thesis
`Large Language Model Inference Serving System for Requests with Long System Prompts`
is awarded with Best Bachelor Thesis (Top 1%) in SJTU!
[2024/3] Parrot is accepted by OSDI'24! Code and paper are both released.
[2023/9] I will be joining IIIS, Tsinghua as a PhD student starting 24Fall.