Penghui Yang 杨鹏辉

Penghui Yang is currently a PhD student at the College of Computing and Data Science, Nanyang Technological University, supervised by Prof. Bo An. He received his B.Sc. degree in Computer Science from Nanjing University of Aeronautics and Astronautics in 2023, advised by Prof. Sheng-Jun Huang. Previously, he collaborated closely with Dr. Ming-Kun Xie and Prof. Lei Feng, and he is currently working closely with Dr. Cunxiao Du.

Google Scholar | DBLP

News
  • [May 2025] One paper was accepted by KDD'25.

  • [Jul 2023] One paper was accepted by ICCV'23.

Research Highlights

My research focuses on efficient, scalable methods for accelerating and compressing AI models. My work spans from knowledge distillation for computer vision to my current focus on speculative decoding for LLM inference, aiming to make AI more practical and widely accessible.

I also explore AI for Science by developing software-in-the-loop frameworks. By integrating physics-based simulators (like CALPHAD and DFT) with AI decision loops, I aim to build autonomous systems that are both efficient and scientifically reliable, bridging the gap between neural heuristics and trustworthy discovery.

If you are interested in collaboration, feel free to get in touch with me.

Publications ( show selected / show all by date / show all by topic )

Topics: Speculative Decoding / Knowledge Distillation / AI for Science / Others
*: co-first author, †: corresponding author

MATAI: A Generalist Machine Learning Framework for Property Prediction and Inverse Design of Advanced Alloys
Yanchen Deng*, Chendong Zhao*, Yixuan Li*, Bijun Tang, Xinrun Wang, Zhonghan Zhang, Yuhao Lu, Penghui Yang, Jianguo Huang, Yushan Xiao, Cuntai Guan, Zheng Liu, Bo An

arXiv 2025 Paper

AutoMAT: A Hierarchical Framework for Autonomous Alloy Discovery
Penghui Yang*, Chendong Zhao*, Bijun Tang, Zhonghan Zhang, Xinrun Wang, Yanchen Deng, Yuhao Lu, Cuntai Guan, Zheng Liu, Bo An

arXiv 2025 Paper

LongSpec: Long-Context Lossless Speculative Decoding with Efficient Drafting and Verification
Penghui Yang*†, Cunxiao Du*†, Fengzhuo Zhang, Haonan Wang, Tianyu Pang, Chao Du, Bo An

ES-FoMo ICML 2025 Paper | Project Page | Code

Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs
Sailor2 Team

arXiv 2025 Paper | Project Page | Code

Dual-Head Knowledge Distillation: Enhancing Logits Utilization with an Auxiliary Head
Penghui Yang, Chen-Chen Zong, Sheng-Jun Huang, Lei Feng, Bo An

KDD 2025 Paper | Code | Poster

FedDLAD: A Federated Learning Dual-Layer Anomaly Detection Framework for Enhancing Resilience Against Backdoor Attacks
Binbin Ding, Penghui Yang, Sheng-Jun Huang

IJCAI 2025 Paper | Code

A Unified Open Adapter for Open-World Noisy Label Learning: Data-Centric and Learning-Based Insights
Chen-Chen Zong, Peng-Hui Yang, Ming-Kun Xie, Sheng-Jun Huang

TCSVT 2025 Paper | Code

Multi-Label Knowledge Distillation
Penghui Yang*, Ming-Kun Xie*, Chen-Chen Zong, Lei Feng, Gang Niu, Masashi Sugiyama, Sheng-Jun Huang

ICCV 2023 Paper | Code | Poster